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NOVEL HUMAN GENE RELATING TO RESPIRATORY DISEASES, 
OBESITY, AND INFLAMMATORY BOWEL DISEASE 
5 RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Application Serial 
Number 09/834,597, filed April 13, 2001, and International Application No. 
PCT/US01/12245, filed April 13, 2001. 
FIELD OF THE INVENTION 

10 This invention relates to genes identified from human chromosome 

20p13-p12, including Gene 216, which are associated with asthma, obesity, 
inflammatory bowel disease, and other human diseases. The invention also 
relates to the nucleotide sequences of these genes, including genomic DNA 
sequences, cDNA sequences, single nucleotide polymorphisms, alleles, and 

15 haplotypes. The invention further relates to isolated nucleic acids comprising 
these nucleotide sequences, and isolated polypeptides or peptides encoded 
thereby. Also related are expression vectors and host cells comprising the 
disclosed nucleic acids or fragments thereof, as well as antibodies that bind to 
the encoded polypeptides or peptides. The present invention further relates 

20 to Iigands that modulate the activity of the disclosed genes or gene products. 
In addition, the invention relates to diagnostics and therapeutics for various 
diseases, including asthma, utilizing the disclosed nucleic acids, polypeptides 
or peptides, antibodies, and/or Iigands. 
BACKGROUND 

25 Mouse chromosome 2 has been linked to a variety of disorders 

including airway hyperesponsiveness and obesity (DeSanctis et al., 1995, 
Nature Genetics, 11:150-154; Nagle et al., 1999, Nature, 398:148-152). This 
region of the mouse genome is homologous to portions of human chromosome 
20 including 20p13-p12. Although human chromosome 20p13-12p has been 

30 linked to a variety of genetic disorders including diabetes insipidus, 
neurohypophyseal, congenital endothelial dystrophy of cornea, insomnia, 
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neurodegeneration with brain iron accumulation 1 (Hallervorden-Spatz 
syndrome), fibrodysplasia ossificans progressiva, alagille syndrome, 
hydrometrocolpos (McKusick-Kaufman syndrome), Creutzfeldt-Jakob disease 
and Gerstmann-Straussler disease (see NCBI; National Center for 
5 Biotechnology Information, National Library of Medicine, 38A, 8N905, 8600 
Rockville Pike, Bethesda, MD 20894; on the world wide web at 
ncbi.nlm.nih.gov) the genes affecting these disorders have yet to be 
discovered. There is a need in the art for identifying specific genes relating to 
these disorders, as well as genes associated with obesity, lung disease, 

10 particularly, inflammatory lung disease phenotypes such as Chronic 
Obstructive Lung Disease (COPD), Adult Respiratory Distress Syndrome 
(ARDS), and asthma. Identification and characterization of such genes will 
make possible the development of effective diagnostics and therapeutic means 
to treat lung-related disorders. 

15 SUMMARY OF THE INVENTION 

This invention relates to Gene 216 located on human chromosome 
20p13-p12. In specific embodiments, the invention relates to isolated nucleic 
acids comprising Gene 216 genomic sequences (e.g., SEQ ID NO:5 and SEQ 
ID NO:6), cDNA sequences (e.g., SEQ ID NO:1 and SEQ ID NO:3), 

20 orthologous sequences (e.g., SEQ ID NO:364 and SEQ ID NO:365), 
complementary sequences, sequence variants, or fragments thereof, as 
described herein. The present invention also encompasses nucleic acid 
probes or primers useful for assaying a biological sample for the presence or 
expression of Gene 216. The invention further encompasses nucleic acids 

25 variants comprising alleles or haplotypes of single nucleotide polymorphisms 
(SNPs) identified in several genes, including Gene 216 (e.g., SEQ ID NO:241- 
288, and SEQ ID NO:373-420, and fragments thereof). Nucleic acid variants 
comprising SNP alleles or haplotypes can be used to diagnose diseases such 
as asthma, or to determine a genetic predisposition thereto. In addition, the 

30 present invention encompasses nucleic acids comprising alternate splicing 
variants (e.g., SEQ ID NO:2 and SEQ ID NO:350-362). 
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This invention also relates to vectors and host cells comprising vectors 
comprising the Gene 216 nucleic acid sequences disclosed herein. Such 
vectors can be used for nucleic acid preparations, including antisense nucleic 
acids, and for the expression of encoded polypeptides or peptides. Host cells 
5 can be prokaryotic or eukaryotic cells. In specific embodiments, an expression 
vector comprises a DNA sequence encoding the Gene 216 polypeptide 
sequence (e.g., SEQ ID NO:4 or SEQ ID NO:363), orthologous polypeptides 
(e.g., SEQ ID NO:366), sequence variants, or fragments thereof, as described 
herein. 

1 0 The present invention further relates to isolated Gene 216 polypeptides 

and peptides. In specific embodiments, the polypeptides or peptides comprise 
the amino acid sequence of the Gene 216 (e.g., SEQ ID NO:4 or SEQ ID 
NO:363), orthologous polypeptides (e.g., SEQ ID NO:366), sequence variants, 
or portions thereof, as described herein. In addition, this invention 

15 encompasses isolated fusion proteins comprising Gene 216 polypeptides or 
peptides. 

The present invention also relates to isolated antibodies, including 
monoclonal and polyclonal antibodies, and antibody fragments, that are 
specifically reactive with the Gene 216 polypeptides, fusion proteins, or 

20 variants, or portions thereof, as disclosed herein. In specific embodiments, 
monoclonal antibodies are prepared to be specifically reactive with the Gene 
216 polypeptide (e.g., SEQ ID NO:4 or SEQ ID NO:363), orthologous 
polypeptides (e.g., SEQ ID NO:366), or peptides, or sequence variants thereof. 
In addition, the present invention relates to methods of obtaining Gene 

25 216 polynucleotides and polypeptides, variant sequences, or fragments 
thereof, as disclosed herein. Also related are methods of obtaining anti-Gene 
216 antibodies and antibody fragments. The present invention also 
encompasses methods of obtaining Gene 216 ligands, e.g., agonists, 
antagonists, inhibitors, and binding factors. Such ligands can be used as 

30 therapeutics for asthma and related diseases. 

The present invention also relates to diagnostic methods and kits 
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utilizing Gene 216 (wild-type, mutant, or variant) nucleic acids, polypeptides, 
antibodies, or functional fragments thereof. Such factors can be used, for 
example, in diagnostic methods and kits for measuring expression levels of 
Gene 216, and to screen for various Gene 216-related diseases, especially 
5 asthma. In addition, the nucleic acids described herein can be used to identify 
chromosomal abnormalities affecting Gene 216, and to identify allelic variants 
or mutations of Gene 216 in an individual or population. 

The present invention further relates to methods and therapeutics for 
the treatment of various diseases, including asthma. In various embodiments, 

10 therapeutics comprising the disclosed Gene 216 nucleic acids, polypeptides, 
antibodies, ligands, or variants, derivatives, or portions thereof, are 
administered to a subject to treat, prevent, or ameliorate asthma. Specifically 
related are therapeutics comprising Gene 216 antisense nucleic acids, 
monoclonal antibodies, metalloprotease inhibitors, and gene therapy vectors. 

15 Such therapeutics can be administered alone, or in combination with one or 
more asthma treatments. 

In addition, this invention relates to non-human transgenic animals and 
cell lines comprising one or more of the disclosed Gene 216 nucleic acids, 
which can be used for drug screening, protein production, and other purposes. 

20 Also related are non-human knock-out animals and cell lines, wherein one or 
more endogenous Gene 216 genes (i.e., orthologs), or portions thereof, are 
deleted or replaced by marker genes. 

This invention further relates to methods of identifying proteins that are 
candidates for being involved in asthma (i.e., a "candidate protein"). Such 

25 proteins are identified by a method comprising: 1 ) identifying a protein in a first 
individual having the asthma phenotype; 2) identifying a protein in a second 
individual not having the asthma phenotype; and 3) comparing the protein of 
the first individual to the protein of the second individual, wherein a) the protein 
that is present in the second individual but not the first individual is the 

30 candidate protein; or b) the protein that is present in a higher amount in the 
second individual than in the first individual is the candidate protein; or c) the 
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protein that is present in a lower amount in the second individual than in the 
first individual is the candidate protein. 
BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts the LOD Plot of Linkage to Asthma. 
5 Figure 2 depicts the LOD Plot of Linkage to BHR (PC20 <=4 mg/ml) & 

Asthma. 

Figure 3 depicts the LOD Plot of Linkage to BHR (PC20 <=16 mg/ml) 
& Asthma 

Figure 4 depicts the LOD Plot of Linkage to High Total IgE & Asthma 
1 0 Figure 5 depicts the LOD Plot of Linkage to High Specific IgE & Asthma 

Figure 6 depicts the BAC/STS content contig map of human 
chromosome 20p13-p12. 

Figure 7 depicts the BAC1098L22 nucleotide sequence (SEQ ID NO:5). 

Figure 8 depicts the locations of single nucleotide polymorphisms, 
15 corresponding amino acid changes, and domains in the Gene 216 transcript. 
The exons of the transcript are marked from A to V and the size of each one 
is indicated. Above the exons, the 8 domains are labeled and a black bar 
represents the approximate location of each one. Underneath the black bars 
are the approximate location of the amino acid changes that have been 
20 identified. The amino acids boxed in white are the alleles that are most 
frequently observed. The nucleotides boxed in gray are the alleles that are 
most frequently observed. Single nucleotide polymorphisms are unboxed, and 
the polymorphism names appear underneath. The uterus cDNA clone does 
not contain all of Exon A, and does not contain the sequence CAG between 
25 Exon U and V. 

Figure 9 depicts alternate splice variants of Gene 216 obtained from 
lung tissue, including rt672 (SEQ ID NO:350), rt690 (SEQ ID NO:351), rt709 
(SEQ ID NO:352), rt71 1 (SEQ ID NO:353), rt713 (SEQ ID NO:354), and rt720 
(SEQ ID NO:355). 

30 Figure 10 depicts alternate splice variants of Gene 216 obtained from 

lung tissue, including rt725 (SEQ ID NO:356), rt727 (SEQ ID NO:357), rt733 



WO 02/083077 



PCT/US02/12063 



(SEQ ID NO:358), rt735 (SEQ ID NO:359), rt764 (SEQ ID NO:360), rt772 
(SEQ ID NO:361), and rt774 (SEQ ID NO:362). 

Figure 11 depicts the structure of the genomic sequence of Gene 216. 

Figure 12 depicts the alternate AG splice sequences at the junction of 
5 Intron UV and Exon V in Gene 216. 

Figure 13 depicts the promoter region of Gene 216. The Gene 216 
promoter sequence is shown in SEQ ID NO:8; the Gene 216 enhancer 
sequence is shown in SEQ ID NO:7. 

Figure 14 depicts a dendrogram of the ADAM family members and the 
1 0 relationship of Gene 21 6 to ADAMs that possesses an active meta I lo protease 
domain. 

Figure 15 depict Northern Blots illustrating Gene 216 expression 
patterns. 

Figure 16 depicts a Dot Blot that shows Gene 216 expression in various 
15 tissue types. 

Figure 17 depicts RT-PCR analysis of Gene 216 expression in primary 
cells from lung tissue. 

Figure 18 depicts an amino acid sequence alignment (Piieup) of 5 
ADAM family members that are closely related to Gene 216. Amino acids 
20 highlighted in black show 100% identity within the Piieup; dark gray show 80% 
identity; and light gray show 60% identity. The boxed amino acids represent 
the cysteine switch, the metalloprotease domain, and the "met-turn". The 
labeled arrows show the locations of the 8 domains. 

Figure 19 depicts the amino acid sequence of Gene 216 (SEQ ID 
25 NO:4). Labeled arrows above the sequence denote domain and corresponding 
length. Black boxes represent the signal sequence and the transmembrane 
domain identified by hydrophobicity plots. The underlined cysteine residue at 
position 1 33 is predicted to be involved in the cysteine switch, the dashed box 
represents the metalloprotease domain, and the methionine underlined twice 
30 is the "met-turn". The gray boxes represent the signaling binding sites 
identified in the cytoplasmic tail. The amino acid changes corresponding to 



WO 02/083077 



PCT/US02/12063 



single nucleotide polymorphisms are indicated in bold. The alanine deleted in 
the uterus cDNA clone is marked within a black triangle, and if present would 
have been between the glutamine and the aspartic acid. 

Figure 20 depicts the Kyte-Doolittle hydrophobicity plot for the Gene 
5 216 amino acid sequence. 

Figures 21 depicts the genomic sequence of the mouse ortholog of 
Gene 216 (SEQ ID NO:364). 

Figure 22 depicts the cDNA nucleotide sequence (SEQ ID NO:365) and 
predicted amino acid sequence (SEQ ID NO:366) of the mouse ortholog of 
10 Gene 216. 

Figure 23 depicts an amino acid sequence alignment (Pileup) of human 
Gene 216 polypeptide (SEQ ID NO:4) and the mouse ortholog of Gene 216 
(SEQ ID NO:366). Vertical lines indicate identical amino acid residues. Dots 
indicate similar amino acid residues. 

15 Figure 24 depicts the nucleotide sequence (SEQ ID NO:1) and encoded 

amino acid sequence (SEQ ID NO:4) determined from the master cDNA 
sequence of Gene 216. The master cDNA sequence combines the sequence 
information from the uterine cDNA clone and 5 s RACE clone. Identified single 
nucleotide polymorphism positions are underlined. 

20 Figure 25 depicts the results of a case control study p-value plot that 

shows single nucleotide polymorphism association with the asthma phenotype 
in the combined US and UK populations. 

Figure 26 depicts the results of a case control study p-value plot that 
shows single nucleotide polymorphism association with the asthma phenotype 

25 in the US and UK populations, separately. 

Figure 27 depicts the results of a case control study p-value plot that 
shows single nucleotide polymorphism association with the bronchial hyper- 
responsiveness and asthma phenotypes in the US and UK combined 
population. 

30 Figure 28 depicts the results of a case control study p-value plot that 

shows single nucleotide polymorphism association with the bronchial hyper- 
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responsiveness and asthma phenotypes in the US and UK populations, 
separately. 

Figure 29 depicts the genomic nucleotide sequence (SEQ ID NO:6) 
determined for Gene 216. Identified single nucleotide polymorphism positions 
5 are underlined. 

Figure 30 depicts the nucleotide sequence (SEQ ID NO:3) and encoded 
amino acid sequence (SEQ ID NO: 363) of Gene 216 determined from the 
uterus cDNA clone. Identified single nucleotide polymorphism positions are 
underlined. 

10 Figure 31 depicts the nucleotide sequence (SEQ ID NO:350) and 

encoded amino acid sequence (SEQ ID NO:337) of Gene 216 alternate splice 
variant rt672. 

Figure 32 depicts the nucleotide sequence (SEQ ID NO:351) and 
encoded amino acid sequence (SEQ ID NO:338) of Gene 216 alternate splice 
15 variant rt690. 

Figure 33 depicts the nucleotide sequence (SEQ ID NO:352) and 
encoded amino acid sequence (SEQ ID NO:339) of Gene 216 alternate splice 
variant rt709. 

Figure 34 depicts the nucleotide sequence (SEQ ID NO:353) and 
20 encoded amino acid sequence (SEQ ID NO:340) of Gene 216 alternate splice 
variant rt71 1 . 

Figure 35 depicts the nucleotide sequence (SEQ ID NO:354) and 
encoded amino acid sequence (SEQ ID NO:341) of Gene 216 alternate splice 
variant rt713. 

25 Figure 36 depicts the nucleotide sequence (SEQ ID NO:355) and 

encoded amino acid sequence (SEQ ID NO:342) of Gene 216 alternate splice 
variant rt720. 

Figure 37 depicts the nucleotide sequence (SEQ ID NO:356) and 
encoded amino acid sequence (SEQ ID NO:343) of Gene 216 alternate splice 
30 variant rt725. 

Figure 38 depicts the nucleotide sequence (SEQ ID NO:357) and 
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encoded amino acid sequence (SEQ ID NO:344) of Gene 216 alternate splice 
variant rt727. 

Figure 39 depicts the nucleotide sequence (SEQ ID NO:358) and 
encoded amino acid sequence (SEQ ID NO:345) of Gene 216 alternate splice 
5 variant rt733. 

Figure 40 depicts the nucleotide sequence (SEQ ID NO:359) and 
encoded amino acid sequence (SEQ ID NO:346) of Gene 216 alternate splice 
variant rt735. 

Figure 41 depicts the nucleotide sequence (SEQ ID NO:360) and 
10 encoded amino acid sequence (SEQ ID NO:347) of Gene 216 alternate splice 
variant rt764. 

Figure 42 depicts the nucleotide sequence (SEQ ID NO:361) and 
encoded amino acid sequence (SEQ ID NO:348) of Gene 216 alternate splice 
variant rt772. 

15 Figure 43 depicts the nucleotide sequence (SEQ ID NO:362) and 

encoded amino acid sequence (SEQ (D NO:349) of Gene 216 alternate splice 
variant rt774. 

DETAILED DESCRIPTION OF THE INVENTION 

Gene 216 was identified by extensive analysis of the region of human 
20 chromosome 20p1 3-p12 associated with airway hyperresponsiveness, asthma, 
and atopy. This region has also been implicated in other diseases such as 
obesity (Wilson, 1999, Arch. Intern. Med. 159:2513-4). Bronchial asthma, 
furthermore, has been linked to intestinal conditions such as inflammatory 
bowel disease (B. Wallaert et al. f 1995, J. Exp. Med. 182:1897-1904). Thus, 
25 there was a need to identify and isolate the gene(s) associated with this region 
of human chromosome 20. 
Definitions 

To aid in the understanding of the specification and claims, the following 
definitions are provided. 
30 "Disorder region" refers to a portion of the human chromosome 20 

bounded by the markers D20S502 and D20S851. A "disorder-associated" 
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nucleic acid or polypeptide sequence refers to a nucleic acid sequence that 
maps to region 20p13-p12 or the polypeptides encoded therein (e.g., Gene 
216 nucleic acids, and polypeptides). For nucleic acids, this encompasses 
sequences that are identical or complementary to the Gene 216 sequence, as 
5 well as sequence-conservative, function-conservative, and non-conservative 
variants thereof. For polypeptides, this encompasses sequences that are 
identical to the Gene 216 polypeptide, as well as function-conservative and 
non-conservative variants thereof. Included are naturally-occurring mutations 
of Gene 216 causative of respiratory diseases or obesity, such as but not 

10 limited to mutations which cause altered protein levels or stability (e.g., 
decreased levels, increased levels, expression in an inappropriate tissue type, 
increased stability, and decreased stability). 

As used herein, the "reference sequence" for Gene 216 is BAC1098L22 
(SEQ ID NO:5). The BAC1098L22 sequence is also the source of the 

1 5 disclosed Gene 21 6 genomic sequence (SEQ ID NO:6). 'Variant" sequences 
refer to nucleotide sequences (and the encoded amino acid sequences) that 
differ from the reference sequence at one or more positions. Non-limiting 
examples of variant sequences include the disclosed Gene 216 single 
nucleotide polymorphisms (SNPs), alternate splice variants, and the amino 

20 acid sequences encoded by these variants. 

The term "SNP" as used herein refers to a site in a nucleic acid 
sequence which contains a nucleotide polymorphism. In accordance with this 
invention, a SNP may comprise one of two possible "alleles". For example, 
SNP A-2 may comprise allele C or allele A (Table 10, below). Thus, a nucleic 

25 acid molecule comprising SNP A-2 may include a C or A at the polymorphic 
position. For a combination of SNPs, the term "haplotype" is used. As an 
example, the haplotype T/A is observed for SNP combination D1/ST+4 (Table 
21, below). Thus, T is present at the polymorphic position in SNP D1 and A is 
present at the polymorphic position in SNP ST+4. It should be noted that the 

30 haplotype representation "T/A" does not indicate "T or A". Instead, the 
haplotype representation "T/A" indicates that both the T allele and the A allele 
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are present at. their respective SNPs. In addition, the SNP representation 
"D1/ST+4" does not indicate "D1 or ST+4\ Rather, "D1/ST+4" indicates that 
both SNPs are present. In some instances, a specific allele or haplotype may 
be associated with susceptibility to a disease or condition of interest, e.g., 
5 asthma. In other instances, an allele or haplotype may be associated with a 
decrease in susceptibility to a disease or condition of interest, i.e., a protective 
sequence. For example, as described herein, the C allele of SNP V-1 
(Example 12) and the C/A haplotype of SNPs Q-1/ST+4 (Example 13) are 
associated with increased susceptibility to asthma, whereas the C/G haplotype 
10 of SNPs ST+4/V-3 (Example 13) is associated with a protective effect. 

"Sequence-conservative" variants are those in which a change of one 
or more nucleotides in a given codon position results in no alteration in the 
amino acid encoded at that position (i.e., silent mutations). "Function- 
conservative" variants are those in which a change in one or more nucleotides 
15 in a given codon position results in a polypeptide sequence in which a given 
amino acid residue in the polypeptide has been replaced by a conservative 
amino acid substitution as described in detail herein. "Function-conservative" 
variants also include analogs of a given polypeptide and any polypeptides that 
have the ability to elicit antibodies specific to a designated polypeptide. "Non- 
20 conservative" variants are those in which a change in one or more nucleotides 
in a given codon position results in a polypeptide sequence in which a given 
amino acid residue in a polypeptide has been replaced by a non-conservative 
amino acid substitution as described herein below. "Non-conservative" variants 
also include polypeptides comprising non-conservative amino acid 
25 substitutions. 

As used herein, the term "ortholog" denotes a gene or polypeptide 
obtained from one species that has homology to an analogous gene or 
polypeptide from a different species. The term "paralog" denotes a gene or 
polypeptide obtained from a given species that has homology to a distinct gene 
30 or polypeptide from that same species. For example, the disclosed mouse and 
human Gene 216 sequences are orthologs, whereas human Gene 216 and 
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human ADAM 19 are paralogs. 

"Nucleic acid or "polynucleotide" as used herein refers to purine- and 
pyrimidine-containing polymers of any length, either polyribonucleotides or 
polydeoxyribonucleotide or mixed polyribo-polydeoxyribonucleotides. This 
5 includes single-and double-stranded molecules, i.e., DNA-DNA, DNA-RNA and 
RNA-RNA hybrids, as well as "protein nucleic acids" (PNA) formed by 
conjugating bases to an amino acid backbone. This also includes nucleic acids 
containing modified bases. 

As used herein, "isolated" nucleic acids are nucleic acids separated 

10 away from other components (e.g., DIM A, RNA, and protein) with which they 
are associated (e.g., as obtained from cells, chemical synthesis systems, or 
phage or nucleic acid libraries). Isolated nucleic acids are at least 60% free, 
preferably 75% free, and most preferably 90% free from other associated 
components, in accordance with the present invention, isolated nucleic acids 

1 5 can be obtained by methods described herein, or other established methods, 
including isolation from natural sources (e.g., cells, tissues, or organs), 
chemical synthesis, recombinant methods, combinations of recombinant and 
chemical methods, and library screening methods. 

Nucleic acids referred to herein as "recombinant" are nucleic acids 

20 which have been produced by recombinant DNA methodology, including those 
nucleic acids that are generated by procedures which rely upon a method of 
artificial replication, such as the polymerase chain reaction (PCR) and/or 
cloning into a vector using restriction enzymes. Portions of recombinant 
nucleic acids which code for polypeptides can be identified and isolated by, for 

25 example, the method of M. Jasin et al., U.S. Patent No. 4,952,501 . 

A "coding sequence" or a "protein-coding sequence" is a polynucleotide 
sequence capable of being transcribed into mRNA and/or capable of being 
translated into a polypeptide or peptide. The boundaries of the coding 
sequence are typically determined by a translation start codon at the 5'- 

30 terminus and a translation stop codon at the 3'-terminus. 

A "complement" of a nucleic acid sequence as used herein refers to the 
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"antisense" sequence that participates in Watson-Crick base-pairing with the 

original sequence. 

A "probe" or "primer" refers to a nucleic acid or oligonucleotide that 

forms a hybrid structure with a sequence in a target region due to 
5 complementarily of the probe or primer sequence to at least one portion of the 

target region sequence. 

Nucleic acids are "hybrid izable" to each other when at least one strand 

of the nucleic acid can anneal to another nucleic acid strand under defined 

stringency conditions. Hybridization requires that the two nucleic acids contain 
10 substantially complementary sequences; depending on the stringency of 

hybridization, however, mismatches may be tolerated. The appropriate 

stringency for hybridizing nucleic acids depends on the length of the nucleic 

acids and the degree of complementarily, and can be determined in 

accordance with the methods described herein. 
15 As used herein, "portion" and "fragment" are synonymous. A "portion" 

as used with regard to a nucleic acid or polynucleotide, refers to fragments of 

that nucleic acid or polynucleotide. The fragments can range in size from 8 

nucleotides to all but one nucleotide of the entire Gene 216 sequence. 

Preferably, The fragments are at least 8 to 10 nucleotides in length; more 
20 preferably at least 12 nucleotides in length; still more preferably at least 15 to 

20 nucleotides in length; yet more preferably at least 25 nucleotides in length; 

and most preferably at least 35 to 55 nucleotides in length. 

"cDNA" refers to complementary or copy DNA produced from an RNA 

template by the action of RNA-dependent DNA polymerase (reverse 
25 transcriptase). Thus, a "cDNA clone" means a duplex DNA sequence 

complementary to an RNA molecule of interest, included in a cloning vector or 

PCR amplified. This term includes genes from which the intervening 

sequences have been removed. 

"Cloning" refers to the use of recombination techniques to insert a 
30 particular gene or other DNA sequence into a vector molecule. In order to 

successfully clone a desired gene, it is necessary to use methods for 
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generating DNA fragments, for joining the fragments to vector molecules, for 
introducing the composite DNA molecule into a host cell in which it can 
replicate, and for selecting the clone having the target gene from amongst the 
recipient host cells. 

5 "cDNA library" refers to a collection of recombinant DNA molecules 

containing cDNA inserts that together comprise essentially all of the expressed 
genes of an organism. A cDNA library can be prepared by methods known to 
one skilled in the art (see, e.g., Cowell and Austin, 1997, "cDNA Library 
Protocols," Methods in Molecular Biology). Generally, RNA is first isolated 
10 from the cells of the desired organism, and the RNA is used to prepare cDNA 
molecules. 

"Cloning vector" refers to a plasmid or phage DNA or other DNA that is 
able to replicate in a host cell. The cloning vector is typically characterized by 
one or more endonuclease recognition sites at which such DNA sequences 

1 5 may be cut in a determinable fashion without loss of an essential biological 
function of the DNA, which may contain a marker suitable for use in the 
identification of cells containing the vector. 

"Regulatory sequence" refers to a nucleic acid sequence that controls 
or regulates expression of structural genes when operably linked to those 

20 genes. These include, for example, the lac systems, the trp system, major 
operator and promoter regions of the phage lambda, the control region of fd 
coat protein and other sequences known to control the expression of genes in 
prokaryotic or eukaryotic cells. Regulatory sequences will vary depending on 
whether the vector is designed to express the operably linked gene in a 

25 prokaryotic or eukaryotic host, and may contain transcriptional elements such 
as enhancer elements, termination sequences, tissue-specificity elements 
and/or translational initiation and termination sites. 

"Expression vector" refers to a vehicle or plasmid that is capable of 
expressing a gene that has been cloned into it, after transformation or 

30 integration in a host cell. The cloned gene is usually placed under the control 
of (i.e., operably linked to) a regulatory sequence. 
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"Operably linked" means that the promoter controls the initiation of 
expression of the gene. A promoter is operably linked to a sequence of 
proximal DNA if upon introduction into a host cell the promoter determines the 
transcription of the proximal DNA sequence(s) into one or more species of 
5 RNA. A promoter is operably linked to a DNA sequence if the promoter is 
capable of initiating transcription of that DNA sequence. 

"Host" includes prokaryotes and eukaryotes. The term includes an 
organism or cell that is the recipient of an expression vector (e.g., 
autonomously replicating or integrating vector). 

10 "Amplification" of nucleic acids refers to methods such as polymerase 

chain reaction (PCR), ligation amplification (or ligase chain reaction, LCR) and 
amplification methods based on the use of Q-beta replicase. These methods 
are well known in the art and described, for example, in U.S. Patent Nos. 
4,683,195 and 4,683,202. Reagents and hardware for conducting PCR are 

15 commercially available. Primers useful for amplifying sequences from the 
disorder region are preferably complementary to, and preferably hybridize 
specifically to, sequences in the 20p13-p12 region or in regions that flank a 
target region therein. Gene 216 generated by amplification may be sequenced 
directly. Alternatively, the amplified sequence(s) may be cloned prior to 

20 sequence analysis. 

"Gene" refers to a DNA sequence that encodes through its template or 
messenger RNA a sequence of amino acids characteristic of a specific peptide, 
polypeptide, or protein. The term "gene" as used herein with reference to 
genomic DNA includes intervening, non-coding regions, as well as regulatory 

25 regions, and can include 5' and 3" ends. 

A gene sequence is "wild-type" if such sequence is usually found in 
individuals unaffected by the disease or condition of interest. However, 
environmental factors and other genes can also play an important role in the 
ultimate determination of the disease. In the context of complex diseases 

30 involving multiple genes ("oligogenic disease"), the "wild type", or normal 
sequence can also be associated with a measurable risk or susceptibility, 
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receiving its reference status based on its frequency in the general population. 
As used herein, "wild-type Gene 216" refers to the reference sequence, 
BAC1098L22 (SEQ ID NO:5). The wild-type Gene 216 sequence was used to 
identify the variants (single nucleotide polymorphisms, alleles, and haplotypes) 
5 described in detail herein. 

A gene sequence is a "mutant" sequence if it differs from the wild-type 
sequence. For example, a Gene 216 nucleic acid containing a particular allele 
of a single nucleotide polymorphism may be a mutant sequence. In some 
cases, the individual carrying this allele has increased susceptibility toward the 

1 0 disease or condition of interest. In other cases, the "mutant" sequence might 
also refer to an allele that decreases the susceptibilty toward a disease or 
condition of interest, and thus acts in a protective manner. Also a gene is a 
"mutant" gene if too much ("overexpressed") or too little ("underexpressed") of 
such gene is expressed in the tissues in which such gene is normally 

1 5 expressed, thereby causing the disease or condition of interest. 

A nucleic acid or fragment thereof is "substantially homologous" to 
another if, when optimally aligned (with appropriate nucleotide insertions and/or 
deletions) with the other nucleic acid (or its complementary strand), there is 
nucleotide sequence identity in at least 60% of the nucleotide bases, usually 

20 at least 70%, more usually at least 80%, preferably at least 90%, and more 
preferably at least 95-98% of the nucleotide bases. 

Alternatively, substantial homology exists when a nucleic acid or 
fragment thereof will hybridize, under selective hybridization conditions, to 
another nucleic acid (or a complementary strand thereof). Selectivity of 

25 hybridization exists when hybridization which is substantially more selective 
than total lack of specificity occurs. Typically, selective hybridization will occur 
when there is at least about 55% sequence identity over a stretch of at least 
about nine or more nucleotides, preferably at least about 65%, more preferably 
at least about 75%, and most preferably at least about 90% (M. Kanehisa, 

30 1 984, Nucl. Acids Res. 11:203-213). The length of homology comparison, as 
described, may be over longer stretches, and in certain embodiments will often 
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be over a stretch of at least 14 nucleotides, usually at least 20 nucleotides, 
more usually at least 24 nucleotides, typically at least 28 nucleotides, more 
typically at least 32 nucleotides, and preferably at least 36 or more nucleotides. 
As used herein, the terms "protein" and "polypeptide" are synonymous. 
5 "Peptides" are defined as fragments or portions of polypeptides, preferably 
fragments or portions having at least one functional activity (e.g., proteolysis, 
adhesion, fusion, antigenic, or intracellular activity) as the complete polypeptide 
sequence. 

"Isolated" polypeptides or peptides are those that are separated from 

10 other components (e.g., DNA, RNA, and other polypeptides or peptides) with 
which they are associated (e.g., as obtained from cells, translation systems, or 
chemical synthesis systems). In a preferred embodiment, isolated 
polypeptides or peptides are at least 1 0% pure; more preferably, 80 or 90% 
pure. Isolated polypeptides and peptides include those obtained by methods 

1 5 described herein, or other established methods, including isolation from natural 
sources (e.g., cells, tissues, or organs), chemical synthesis, recombinant 
methods, or combinations of recombinant and chemical methods. Proteins or 
polypeptides referred to herein as "recombinant" are proteins or polypeptides 
produced by the expression of recombinant nucleic acids. 

20 A "portion" as used herein with regard to a protein or polypeptide, refers 

to fragments of that protein or polypeptide. The fragments can range in size 
from 5 amino acid residues to all but one residue of the entire protein 
sequence. Thus, a portion or fragment can be at least 5, 5-50, 50-100, 100- 
200, 200-400, 400-800, or more consecutive amino acid residues of a Gene 

25 216 protein or polypeptide (e.g., SEQ ID NO:4 or SEQ ID NO:363). 

An "immunogenic component", is a moiety that is capable of eliciting a 
humoral and/or cellular immune response in a host animal. 

An "antigenic component" is a moiety that binds to its specific antibody 
with sufficiently high affinity to form a detectable antigen-antibody complex. 

30 A "sample" as used herein refers to a biological sample, such as, for 

example, tissue or fluid isolated from an individual (including, without limitation, 
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plasma, serum, cerebrospinal fluid, lymph, tears, saliva, milk, pus, and tissue 
exudates and secretions) or from in vitro cell culture constituents, as well as 
samples obtained from, for example, a laboratory procedure. 

"Antibodies" refer to polyclonal and/or monoclonal antibodies and 
5 fragments thereof, and immunologic binding equivalents thereof, that can bind 
to asthma proteins and fragments thereof or to nucleic acid sequences from 
the 20p1 3-p12 region, particularly from the asthma locus or a portion thereof. 
The term antibody is used both to refer to a homogeneous molecular entity, 
or a mixture such as a serum product made up of a plurality of different 

10 molecular entities. Proteins may be prepared synthetically in a protein 
synthesizer and coupled to a carrier molecule and injected over several months 
into rabbits. Rabbit sera is tested for immunoreactivity to the protein or 
fragment. Monoclonal antibodies may be made by injecting mice with the 
proteins, or fragments thereof. Monoclonal antibodies will be screened by 

15 ELISA and tested for specific immunoreactivity with protein or fragments 
thereof. (Harlow et al., 1988, Antibodies: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY). These antibodies will be useful 
in assays as well as therapeutics. 

"Identity," as known in the art, is a relationship between two or more 

20 polypeptide sequences or two or more polynucleotide sequences, as 
determined by comparing the sequences. In the art, "identity" also means the 
degree of sequence relatedness between polypeptide or polynucleotide 
sequences, as the case may be, as determined by the match between strings 
of such sequences. "Identity" and "similarity" can be readily calculated by 

25 known methods, including but not limited to those described in (A.M. Lesk (ed), 
1988, Computational Molecular Biology, Oxford University Press, NY; D.W. 
Smith (ed), 1993, Biocomputing. Informatics and Genome Projects, Academic 
Press, NY; A.M. Griffin and H.G. Griffin, H. G (eds), 1994, Computer Analysis 
of Sequence Data, Part I, Humana Press, NJ; G. von Heinje, 1987, Sequence 

30 Analysis in Molecular Biology, Academic Press; and M. Gribskov and J. 
Devereux (eds), 1991 , Sequence Analysis Primer, M Stockton Press, NY; H. 
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Carillo and D. Lipman, 1988, SI AM J. Applied Math., 48:1073. 

Technical and scientific terms used herein have the meanings 
commonly understood by one of ordinary skill in the art to which the present 
invention pertains, unless otherwise defined. Reference is made herein to 
5 various methodologies known to those of skill in the art. Publications and other 
materials setting forth such known methodologies to which reference is made 
are incorporated herein by reference in their entireties as though set forth in 
full. 

Standard reference works setting forth the general principles of 

10 recombinant DNA technology include J. Sambrook et al., 1989, Molecular 
Cloning: A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY; P.B. Kaufman et al., (eds), 1995, Handbook of 
Molecular and Cellular Methods in Biology and Medicine, CRC Press, Boca 
Raton; M.J. McPherson (ed), 1991, Directed Mutagenesis: A Practical 

15 Approach, IRL Press, Oxford; J. Jones, 1992, Amino Acid and Peptide 
Synthesis, Oxford Science Publications, Oxford; B.M. Austen and O.M.R. 
Westwood, 1991, Protein Targeting and Secretion, IRL Press, Oxford; D.N 
Glover (ed), 1985, DNA Cloning, Volumes I and II; M.J. Gait (ed), 1984, 
Oligonucleotide Synthesis] B.D. Hames and S.J. Higgins (eds), 1984, Nucleic 

20 Acid Hybridization; Wu and Grossman (eds), Methods in Enzymoloqy 
(Academic Press, Inc.), Vol. 154 and Vol. 155; Quirke and Taylor (eds), 1991, 
PCR-A Practical Approach; Hames and Higgins (eds), 1984, Transcription and 
Translation; R.I. Freshney (ed), 1986, Animal Cell Culture; Immobilized Cells 
and Enzymes, 1986, IRL Press; Perbal, 1984, A Practical Guide to Molecular 

25 Cloning; J. H. Miller and M. P. Calos (eds), 1987, Gene Transfer Vectors for 
Mammalian Cells, Cold Spring Harbor Laboratory Press; M.J. Bishop (ed), 
1998, Guide to Human Genome Computing, 2d Ed., Academic Press, San 
Diego, CA; L.F. Peruski and A.H. Peruski, 1997, The Internet and the New 
Biology: Tools for Genomic and Molecular Research, American Society for 

30 Microbiology, Washington, D.C. 

Standard reference works setting forth the general principles of 
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immunology include S. Sell, 1996, Immunology, Immunopathofogy & Immunity, 
5th Ed., Appieton & Lange, Pub!., Stamford, CT; D. Male et al., 1996, 
Advanced Immunology, 3d Ed., Times Mirror Int'i Publishers Ltd., Publ., 
London; D.P. Stites and A.I. Terr, 1991, Basic and Clinical Immunology, 7th 
5 Ed., Appieton & Lange, Publ., Norwalk, CT; and A.K. Abbas et al., 1991, 
Cellular and Molecular Immunology, W. B. Saunders Co., Publ., Philadelphia, 
PA. Any suitable materials and/or methods known to those of skill can be 
utilized in carrying out the present invention; however, preferred materials 
and/or methods are described. Materials, reagents, and the like to which 
10 reference is made in the following description and examples are generally 
obtainable from commercial sources, and specific vendors are cited herein. 
Nucleic Acids 

The present invention relates to isolated Gene 216 nucleic acids 
comprising genomic DNA within BAC RPCM098L22 (e.g., SEQ ID NO:5), the 

15 corresponding cDNA sequences (e.g., SEQ ID NO:1 or SEQ ID NO:3), RNA, 
fragments of the genomic, cDNA, or RNA nucleic acids comprising at least 15, 
20, 40, 60, 100, 200, 500, 1520, 2070, 3915, 5009, 6875, or more contiguous 
nucleotides, and the complements thereof. Closely related variants are also 
included as part of this invention, as well as nucleic acids sharing at least 50, 

20 60, 70, 80, or 90% identity with the nucleic acids described above, and nucleic 
acids which would be identical to a Gene 216 nucleic acids except for one or 
a few substitutions, deletions, or additions. 

The invention also relates to isolated nucleic acids comprising regions 
required for accurate expression of Gene 216 (e.g., Gene 216 promoter (e.g., 

25 SEQ ID NO:8), enhancer (e.g., SEQ ID NO:7), and polyadenylation 
sequences). In a preferred embodiment, the present invention is directed to 
at least 15 contiguous nucleotides of the nucleic acid sequence of SEQ ID 
NO:1 or SEQ ID NO:6. More particularly, embodiments of this invention 
include the BAC clone containing segments of Gene 216 including 

30 RPCM 098L22 as set forth in SEQ ID NO:5 (Figure 7). 

The invention further relates to nucleic acids (e.g., DNA or RNA) that 
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hybridize to a) a nucleic acid encoding a Gene 216 polypeptide, such as a 
nucleic acid having the sequence of SEQ ID NO:1 or SEQ ID NO:6; b) 
sequence-conservative, function-conservative, and non-conservative variants 
of (a); and c) fragments or portions of (a) or (b). Nucleic acids that hybridize 
5 to the sequence of SEQ ID NO:1 or SEQ ID NO:6 can be double- or single- 
stranded. Hybridization to the sequence of SEQ ID NO:1 or SEQ ID NO:6 
includes hybridization to the strand shown or its complementary strand. 

The present invention also relates to nucleic acids that encode a 
polypeptide having the amino acid sequence of SEQ ID NO;4 or SEQ ID 

1 0 NO:363, or functional equivalents thereof. A functional equivalent of a Gene 
216 protein includes fragments or variants that perform at least on 
characteristic function of the Gene 216 protein (e.g., proteolysis, adhesion, 
fusion, antigenic, or intracellular activity). Preferably, a functional equivalent 
will share at least 65% sequence identity with the Gene 216 polypeptide. 

15 In preferred embodiments, nucleic acids of the present invention share 

at least 50%, preferably at least 60-70%, more preferably at least 70-80% 
sequence identity, and even more preferably at least 90-100% sequence 
identity with the sequences of SEQ ID NO:1 or SEQ ID NO:6, or fragments or 
portions thereof. Sequence identity calculations can be performed using 

20 computer programs, hybridization methods, or calculations. Preferred 
computer program methods to determine identity and similarity between two 
sequences include, but are not limited to, the GCG program package, 
BLASTN, BLASTX, TBLASTX, and FASTA (J. Devereux et al., 1984, Nucleic 
Acids Research 12(1):387; S.F. Altschul et al., 1990, J. Molec. Biol. 215:403- 

25 410; W. Gish and D.J. States, 1994, Nature Genet 3:266-272; W.R. Pearson 
and D.J. Lipman, 1988, Proc Natl. Acad. ScL USA 85(8):2444-8). The BLAST 
programs are publicly available from NCBI and other sources . The well-known 
Smith Waterman algorithm may also be used to determine identity. 

For example, nucleotide sequence identity can be determined by 

30 comparing a query sequences to sequences in publicly available sequence 
databases (NCBI) using the BLASTN 2 algorithm (S.F. Altschul et al., 1997, 
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Nucf. Acids Res., 25:3389-3402). The parameters for a typical search are: E 
= 0.05, v = 50, B = 50, wherein E is the expected probability score cutoff, V is 
the number of database entries returned in the reporting of the results, and B 
is the number of sequence alignments returned in the reporting of the results 
5 (S.F. Altschul et al., 1990, J. Mo/. Biol., 215:403-410). 

In another approach, nucleotide sequence identity can be calculated 
using the following equation: % identity = (number of identical nucleotides) / 
(alignment length in nucleotides) * 100. For this calculation, alignment length 
includes internal gaps but not includes terminal gaps. Alternatively, nucleotide 

10 sequence identity can be determined experimentally using the specific 
hybridization conditions described below. 

In accordance with the present invention, polynucleotide alterations are 
selected from the group consisting of at least one nucleotide deletion, 
substitution, including transition and trans version, insertion, or modification 

1 5 (e.g., via RNA or DNA analogs). Alterations may occur at the 5' or 3' terminal 
positions of the reference nucleotide sequence or anywhere between those 
terminal positions, interspersed either individually among the nucleotides in the 
reference sequence or in one or more contiguous groups within the reference 
sequence. Alterations of a polynucleotide sequence of SEQ ID NO:1 or SEQ 

20 ID NO:6 may create nonsense, missense, or frameshift mutations in this coding 
sequence, and thereby alter the polypeptide encoded by the polynucleotide 
following such alterations. 

Such altered nucleic acids, including DNA or RNA, can be detected and 
isolated by hybridization under high stringency conditions or moderate 

25 stringency conditions, for example, which are chosen to prevent hybridization 
of nucleic acids having non-complementary sequences. "Stringency 
conditions" for hybridizations is a term of art which refers to the conditions of 
temperature and buffer concentration which permit hybridization of a particular 
nucleic acid to another nucleic acid in which the first nucleic acid may be 

30 perfectly complementary to the second, or the first and second may share 
some degree of complementarity which is less than perfect. 
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For example, certain high stringency conditions can be used which 
distinguish perfectly complementary nucleic acids from those of less 
complementarity. "High stringency conditions" and "moderate stringency 
conditions" for nucleic acid hybridizations are explained in F.M. Ausubel et al. 
5 (eds), 1 995, Currant Protocols in Molecular Biology, John Wiley and Sons, Inc., 
New York, NY, the teachings of which are hereby incorporated by reference. 
In particular, see pages 2.10.1-2.10.16 (especially pages 2.10.8-2.10.11) and 
pages 6.3.1-6.3.6. The exact conditions which determine the stringency of 
hybridization depend not only on ionic strength, temperature and the 

10 concentration of destabilizing agents such as formamide, but also on factors 
such as the length of the nucleic acid sequence, base composition, percent 
mismatch between hybridizing sequences and the frequency of occurrence of 
subsets of that sequence within other non-identical sequences. Thus, high or 
moderate stringency conditions can be determined empirically. 

15 By varying hybridization conditions from a level of stringency at which 

no hybridization occurs to a level at which hybridization is first observed, 
conditions which will allow a given sequence to hybridize with the most similar 
sequences in the sample can be determined. Preferably the hybridizing 
sequences will have 60-70% sequence identity, more preferably 70-85% 

20 sequence identity, and even more preferably 90-100% sequence identity. 

Typically, the hybridization reaction is initially performed under 
conditions of low stringency, followed by washes of varying, but higher 
stringency. Reference to hybridization stringency, e.g., high, moderate, or low 
stringency, typically relates to such washing conditions. Hybridization 

25 conditions are based on the melting temperature (Tm) of the nucleic acid probe 
or primer and are typically classified by degree of stringency of the conditions 
under which hybridization is measured (Ausubel et al., 1995). For example, 
high stringency hybridization typically occurs at about 5-10% C below the T m ; 
moderate stringency hybridization occurs at about 10-20% below the T m ; and 

30 low stringency hybridization occurs at about 20-25% below the T m . The melting, 
temperature can be approximated by the formulas as known in the art, 
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depending on a number of parameters, such as the length of the hybrid or 
probe in number of nucleotides, or hybridization buffer ingredients and 
conditions. As a general guide, T m decreases approximately 1°C with every 
1 % decrease in sequence identity at any given SSC concentration. Generally, 
5 doubling the concentration of SSC results in an increase in T m of ~1 7°C. Using 
these guidelines, the washing temperature can be determined empirically for 
moderate or low stringency, depending on the level of mismatch sought 

High stringency hybridization conditions are typically carried out at 65 
to 68°C in 0.1 X SSC and 0.1% SDS. Highly stringent conditions allow 

10 hybridization of nucleic acid molecules having about 95 to 100% sequence 
identity. Moderate stringency hybridization conditions are typically carried out 
at 50 to 65°C in 1 X SSC and 0.1 % SDS. Moderate stringency conditions allow 
hybridization of sequences having at least about 80 to 95% nucleotide 
sequence identity. Low stringency hybridization conditions are typically carried 

15 out at 40 to 50°C in 6 X SSC and 0.1% SDS. Low stringency hybridization 
conditions allow detection of specific hybridization of nucleic acid molecules 
having at least about 50 to 80% nucleotide sequence identity. 

For example, high stringency conditions can be attained by hybridization 
in 50% formamide, 5 X Denhardt's solution, 5 X SSPE or SSC (1 X SSPE 

20 buffer comprises 0.15 M NaCI, 10 mM Na 2 HP0 4 , 1 mM EDTA; 1 X SSC buffer 
comprises 150 mM NaCI, 15 mM sodium citrate, pH 7.0), 0.2% SDS at about 
42°C, followed by washing in 1 X SSPE or SSC and 0.1% SDS at a 
temperature of at least about 42°C, preferably about 55°C, more preferably 
about 65°C. Moderate stringency conditions can be attained, for example, by 

25 hybridization in 50% formamide, 5 X Denhardt's solution, 5 X SSPE or SSC, 
and 0.2% SDS at 42°C to about 50°C, followed by washing in 0.2 X SSPE or 
SSC and 0.2% SDS at a temperature of at least about 42°C, preferably about 
55 D C, more preferably about 65°C. Low stringency conditions can be attained, 
for example, by hybridization in 10% formamide, 5 X Denhardt's solution, 6 X 

30 SSPE or SSC, and 0.2% SDS at 42°C, followed by washing in 1 X SSPE or 
SSC, and 0.2% SDS at a temperature of about 45°C, preferably about 50°C 
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in 4 X SSC at 60°C for 30 min. 

High stringency hybridization procedures typically (1) employ low ionic 
strength and high temperature for washing, such as 0.015 M NaCI/ 0.0015 M 
sodium citrate, pH 7.0 (0.1 X SSC) with 0.1% sodium dodecyl sulfate (SDS) at 
5 50°C; (2) employ during hybridization 50% (vol/vol) formamide with 5 X 
Denhardt's solution (0.1% weight/volume highly purified bovine serum 
albumin/0.1% wt/vol Ficoll/0.1% wt/vol polyvinylpyrrolidone), 50 mM sodium 
phosphate buffer at pH 6.5 and 5 X SSC at 42°C; or (3) employ hybridization 
with 50% formamide, 5 X SSC, 50 mM sodium phosphate (pH 6.8), 0.1% 
1 0 sodium pyrophosphate, 5 X Denhardt's solution, sonicated salmon sperm DNA 
(50 jjg/ml), 0.1 % SDS, and 1 0% dextran sulfate at 42°C, with washes at 42°C 
in 0.2 X SSC and 0.1% SDS. 

In one particular embodiment, high stringency hybridization conditions 
may be attained by: 

1 5 - Prehybridization treatment of the support (e.g. nitrocellulose filter 

or nylon membrane), to which is bound the nucleic acid capable of hybridizing 
with any of the sequences of the invention, is carried out at 65°C for 6 hr with 
a solution having the following composition: 4 X SSC, 10 X Denhardt's (1 X 
Denhardt's comprises 1% Ficoil, 1% polyvinylpyrrolidone, 1% BSA (bovine 

20 serum albumin); 1 X SSC comprises of 0.15 M of NaCI and 0.015 M of sodium 
citrate, pH 7); 

Replacement of the pre-hybridization solution in contact with the 
support by a buffer solution having the following composition: 4 X SSC, 1 X 
Denhardt's, 25 mM NaP0 4 , pH 7, 2 mM EDTA, 0.5% SDS, 100 ug/ml of 
25 sonicated salmon sperm DNA containing a nucleic acid derived from the 
sequences of the invention as probe, in particular a radioactive probe, and 
previously denatured by a treatment at 100°C for 3 min; 

Incubation for 12 hr at 65°C; 

Successive washings with the following solutions: 1) four 
30 washings with 2 X SSC, 1 X Denhardt's, 0.5% SDS for 45 min at 65°C; 2) two 
washings with 0.2 X SSC, 0.1 X SSC for 45 min at 65°C; and 3) 0.1 x SSC, 
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0.1 % SDS for 45 min at 65°C. 

Additional examples of high, medium, and low stringency conditions can 
be found in Sambrook et al., 1989. Exemplary conditions are also described 
in M.H. Krause and S.A. Aaronson, 1991 , Methods in Enzymotogy, 200:546- 
5 556; Ausubel et al., 1995. It is to be understood that the low, moderate and 
high stringency hybridization/washing conditions may be varied using a variety 
of ingredients, buffers, and temperatures well known to and practiced by the 
skilled practitioner. 

Isolated nucleic acids that are characterized by their ability to hybridize 

10 to (a) a nucleic acid encoding a Gene 216 polypeptide, such as the nucleic 
acids depicted as SEQ ID NO:1 or SEQ ID NO:6, b) the complement of (a), (c) 
or a portion of (a) or (b) (e.g., under high or moderate stringency conditions), 
may further encode a protein or polypeptide having at least one function 
characteristic of a Gene 216 polypeptide, such as proteolysis, adhesion, fusion, 

15 and intracellular activity, or binding of antibodies that also bind to non- 
recombinant Gene 21 6 protein or polypeptide. The catalytic or binding function 
of a protein or polypeptide encoded by the hybridizing nucleic acid may be 
detected by standard enzymatic assays for activity or binding (e.g., assays that 
measure the binding of a transit peptide or a precursor, or other components 

20 of the translocation machinery). Enzymatic assays, complementation tests, or 
other suitable methods can also be used in procedures for the identification 
and/or isolation of nucleic acids which encode a polypeptide having the amino 
acid sequence of SEQ ID NO:4 or SEQ ID NO:363, or a functional equivalent 
of this polypeptide. The antigenic properties of proteins or polypeptides 

25 encoded by hybridizing nucleic acids can be determined by immunological 
methods employing antibodies that bind to a Gene 216 polypeptide such as 
immunoblot, immuno precipitation and radioimmunoassay. PCR methodology, 
including RAGE (Rapid Amplification of Genomic DNA Ends), can also be used 
to screen for and detect the presence of nucleic acids which encode Gene 

30 216-like proteins and polypeptides, and to assist in cloning such nucleic acids 
from genomic DNA. PCR methods for these purposes can be found in M.A. 
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Innis et al., 1990, PCR Protocols: A Guide to Methods and Applications, 
Academic Press, Inc., San Diego, CA., incorporated herein by reference. 

It is understood that, as a result of the degeneracy of the genetic code, 
many nucleic acid sequences are possible which encode a Gene 216-like 
5 protein or polypeptide. Some of these will share little identity to the nucleotide 
sequences of any known or naturally-occurring Gene 216-like gene but can be 
used to produce the proteins and polypeptides of this invention by selection of 
combinations of nucleotide triplets based on codon choices. Such variants, 
while not hybridizable to a naturally-occurring Gene 216 gene under conditions 

10 of high stringency, are contemplated within this invention. 

Also encompassed by the present invention are alternate splice variants 
produced by differential processing of the primary transcript(s) from Gene 216 
genomic DNA. An alternate splice variant may comprise, for example, the 
sequence of any one of SEQ ID NO:2 and SEQ ID NO:350-362. Alternate 

1 5 splice variants can also comprise other combinations of introns/exons of SEQ 
ID NO:1 or SEQ ID NO:6, which can be determined by those of skill in the art. 
Alternate splice variants can be determined experimentally, for example, by 
isolating and analyzing cellular RNAs (e.g., Southern blotting or PCR), or by 
screening cDNA libraries using the Gene 216 nucleic acid probes or primers 

20 described herein. In another approach, alternate splice variants can be 
predicted using various methods, computer programs, or computer systems 
available to practitioners in the field. 

General methods for splice site prediction can be found in Nakata, 
1985, Nucleic Acids Res. 13:5327-5340. In addition, splice sites can be 

25 predicted using, for example, the GRAIL™ (E.G. Uberbacher and R.J. Mural, 
1991, Proc. Natl. Acad. Sci. USA, 88:11261-11265; E.G. Uberbacher, 1995, 
Trends Biotech., 13:497-500; available online at hypertext transfer protocol 
grail.lsd.ornl.gov/grailexp); GenView (L. Milanesi et al., 1993, Proceedings of 
the Second International Conference on Bioinformatics f Supercomputing, and 

30 Complex Genome Analysis, H.A. Lim et al. (eds), World Scientific Publishing, 
Singapore, pp. 573-588); SpliceView (Shapiro and Senapathy, 1987, Nucleic 
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Acids Res. 15:7155-7174; Rogozin and Miianesi, 1997, J. Mol. Evol. 45:50-59; 
available online at the WebGene website at hypertext transfer protocol on the 
world wide web at itba.mi.cnr.it/webgene); and HSPL (V.V. Solovyev et a!., 
1994, Nucleic Acids Res. 22:5156-5163; V.V. Solovyev et al., 1994, "The 
5 Prediction of Human Exons by Oligonucleotide Composition and Discriminant 
Analysis of Spliceable Open Reading Frames," R. Altman et al. (eds), The 
Second International conference on Intelligent systems for Molecular Biology, 
AAAI Press, Menlo Park, CA, pp. 354-362; V.V. Solovyev et al., 1993, 
"Identification Of Human Gene Functional Regions Based On Oligonucleotide 

10 Composition," L. Hunter et al. (eds), In Proceedings of First International 
conference on Intelligent System for Molecular Biology, Bethesda, pp. 371- 
379) computer systems. 

Additionally, computer programs such as GeneParser (E.E. Snyder and 
G.D. Stormo, 1995, J. Mol. Biol. 248: 1-18; E.E. Snyder and G.D. Stormo, 

15 1993, Nucl. Acids Res. 21(3): 607-613; available online at hypertext transfer 
protocol mcdb.colorado.edu/~eesnyder/ GeneParser.html); MZEF (M.Q. 
Zhang, 1997, Proc. Natl. Acad. ScL USA, 94:565-568; available online at 
hypertext transfer protocol argon.cshl.org/genefinder); MORGAN (S. Salzberg 
et al., 1998, J. Comp. Biol. 5:667-680; S. Salzberg et al. (eds), 1998, 

20 Computational Methods in Molecular Biology, Elsevier Science, New York, NY, 
pp. 187-203); VEIL (J. Henderson et al., 1997, J. Comp. Biol. 4:127-141); 
GeneScan (S. Tiwari et al., 1997, CABIOS (Biolnformatics) 13: 263-270); 
GeneBuilder (L. Miianesi etal., 1999, Biolnformatics 15:612-621); Eukaryotic 
GeneMark (J. Besemer et al., 1999, Nucl. Acids Res. 27:3911-3920); and 

25 FEXH (V.V. Solovyev et al., 1994, Nucleic Acids Res. 22:5156-5163). In 
addition, splice sites (i.e., former or potential splice sites) in cDNA sequences 
can be predicted using, for example, the RNASPL (V.V. Solovyev et al., 1994, 
Nucleic Acids Res. 22:5156-5163); or INTRON (A. Globek et a!., 1991, 
INTRON version 1.1 manual, Laboratory of Biochemical Genetics, NIMH, 

30 Washington, D.C.) programs. 

The present invention also encompasses naturally-occurring 
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polymorphisms of Gene 216. As will be understood by those in the art, the 
genomes of all organisms undergo spontaneous mutation in the course of their 
continuing evolution generating variant forms of gene sequences (Gusella, 

1986, Ann. Rev. Biochem. 55:831-854). Restriction fragment length 
5 polymorphisms (RFLPs) include variations in DNA sequences that alter the 

length of a restriction fragment in the sequence (Botstein et al., 1980, Am. J. 
Hum. Genet 32, 314-331 (1980). RFLPs have been widely used in human 
and animal genetic analyses (see WO 90/13668; WO90/1 1369; Donis-Keller, 

1987, Cell 51:319-337; Lander et al., 1989, Genetics 121: 85-99). Short 
10 tandem repeats (STRs) include tandem di-, tri- and tetra nucleotide repeated 

motifs, also termed variable number tandem repeat (VNTR) polymorphisms. 
VNTRs have been used in identity and paternity analysis (U.S. Pat. No. 
5,075,217; Armour et al., 1992, FEBS Lett 307:113-115; Horn et al., WO 
91/14003; Jeffreys, EP 370,719), and in a large number of genetic mapping 
1 5 studies. 

Single nucleotide polymorphisms (SNPs) are far more frequent than 
RFLPS, STRs, and VNTRs. SNPs may occur in protein coding (e.g., exon), or 
non-coding (e.g., intron, 5'UTR, 3'UTR) sequences. SNPs in protein coding 
regions may comprise silent mutations that do not alter the amino acid 

20 sequence of a protein. Alternatively, SNPs in protein coding regions may 
produce conservative or non-conservative amino acid changes, described in 
detail below. In some cases, SNPs may give rise to the expression of a 
defective or other variant protein and, potentially, a genetic disease. SNPs 
within protein-coding sequences can give rise to genetic diseases, for example, 

25 in the (3-globin (sickle cell anemia) and CFTR (cystic fibrosis) genes. In non- 
coding sequences, SNPs may also result in defective protein expression (e.g., 
as a result of defective splicing). Other single nucleotide polymorphisms have 
no phenotypic effects. 

Single nucleotide polymorphisms can be used in the same manner as 

30 RFLPs and VNTRs, but offer several advantages. Single nucleotide 
polymorphisms tend to occur with greater frequency and are typically spaced 
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more uniformly throughout the genome than other polymorphisms. Also, 
different SNPs are often easier to distinguish than other types of 
polymorphisms (e.g., by use of assays employing allele-specific hybridization 
probes or primers). In one embodiment of the present invention, a Gene 21 6 
5 nucleic acid contains at least one allele of one SNP as set forth in Table 10, 
herein below. Various combinations of these alleles (termed "haplotypes") are 
also encompassed by the invention. In a preferred aspect, a Gene 216 allele 
or haplotype is associated with a lung-related disorder, such as asthma. 

The nucleic acid sequences of the present invention may be derived 

1 0 from a variety of sources including DNA, cDNA, synthetic DNA, synthetic RNA, 
or combinations thereof. Such sequences may comprise genomic DNA, which 
may or may not include naturally occurring introns. Moreover, such genomic 
DNA may be obtained in association with promoter regions or poly (A) 
sequences. The sequences, genomic DNA, or cDNA may be obtained in any 

15 of several ways. Genomic DNA can be extracted and purified from suitable 
cells by means well known in the, art. Alternatively, mRNA can be isolated from 
a cell and used to produce cDNA by reverse transcription or other means. 

The nucleic acids described herein are used in the methods of the 
present invention for production of proteins or polypeptides, through 

20 incorporation into cells, tissues, or organisms. In one embodiment, DNA 
containing all or part of the coding sequence for a Gene 216 polypeptide, or 
DNA which hybridizes to DNA having the sequence SEQ ID NO:1 or SEQ ID 
NO:6, is incorporated into a vector for expression of the encoded polypeptide 
in suitable host cells. The encoded polypeptide consisting of Gene 216, or its 

25 functional equivalent is capable of normal activity, such as proteolysis, 
adhesion, fusion, and intracellular activity. 

The invention also concerns the use of the nucleotide sequence of the 
nucleic acids of this invention to identify DNA probes for Gene 21 6 genes, PCR 
primers to amplify Gene 216 genes, nucleotide polymorphisms in Gene 216 

30 genes, and regulatory elements of the Gene 216 genes. 

The nucleic acids of the present invention find use as primers 
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and templates for the recombinant production of disorder-associated peptides 
or polypeptides, for chromosome and gene mapping, to provide antisense 
sequences, for tissue distribution studies, to locate and obtain full length 
genes, to identify and obtain homologous sequences (wild-type and mutants), 
5 and in diagnostic applications. 

Probes may also be used for the detection of Gene 216-related 
sequences, and should preferably contain at least 50%, preferably at least 
80%, identity to Gene 216 polynucleotide, or a complementary sequence, or 
fragments thereof. The probes of this invention may be DNA or RNA, the 
10 probes may comprise all or a portion of the nucleotide sequence of SEQ ID 
NO:1 or SEQ ID NO:6, or a complementary sequence thereof, and may Include 
promoter, enhancer elements, and introns of the naturally occurring Gene 216 
polynucleotide. 

The probes and primers based on the Gene 216 gene sequences 

15 disclosed herein are used to identify homologous Gene 216 gene sequences 
and proteins in other species. These Gene 216 gene sequences and proteins 
are used in the diagnostic/prognostic, therapeutic and drug-screening methods 
described herein for the species from which they have been isolated. 
Vectors and Host Cells » 

20 The invention also provides vectors comprising the disorder- 

associated sequences, or derivatives or fragments thereof, and host cells for 
the production of purified proteins. A large number of vectors, including 
bacterial, yeast, and mammalian vectors, have been described for replication 
and/or expression in various host cells or cell-free systems, and may be used 

25 for gene therapy as well as for simple cloning or protein expression. 

In one aspect, an expression vectors comprises a nucleic acid encoding 
a Gene 216 polypeptide or peptide, as described herein, operably linked to at 
least one regulatory sequence. Regulatory sequences are known in the art 
and are selected to direct expression of the desired protein in an appropriate 

30 host cell. Accordingly, the term regulatory sequence includes promoters, 
enhancers and other expression control elements (see D.V. Goeddel (1990) 
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Methods Enzymol. 185:3-7). Enhancer and other expression control 
sequences are described in Enhancers and Eukaryotic Gene Expression, Cold 
Spring Harbor Press, Cold Spring Harbor, NY (1983). It should be understood 
that the design of the expression vector may depend on such factors as the 
5 choice of the host cell to be transfected and/or the type of polypeptide desired 
to be expressed. 

Several regulatory elements (e.g., promoters) have been isolated and 
shown to be effective in the transcription and translation of heterologous 
proteins in the various hosts. Such regulatory regions, methods of isolation, 

10 manner of manipulation, etc. are known in the art. Non-limiting examples of 
bacterial promoters include the (3-lactamase (penicillinase) promoter; lactose 
promoter; tryptophan (trp) promoter; araBAD (arabinose) operon promoter; 
lambda-derived Pi promoter and N gene ribosome binding site; and the hybrid 
tac promoter derived from sequences of the trp and lac UV5 promoters. Non- 

15 limiting examples of yeast promoters include the 3-phosphoglycerate kinase 
promoter, glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter, 
galactokinase (GAL1) promoter, galactoepimerase promoter, and alcohol 
dehydrogenase (ADH1) promoter. Suitable promoters for mammalian cells 
include, without limitation, viral promoters, such as those from Simian Virus 40 

20 (S V40), Rous sarcoma virus (RSV), adenovirus (ADV), and bovine papilloma 
virus (BPV). Preferred replication and inheritance systems include M13, 
ColE1, SV40, baculovirus, lambda, adenovirus, CEN ARS, 2jum ARS and the 
like. While expression vectors may replicate autonomously, they may also 
replicate by being inserted into the genome of the host cell, by methods well 

25 known in the art. 

To obtain expression in eukaryotic cells, terminator sequences, 
polyadenylation sequences, and enhancer sequences that modulate gene 
expression may be required. Sequences that cause amplification of the gene 
may also be desirable. These sequences are well known in the art. 

30 Furthermore, sequences that facilitate secretion of the recombinant product 
from cells, including, but not limited to, bacteria, yeast, and animal cells, such 
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as secretory signal sequences and/or preprotein or proprotein sequences, may 
also be included. Such sequences are well described in the art. 

Expression and cloning vectors will likely contain a selectable marker, 
a gene encoding a protein necessary for survival or growth of a host cell 
5 transformed with the vector. The presence of this gene ensures growth of only 
those host cells that express the inserts. Typical selection genes encode 
proteins that 1 ) confer resistance to antibiotics or other toxic substances, e.g. 
ampicillin, neomycin, methotrexate, etc.; 2) complement auxotrophic 
deficiencies, or 3) supply critical nutrients not available from complex media, 

10 e.g., the gene encoding D-alanine racemase for Bacilli. Markers may be an 
inducible or non-inducible gene and will generally allow for positive selection. 
Non-limiting examples of markers include the ampicillin resistance marker (i.e., 
beta-lactamase), tetracycline resistance marker, neomycin/kanamycin 
resistance marker (i.e., neomycin phosphotransferase), dihydrofoiate 

15 reductase, glutamine synthetase, and the like. The choice of the proper 
selectable marker will depend on the host cell, and appropriate markers for 
different hosts as understood by those of skill in the art. 

Suitable expression vectors for use with the present invention include, 
but are not limited to, pUC, pBluescript (Stratagene), pET (Novagen, Inc., 

20 Madison, Wl), and pREP (Invitrogen) plasmids. Vectors can contain one or 
more replication and inheritance systems for cloning or expression, one or 
more markers for selection in the host, e.g. antibiotic resistance, and one or 
more expression cassettes. The inserted coding sequences can be 
synthesized by standard methods, isolated from natural sources, or prepared 

25 as hybrids. Ligation of the coding sequences to transcriptional regulatory 
elements (e.g., promoters, enhancers, and/or insulators) and/or to other amino 
acid encoding sequences can be carried out using established methods. 

Suitable cell-free expression systems for use with the present invention 
include, without limitation, rabbit reticulocyte lysate, wheat germ extract, canine 

30 pancreatic microsomal membranes, E. coli S30 extract, and coupled 
transcription/translation systems (Promega Corp., Madison, Wl). These 
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systems allow the expression of recombinant polypeptides or peptides upon 
the addition of cloning vectors, DNA fragments, or RNA sequences containing 
protein-coding regions and appropriate promoter elements. 

Non-limiting examples of suitable host ceils include bacteria, archea, 
5 insect, fungi (e.g., yeast), plant, and animal cells (e.g., mammalian, especially 
human). Of particular interest are Escherichia coli, Bacillus subtilis, 
Saccharomyces cerevisiae, SF9 cells, C129 cells, 293 cells, Neurospora, and 
immortalized mammalian myeloid and lymphoid cell lines. Techniques for the 
propagation of mammalian cells in culture are well-known (see, Jakoby and 

10 Pastan (eds), 1979, Cell Culture. Methods in Enzymoiogy, volume 58, 
Academic Press, Inc., Harcourt Brace Jovanovich, NY). Examples of 
commonly used mammalian host cell lines are VERO and HeLa cells, CHO 
cells, and WI38, BHK, and COS cell lines, although it will be appreciated by the 
skilled practitioner that other ceil lines may be used, e.g., to provide higher 

1 5 expression desirable glycosylation patterns, or other features. 

Host cells can be transformed, transfected, or infected as appropriate 
by any suitable method including electroporation, calcium chloride-, lithium 
chloride-, lithium acetate/polyethylene glycol-, calcium phosphate-, DEAE- 
dextran-, liposome-mediated DNA uptake, spheroplasting, injection, 

20 microinjection, microprojectile bombardment, phage infection, viral infection, 
or other established methods. Alternatively, vectors containing the nucleic 
acids of interest can be transcribed in vitro, and the resulting RNA introduced 
into the host cell by well-known methods, e.g., by injection (see, Kubo et ah, 
1988, FEBS Letts. 241:119). The cells into which have been introduced 

25 nucleic acids described above are meant to also include the progeny of such 
cells. 

The nucleic acids of the invention may be isolated directly from cells. 
Alternatively, the polymerase chain reaction (PGR) method can be used to 
produce the nucleic acids of the invention, using either RNA (e.g., mRNA) or 
30 DNA (e.g., genomic DNA) as templates. Primers used for PCR can be 
synthesized using the sequence information provided herein and can further 
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be designed to introduce appropriate new restriction sites, if desirable, to 
facilitate incorporation into a given vector for recombinant expression. 

Using the information provided in SEQ ID NO:1 and SEQ ID NO:6, one 
skilled in the art will be able to clone and sequence all representative nucleic 
5 acids of interest, including nucleic acids encoding complete protein-coding 
sequences. It is to be understood that non-protein-coding sequences 
contained within SEQ ID NO:1 and SEQ ID NO:3 and the genomic sequences 
of SEQ ID NO:6 and SEQ ID NO:5 are also within the scope of the invention. 
Such sequences include, without limitation, sequences important for 

10 replication, recombination, transcription, and translation. Non-limiting 
examples include promoters and regulatory binding sites involved in regulation 
of gene expression, and 5'- and 3'- untranslated sequences (e.g., ribosome- 
binding sites) that form part of mRNA molecules. 

The nucleic acids of this invention can be produced in large quantities 

15 by replication in a suitable host cell. Natural or synthetic nucleic acid 
fragments, comprising at least ten contiguous bases coding for a desired 
peptide or polypeptide can be incorporated into recombinant nucleic acid 
constructs, usually DNA constructs, capable of introduction into and replication 
in a prokaryotic or eukaryotic cell. Usually the nucleic acid constructs will be 

20 suitable for replication in a unicellular host, such as yeast or bacteria, but may 
also be intended for introduction to (with and without integration within the 
genome) cultured mammalian or plant or other eukaryotic cells, cell lines, 
tissues, or organisms. The purification of nucleic acids produced by the 
methods of the present invention is described, for example, in Sambrook et al., 

25 1989; F.M. Ausubel et al., 1992, Current Protocols in Molecular Biology, J. 
Wiley and Sons, New York, NY. 

The nucleic acids of the present invention can also be produced by 
chemical synthesis, e.g., by the phosphoramidite method described by 
Beaucage et al., 1981, Tetra. Letts. 22:1859-1862, or the triester method 

30 according to Matteucci et al., 1981, J. Am. Chem. Soc, 103:3185, and can 
performed on commercial, automated oligonucleotide synthesizers. A double- 
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stranded fragment may be obtained from the single-stranded product of 
chemical synthesis either by synthesizing the complementary strand and 
annealing the strands together under appropriate conditions or by adding the 
complementary strand using DNA polymerase with an appropriate primer 
5 sequence. 

These nucleic acids can encode full-length variant forms of proteins as 
well as the wild-type protein. The variant proteins (which could be especially 
useful for detection and treatment of disorders) will have the variant amino acid 
sequences encoded by the polymorphisms described in Table 10, when said 

10 polymorphisms are read so as to be in-frame with the full-length coding 
sequence of which it is a component. 

Large quantities of the nucleic acids and proteins of the present 
invention may be prepared by expressing the Gene 216 nucleic acids or 
portions thereof in vectors or other expression vehicles in compatible 

15 prokaryotic or eukaryotic host cells. The most commonly used prokaryotic 
hosts are strains of Escherichia coli, although other prokaryotes, such as 
Bacillus subtilis or Pseudomonas may also be used. Mammalian or other 
eukaryotic host cells, such as those of yeast, filamentous fungi, plant, insect, 
or amphibian or avian species, may also be useful for production of the 

20 proteins of the present invention. For example, insect cell systems (i.e., 
lepidopteran host cells and baculovirus expression vectors) are particularly 
suited for large-scale protein production. 

Host cells carrying an expression vector (i.e., transformants or clones) 
are selected using markers depending on the mode of the vector construction. 

25 The marker may be on the same or a different DNA molecule, preferably the 
same DNA molecule. In prokaryotic hosts, the transformant may be selected, 
e.g., by resistance to ampicillin, tetracycline or other antibiotics. Production of 
a particular product based on temperature sensitivity may also serve as an 
appropriate marker. 

30 Prokaryotic or eukaryotic cells comprising the nucleic acids of the 

present invention will be useful not only for the production of the nucleic acids 



-36- 



WO 02/083077 



PCT/US02/12063 



and proteins of the present invention, but also, for example, in studying the 
characteristics of Gene 216 proteins. Cells and animals that carry the Gene 
216 gene can be used as model systems to study and test for substances that 
have potential as therapeutic agents. The cells are typically cultured 
5 mesenchymal stem cells. These may be isolated from individuals with somatic 
or germline Gene 216 gene. Alternatively, the cell line can be engineered to 
carry the Gene 216 genes, as described above. After a test substance is 
applied to the cells, the transformed phenotype of the cell is determined. Any 
trait of transformed cells can be assessed, including respiratory diseases 
10 including asthma, atopy, and response to application of putative therapeutic 
agents. 

Antisense Nucleic Acids 

A further embodiment of the invention is antisense nucleic acids or 
oligonucleotides that are complementary, in whole or in part, to a target 

15 molecule comprising a sense strand of Gene 216. The Gene 216 target can 
be DNA, or its RNA counterpart (i.e., wherein thymine (T) is present in DNA 
and uracil (U) is present in RNA). When introduced into a cell, antisense 
nucleic acids or oligonucleotides can hybridize to all or a part of the sense 
strand of Gene 216, thereby inhibiting gene expression or replication. 

20 In a particular embodiment of the invention, an antisense nucleic acid 

or oligonucleotide is wholly or partially complementary to, and can hybridize 
with, a target nucleic acid (either DNA or RNA) having the sequence of SEQ 
ID NO:1 or SEQ ID NO:6. For example, an antisense nucleic acid or 
oligonucleotide comprising 16 nucleotides can be sufficient to inhibit 

25 expression of the Gene 216 protein. Alternatively, an antisense nucleic acid 
or oligonucleotide can be complementary to 5" or 3' untranslated regions, or 
can overlap the translation initiation codon (5' untranslated and translated 
regions) of the Gene 216 gene, or its functional equivalent. In another 
embodiment, the antisense nucleic acid is wholly or partially complementary 

30 to, and can hybridize with, a target nucleic acid that encodes a Gene 216 
polypeptide. 
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In addition, oligonucleotides can be constructed which will bind to 
duplex nucleic acid (i.e., DNA:DNA or DNA:RNA), to form a stable triple helix- 
containing or triplex nucleic acid. Such triplex oligonucleotides can inhibit 
transcription and/or expression of a gene encoding Gene 216, or its functional 
5 equivalent (M.D. Frank-Kamenetskii and S.M. Mirkin, 1995, Ann. Rev. 
Biochem. 64:65-95). Triplex oligonucleotides are constructed using the base- 
pairing rules of triple helix formation and the nucleotide sequence of the gene 
or mRNA for Gene 21 6. 

The present invention encompasses methods of using oligonucleotides 

10 in antisense inhibition of the function of Gene 216. In the context of this 
invention, the term "oligonucleotide" refers to naturally-occurring species or 
synthetic species formed from naturally-occurring subunits or their close 
homologs. The term may also refer to moieties that function similarly to 
oligonucleotides, but have non-naturally-occurring portions. Thus, 

15 oligonucleotides may have altered sugar moieties or inter-sugar linkages. 
Exemplary among these are phosphorothioate and other sulfur containing 
species which are known in the art. 

In preferred embodiments, at least one of the phosphodiester bonds of 
the oligonucleotide has been substituted with a structure that functions to 

20 enhance the ability of the compositions to penetrate into the region of cells 
where the RNA whose activity is to be modulated is located. It is preferred that 
such substitutions comprise phosphorothioate bonds, methyl phosphonate 
bonds, or short chain alkyl or cycloalkyl structures. In accordance with other 
preferred embodiments, the phosphodiester bonds are substituted with 

25 structures which are, at once, substantially non-ionic and non-chiral, or with 
structures which are chiral and enantiomerically specific. Persons of ordinary 
skill in the art will be able to select other linkages for use in the practice of the 
invention. 

Oligonucleotides may also include species that include at least some 
30 modified base forms. Thus, purines and pyrimidines other than those normally 
found in nature may be so employed. Similarly, modifications on the furanosyl 
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portions of the nucleotide subunits may also be effected, as long as the 
essential tenets of this invention are adhered to. Examples of such 
modifications are 2 f -0-alkyl- and 2-halogen-substituted nucleotides. Some 
non-limiting examples of modifications at the 2' position of sugar moieties 
5 which are useful in the present invention include OH, SH, SCH 3 , F, OCH 3 , 
OCN, 0(CH 2 ) n NH 2 and 0(CH 2 ) n CH 3 , where n is from 1 to about 10. Such 
oligonucleotides are functionally interchangeable with natural oligonucleotides 
or synthesized oligonucleotides, which have one or more differences from the 
natural structure. All such analogs are comprehended by this invention so long 

10 as they function effectively to hybridize with Gene 21 6 DNA or RNA to inhibit 
the function thereof. 

The oligonucleotides in accordance with this invention preferably 
comprise from about 3 to about 50 subunits. It is more preferred that such 
oligonucleotides and analogs comprise from about 8 to about 25 subunits and 

15 still more preferred to have from about 12 to about 20 subunits. As defined 
herein, a "subunit" is a base and sugar combination suitably bound to adjacent 
subunits through phosphodiester or other bonds. 

Antisense nucleic acids or oligonucleotides can be produced by 
standard techniques (see, e.g., Shewmaker et al., U.S. Patent No. 5,107,065. 

20 The oligonucleotides used in accordance with this invention may be 
conveniently and routinely made through the well-known technique of solid 
phase synthesis. Equipment for such synthesis is available from several 
vendors, including PE Applied Biosystems (Foster City, CA). Any other means 
for such synthesis may also be employed, however, the actual synthesis of the 

25 oligonucleotides is well within the abilities of the practitioner. It is also will 
known to prepare other oligonucleotide such as phosphorothioates and 
alkylated derivatives. 

The oligonucleotides of this invention are designed to be hybridizable 
with Gene 216 RNA (e.g., mRNA) or DNA. For example, an oligonucleotide 

30 (e.g., DNA oligonucleotide) that hybridizes to Gene 216 mRNA can be used to 
target the mRNA for RnaseH digestion. Alternatively; an oligonucleotide that 
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hybridizes to the translation initiation site of Gene 216 mRNA can be used to 
prevent translation of the mRNA. In another approach, oligonucleotides that 
bind to the double-stranded DNA of Gene 216 can be administered. Such 
oligonucleotides can form a triplex construct and inhibit the transcription of the 
5 DNA encoding Gene 21 6 polypeptides. Triple helix pairing prevents the double 
helix from opening sufficiently to allow the binding of polymerases, transcription 
factors, or regulatory molecules. Recent therapeutic advances using triplex 
DNA have been described (see, e.g., J.E. Gee et a)., 1994, Molecular and 
Immunologic Approaches, Futura Publishing Co., Mt. Kisco, NY). 

10 As non-limiting examples, antisense oligonucleotides may be targeted 

to hybridize to the following regions: mRNA cap region; translation initiation 
site; translational termination site; transcription initiation site; transcription 
termination site; polyadenylation signal; 3' untranslated region; 5' untranslated 
region; 5' coding region; mid coding region; and 3' coding region. Preferably, 

1 5 the complementary oligonucleotide is designed to hybridize to the most unique 
5' sequence Gene 216, including any of about 15-35 nucleotides spanning the 
5 1 coding sequence. Appropriate oligonucleotides can be designed using 
OLIGO software (Molecular Biology Insights, Inc., Cascade, CO; available 
online at hyperlink transfer protocol on the world wide web atoligo.net). 

20 In accordance with the present invention, the antisense oligonucleotide 

can be synthesized, formulated as a pharmaceutical composition, and 
administered to a subject. The synthesis and utilization of antisense and triplex 
oligonucleotides have been previously described (e.g., H. Simon et al., 1999, 
Antisense Nucleic Acid Drug Dev. 9:527-31 ; F.X. Barre et a!., 2000, Proc. Natl. 

25 Acad. Sci. USA 97:3084-3088; R. Elez et al., 2000, Biochem. Biophys. Res. 
Commun. 269:352-6; E.R. Sauter et al., 2000, din. Cancer Res. 6:654-60). 
Alternatively, expression vectors derived from retroviruses, adenovirus, herpes 
or vaccinia viruses, or from various bacterial plasmids may be used for delivery 
of nucleotide sequences to the targeted organ, tissue or cell population. 

30 Methods which are well known to those skilled in the art can be used to 
construct recombinant vectors which will express nucleic acid sequence that 



-40- 



WO 02/083077 



PCT/US02/12063 



is complementary to the nucleic acid sequence encoding a Gene 216 
polypeptide. These techniques are described both in Sambrook et aL, 1989 
and in Ausubel et al., 1992. For example, Gene 216 expression can be 
inhibited by transforming a cell or tissue with an expression vector that 
5 expresses high levels of untranslatable sense or antisense Gene 216 
sequences. Even in the absence of integration into the DNA, such vectors may 
continue to transcribe RNA molecules until they are disabled by endogenous 
nucleases. Transient expression may last for a month or more with a non- 
replicating vector, and even longer if appropriate replication elements included 

1 0 in the vector system. 

Various assays may be used to test the ability of Gene 216-specific 
antisense oligonucleotides to inhibit Gene 21 6 expression. For example, Gene 
216 mRNA levels can be assessed northern blot analysis (Sambrook et al., 
1989; Ausubel et al., 1992; J.C. Alwine et al. 1977, Proc. Natl. Acad. ScL USA 

1 5 74:5350-5354; I.M. Bird, 1 998, Methods Mol. Biol. 105:325-36), quantitative or 
semi-quantitative RT-PCR analysis (see, e.g., W.M. Freeman et al., 1999, 
Biotechniques 26:1 12-122; Ren etal., 1998, Mol. Brain Res. 59:256-63; J.M. 
Cale et al., 1998, Methods Mol. Biol. 105:351-71), or in situ hybridization 
(reviewed by AX. Raap, 1998, Mutat Res. 400:287-298). Alternatively, 

20 antisense oligonucleotides may be assessed by measuring levels of Gene 216 
polypeptide, e.g., by western blot analysis, indirect immunofluorescence, 
immunoprecipitation techniques (see, e.g., J.M. Walker, 1998, Protein 
Protocols on CD-ROM, Humana Press, Totowa, NJ). 
Polypeptides 

25 The invention also relates to polypeptides and peptides encoded by the 

novel nucleic acids described herein. The polypeptides and peptides of this 
invention can be isolated and/or recombinant In a preferred embodiment, the 
Gene 21 6 polypeptide, or analog or portion thereof, has at least one function 
characteristic of a Gene 216 protein, for example, proteolysis, adhesion, 

30 fusion, antigenic, and intracellular activity. Protein analogs include, for 
example, naturally-occurring or genetically engineered Gene 216 variants (e.g. 
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mutants) and portions thereof. Variants may differ from wild-type Gene 216 
protein by the addition, deletion, or substitution of one, or more amino acid 
residues. In specific embodiments, polypeptide variants are encoded by Gene 
216 nucleic acids containing one or more of the alleles or haplotypes disclosed 
5 herein. Variants also include polypeptides in which one or more residues are 
modified (i.e., by phosphorylation, sulfation, acylation, etc.), and mutants 
comprising one or more modified residues. 

Variant polypeptides can have conservative changes, wherein a 
substituted amino acid has similar structural or chemical properties, e.g., 

10 replacement of leucine with isoleucine. More infrequently, a variant 
polypeptide can have non-conservative changes, e.g., substitution of a glycine 
with a tryptophan. Guidance in determining which amino acid residues can be 
substituted, inserted, or deleted without abolishing biological or immunological 
activity can be found using computer programs well known in the art, for 

1 5 example, DNASTAR software (DNASTAR, Inc., Madison, Wl) 

As non-limiting examples, conservative substitutions in the Gene 216 
amino acid sequence can be made in accordance with the following table: 



Original Residue 


Conservative Substitution(s) 


Ala 


Ser 


Arg 


Lys 


Asn 


Gin, His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Pro 


His 


Asn, Gin 


lie 


Leu, Val 


Leu 


lie, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, lie 


Phe 


Met, Leu, Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp, Phe 


Val 


lie, Leu 
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Substantial changes in function or immunogenicity can be made by 
selecting substitutions that are less conservative than those shown in the table, 
above. For example, non-conservative substitutions can be made which more 
5 significantly affect the structure of the polypeptide in the area of the alteration, 
for example, the alpha-helical, or beta-sheet structure; the charge or 
hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
The substitutions which generally are expected to produce the greatest 
changes in the polypeptide's properties are those where 1) a hydrophilic 

1 0 residue, e.g., seryl or threonyl, is substituted for (or by) a hydrophobic residue, 
e.g., leucyl, isoleucyl, phenylalanyl, valyl, oralanyl; 2) a cysteine or proline is 
substituted for (or by) any other residue; 3) a residue having an electropositive 
side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or 4) a residue having a 

15 bulky side chain, e.g., phenylalanine, is substituted for (or by) a residue that 
does not have a side chain, e.g., glycine. 

In one embodiment, polypeptides of the present invention share at least 
50% amino acid sequence identity with a Gene 216 polypeptide, such as SEQ 
ID NO:4, or fragments thereof. Preferably, the polypeptides share at least 65% 

20 amino acid sequence identity; more preferably, the polypeptides share at least 
75% amino acid sequence identity; even more preferably, the polypeptides 
share at least 80% amino acid sequence identity with a Gene 216 polypeptide; 
still more preferably the polypeptides share at least 90% amino acid sequence 
identity with a Gene 216 polypeptide. 

25 Percent sequence identity can be calculated using computer programs 

or direct sequence comparison. Preferred computer program methods to 
determine identity between two sequences include, but are not limited to, the 
GCG program package, FASTA, BLASTP, and TBLASTN (see, e.g., D.W. 
Mount, 2001 , Bioinformatics: Sequence and Genome Analysis, Cold Spring 

30 Harbor Laboratory Press, Cold Spring Harbor, NY). The BLASTP and 
TBLASTN programs are publicly available from NCBI and other sources. The 
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well-known Smith Waterman algorithm may also be used to determine identity. 

Exemplary parameters for amino acid sequence comparison include the 
following: 1) algorithm from Needleman and Wunsch, 1970, J Moi Bio!. 
48:443-453; 2) BLOSSUM62 comparison matrix from Hentikoff and Hentikoff, 
5 1992, Prac. Natl. Acad. Sci. USA 89:10915-10919; 3) gap penalty = 12; and 
4) gap length penalty =4. A program useful with these parameters is publicly 
available as the "gap" program (Genetics Computer Group, Madison, Wl). The 
aforementioned parameters are the default parameters for polypeptide 
comparisons (with no penalty for end gaps). 

1 0 Alternatively, polypeptide sequence identity can be calculated using the 

following equation: % identity = (the number of identical residues) / (alignment 
length in amino acid residues) * 100. For this calculation, alignment length 
includes internal gaps but does not include terminal gaps. 

In accordance with the present invention, polypeptide sequences may 

15 be identical to the sequence of SEQ ID NO:4, or may include up to a certain 
integer number of amino acid alterations. Polypeptide alterations are selected 
from the group consisting of at least one amino acid deletion, substitution, 
including conservative and non-conservative substitution, or insertion. 
Alterations may occur at the amino- or carboxy-terminal positions of the 

20 reference polypeptide sequence or anywhere between those terminal 
positions, interspersed either individually among the amino acids in the 
reference sequence or in one or more contiguous groups within the reference 
sequence. In specific embodiments, polypeptide variants may be encoded by 
Gene 216 nucleic acids comprising SNP-related alleles or haplotypes and/or 

25 alternate splice variants. 

The invention also relates to isolated, synthesized and/or recombinant 
portions or fragments of a Gene 216 protein or polypeptide as described 
herein. Polypeptide fragments (i.e., peptides) can be made which have full or 
partial function on their own, or which when mixed together (though fully, 

30 partially, or nonfunctional alone), spontaneously assemble with one or more 
other polypeptides to reconstitute a functional protein having at least one 
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functional characteristic of a Gene 216 protein of this invention. In addition, 
Gene 216 polypeptide fragments may comprise, for example, one or more 
domains of the Gene 216 polypeptide (e.g., the pre-, pro-, catalytic, cysteine- 
rich, disintegrin, EGF, transmembrane, and cytoplasmic domains) disclosed 
5 herein. 

Polypeptides according to the invention can comprise at least 5 amino 
acid residues; preferably the polypeptides comprise at least 12 residues; more 
preferably the polypeptides comprise at least 20 residues; and yet more 
preferably the polypeptides comprise at least 30 residues. Nucleic acids 

1 0 comprising protein-coding sequences can be used to direct the expression of 
asthma-associated polypeptides in intact cells or in cell-free translation 
systems. The coding sequence can be tailored, if desired, for more efficient 
expression in a given host organism, and can be used to synthesize 
oligonucleotides encoding the desired amino acid sequences. The resulting 

1 5 oligonucleotides can be inserted into an appropriate vector and expressed in 
a compatible host organism or translation system. 

The polypeptides of the present invention, including function- 
conservative variants, may be isolated from wild-type or mutant cells (e.g., 
human cells or cell lines), from heterologous organisms or cells (e.g., bacteria, 

20 yeast, insect, plant, and mammalian cells), or from cell-free translation systems 
(e.g., wheat germ, microsomal membrane, or bacterial extracts) in which a 
protein-coding sequence has been introduced and expressed. Furthermore, 
the polypeptides may be part of recombinant fusion proteins. The polypeptides 
can also, advantageously, be made by synthetic chemistry. Polypeptides may 

25 be chemically synthesized by commercially available automated procedures, 
including, without limitation, exclusive solid phase synthesis, partial solid phase 
methods, fragment condensation or classical solution synthesis. 

Methods for polypeptide purification are well-known in the art, including, 
without limitation, preparative disc-gel electrophoresis, isoelectric focusing, 

30 HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition 
chromatography, and countercurrent distribution. For some purposes, it is 



-45- 



WO 02/083077 



PCT/US02/12063 



preferable to produce the polypeptide in a recombinant system in which the 
protein contains an additional sequence (e.g., epitope or protein) tag that 
facilitates purification. Non-limiting examples of epitope tags include c-myc, 
haemagglutinin (HA), polyhistidine (6X-HIS) (SEQ ID NO:32), GLU-GLU, and 
5 DYKDDDDK (SEQ ID NO:33) (FLAG®) epitope tags. Non-limiting examples 
of protein tags include glutathione-S-transferase (GST), green fluorescent 
protein (GFP), and maltose binding protein (MBP). 

In one approach, the coding sequence of a polypeptide or peptide can 
be cloned into a vector that creates a fusion with a sequence tag of interest. 

10 Suitable vectors include, without limitation, pRSET (Invitrogen Corp., San 
Diego, CA), pGEX (Amersham-Pharmacia Biotech, Inc., Piscataway, NJ), 
pEGFP (CLONTECH Laboratories, Inc., Palo Alto, CA), and pMAL™ (New 
England BioLabs (NEB), Inc., Beverly, MA) plasmids. Following expression, 
the epitope, or protein tagged polypeptide or peptide can be purified from a 

1 5 crude lysate of the translation system or host cell by chromatography on an 
appropriate solid-phase matrix. In some cases, it may be preferable to remove 
the epitope or protein tag (i.e., via protease cleavage) following purification. As 
an alternative approach, antibodies produced against a disorder-associated 
protein or against peptides derived therefrom can be used as purification 

20 reagents. Other purification methods are possible. 

The present invention also encompasses polypeptide derivatives of 
Gene 216. The isolated polypeptides may be modified by, for example, 
phosphorylation, sulfation, acylation, or other protein modifications. They may 
also be modified with a label capable of providing a detectable signal, either 

25 directly or indirectly, including, but not limited to, radioisotopes and fluorescent 
compounds. 

Both the naturally occurring and recombinant forms of the polypeptides 
of the invention can advantageously be used to screen compounds for binding 
activity. Many methods of screening for binding activity are known by those 
30 skilled in the art and may be used to practice the invention. Several methods 
of automated assays have been developed in recent years so as to permit 
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screening of tens of thousands of compounds in a short period of time. Such 
high-throughput screening methods are particularly preferred. The use of high- 
throughput screening assays to test for inhibitors is greatly facilitated by the 
availability of large amounts of purified polypeptides, as provided by the 
5 invention. The polypeptides of the invention also find use as therapeutic 
agents as well as antigenic components to prepare antibodies. 

The polypeptides of this invention find use as immunogenic components 
useful as antigens for preparing antibodies by standard methods. It is well 
known in the art that immunogenic epitopes generally contain at least about 

10 five amino acid residues (Ohno et al., 1985, Proc. Natl. Acad. Sci. USA 
82:2945). Therefore, the immunogenic components of this invention will 
typically comprise at least 5 amino acid residues of the sequence of the 
complete polypeptide chains. Preferably, they will contain at least 7, and most 
preferably at least about 10 amino acid residues or more to ensure that they 

1 5 will be immunogenic. Whether a given component is immunogenic can readily 
be determined by routine experimentation Such immunogenic components 
can be produced by proteolytic cleavage of larger polypeptides or by chemical 
synthesis or recombinant technology and are thus not limited by proteolytic 
cleavage sites. The present invention thus encompasses antibodies that 

20 specifically recognize asthma-associated immunogenic components. 
Structural Studies 

A purified Gene 216 polypeptide can be analyzed by well-established 
methods (e.g., X-ray crystallography, NMR, CD, etc.) to determine the three- 
dimensional structure of the molecule. The three-dimensional structure, in 

25 turn, can be used to model intermolecular interactions. Exemplary methods for 
crystallization and X-ray crystallography are found in P.G. Jones, 1981, 
Chemistry in Britain, 17:222-225; C. Jones et al. (eds), Crystallographic 
Methods and Protocols, Humana Press, Totowa, NJ; A. McPherson, 1982, 
Preparation and Analysis of Protein Crystals, John Wiley & Sons, New York, 

30 NY; T.L. Blundell and L.N. Johnson, 1976, Protein Crystallography, Academic 
Press, Inc., New York, NY; A. Holden and P. Singer, 1960, Crystals and Crystal 
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Growing, Anchor Books-Doubleday, New York, NY; R.A. Laudise, 1970, The 
Growth of Single Crystals, Solid State Physical Electronics Series, N. 
Holonyak, Jr., (ed), Prentice-Hall, Inc.; G.H. Stout and L.H. Jensen, 1989, X- 
ray Structure Determination: A Practical Guide, 2nd edition, John Wiliey & 
5 Sons, New York, NY; Fundamentals of Analytical Chemistry, 3rd. edition, 
Saunders Golden Sunburst Series, Holt, Rinehart and Winston, Philadelphia, 
PA, 1976; P.D. Boyle of the Department of Chemistry of North Carolina State 
University website at hypertext transfer protocol laue.chem.ncsu.edu/web/Grow 
Xtal.html; M.B. Berry, 1995, Protein Crystalization: Theory and Practice, 

10 Structure and Dynamics of E. coli Adenylate Kinase, Doctoral Thesis, Rice 
University, Houston TX. 

For X-ray diffraction studies, single crystals can be grown to suitable 
size. Preferably, a crystal has a size of 0.2 to 0.4 mm in at least two of the 
three dimensions. Crystals can be formed in a solution comprising a Gene 216 

15 polypeptide (e.g., 1.5-200 mg/ml) and reagents that reduce the solubility to 
conditions close to spontaneous precipitation. Factors that affect the formation 
of polypeptide crystals include: 1) purity; 2) substrates or co-factors; 3) pH; 4) 
temperature; 5) polypeptide concentration; and 6) characteristics of the 
precipitant. Preferably, the Gene 216 polypeptides are pure, i.e., free from 

20 contaminating components (at least 95% pure), and free from denatured Gene 
216 polypeptides, in particular, polypeptides can be purified by FPLC and 
HPLC techniques to assure homogeneity (see, Lin et al., 1992, J. Crystal. 
Growth. 122:242-245). Optionally, Gene 216 polypeptide substrates or co- 
factors can be added to stabilize the quaternary structure of the protein and 

25 promote lattice packing. 

Suitable precipitants for crystallization include, but are not limited to, 
salts (e.g., ammonium sulphate, potassium phosphate); polymers (e.g., 
polyethylene glycol (PEG) 6000); alcohols (e.g., ethanol); polyalcohols (e.g., 
1-methyl-2,4 pentane diol (MPD)); organic solvents; sulfonic dyes; and 

30 deionized water. The ability of a salt to precipitate polypeptides can be 
generally described by the Hofmeister series: PQ 4 3 " > HPQ 4 2 ~ = S0 4 2 " > citrate 
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> CH 3 C0 2 "> CI - > Br"> N0 3 "> CIO4" > SCN"; and NH 4 + > K + > Na + > Li + . Non- 
limiting examples of salt precipitants are shown below (see Berry, 1995). 



Precipitant 


Maximum concentration 


(NH 4 7Na7l_f ) 2 or Mg 2 +S<V 


4.0/1.5/2.1 12.5 M 


NH 4 7Na7K + PO4 3 " 


3.0/4.0/4.0 M 


NH 4 + /K + /Na + /Li + citrate 


-1.8 M 


NH 4 7K7Na7Li + acetate 


-3.0 M 


NH 4 7K7Na7Li + Cl" 


5.2 / 9.8 / 4.2 / 5.4 M 


NH 4 + N0 3 - 


-8.0 M 



5 High molecular weight polymers useful as precipitating agents include 

polyethylene glycol (PEG), dextran, polyvinyl alcohol, and polyvinyl pyrrolidone 
(A. Poison et al., 1964, Biochem. Bfophys. Acta. 82:463-475). In general, 
polyethylene glycol (PEG) is the most effective for forming crystals. PEG 
compounds with molecular weights less than 1000 can be used at 

1 0 concentrations above 40% v/v. PEGs with molecular weights above 1 000 can 
be used at concentration 5-50% w/v. Typically, PEG solutions are mixed with 
-0.I % sodium azide to prevent bacterial growth. 

Typically, crystallization requires the addition of buffers and a specific 
salt content to maintain the proper pH and ionic strength for a protein's stability. 

1 5 Suitable additives include, but are not limited to sodium chloride (e.g., 50-500 
mM as additive to PEG and MPD; 0.15-2 M as additive to PEG); potassium 
chloride (e.g., 0.05-2 M); lithium chloride (e.g., 0.05-2 M); sodium fluoride (e.g., 
20-300 mM); ammonium sulfate (e.g., 20-300 mM); lithium sulfate (e.g., 0.05-2 
M); sodium or ammonium thiocyanate (e.g., 50-500 mM); MPD (e.g., 0.5-50%); 

20 1,6 hexane diol (e.g., 0.5-10%); 1,2,3 heptane triol (e.g., 0.5-15%); and 
benzamidine (e.g., 0.5-15%). 

Detergents may be used to maintain protein solubility and prevent 
aggregation. Suitable detergents include, but are not limited to non-ionic 
detergents such as sugar derivatives, oligoethyleneglycol derivatives, 

25 dimethylamine-N-oxides, cholate derivatives, N-octyl hydroxyalkylsulphoxides, 
sulphobetains, and lipid-like detergents. Sugar-derived detergents include alkyl 
giucopyranosides (e.g., C8-GP, C9-GP), alkyl thio-glucopyranosides (e.g., C8- 
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tGP), alkyl maltopyranosides (e.g., C10-M, C12-M; CYMAL-3, CYMAL-5, 
CYMAL-6), alkyl thio-maltopyranosides, alkyl galactopyranosides, alkyl 
sucroses (e.g., N-octanoylsucrose), and glucamides (e.g., HECAMEG, C- 
HEGA-10; MEGA-8). Oligoethyleneglycol-derived detergents include alkyl 
5 polyoxyethylenes (e.g., C8-E5, C8-En; C12-E8; C12-E9) and phenyl 
polyoxyethylenes (e.g., Triton X-100). Dimethylamine-N-oxide detergents 
include, e.g., C10-DAO; DDAO; LDAO. Cholate-derived detergents include, 
e.g., Deoxy-Big CHAP, digitonin. Lipid-like detergents include phosphocholine 
compounds. Suitable detergents further include zwitter-ionic detergents (e.g., 

10 ZWITTERGENT 3-10; 2WITTERGENT 3-12); and ionic detergents (e.g., SDS). 

Crystallization of macromolecules has been performed at temperatures 
ranging from 60°C to less than 0°C. However, most molecules can be 
crystallized at 4°C or 22°C. Lower temperatures promote stabilization of 
polypeptides and inhibit bacterial growth. In general, polypeptides are more 

15 soluble in salt solutions at lower temperatures (e.g., 4°C), but less soluble in 
PEG and MPD solutions at lower temperatures. To allow crystallization at 4°C 
or 22°C, the precipitant or protein concentration can be increased or decreased 
as required. Heating, melting, and cooling of crystals or aggregates can be 
used to enlarge crystals. In addition, crystallization at both 4°C and 22°C can 

20 be assessed (A. McPherson, 1992, J. Cryst Growth. 122:161-167; C.W. 
Carter, Jr. and C.W. Carter, 1979, J. Biol. Chem. 254:12219-12223; T. 
Bergfors, 1993, Crystalization Lab Manual). 

A crystallization protocol can be adapted to a particular polypeptide or 
peptide. In particular, the physical and chemical properties of the polypeptide 

25 can be considered (e.g., aggregation, stability, adherence to membranes or 
tubing, internal disulfide linkages, surface cysteines, chelating ions, etc.). For 
initial experiments, the standard set of crystalization reagents can be used 
(Hampton Research, Laguna Niguel, CA). In addition, the CRYSTOOL 
program can provide guidance in determining optimal crystallization conditions 

30 (Brent Segelke, 1 995, Efficiency analysis of sampling protocols used in protein 
crystallization screening and crystal structure from two novel crystal forms of 
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PLA2, Ph.D. Thesis, University of California, San Diego). Exemplary 
crystallization conditions are shown below (see Berry, 1995). 



Major Precipitant 


Additive 


Concentration of 
Major Precipitant 


Concentration 
of Additive 


(NH 4 ) 2 S0 4 


PEG 400-2000, MPD, 
ethanol, or methanol 


2.0-4.0 M 


6%-0.5% 


Na citrate 


PEG 400-2000, MPD, 
ethano!, or methanol 


1.4-1.8 M 


6%-0.5% 


PEG 1000-20000 


(NH4) 2 S0 4 , NaCI, or Na 
formate 


40-50% 


0.2-0.6 M 



5 Robots can be used for automatic screening and optimization of 

crystallization conditions. For example, the IMPAX and Oryx systems can be 
used (Douglas Instruments, Ltd., East Garston, United Kingdom). The 
CRYSTOOL program (Segelke, supra) can be integrated with the robotics 
programming, in addition, the Xact program can be used to construct, 

10 maintain, and record the results of various crystallization experiments (see, 
e.g., D.E. Brodersen et al., 1999, J. AppL Cryst. 32: 1012-1016; G.R. Andersen 
and J. Nyborg, 1996, J. AppL Cryst 29:236-240). The Xact program supports 
multiple users and organizes the results of crystallization experiments into 
hierarchies. Advantageously, Xact is compatible with both CRYSTOOL and 

1 5 Microsoft® Excel programs. 

Four methods are commonly employed to crystallize macromolecules: 
vapor diffusion, free interface diffusion, batch, and dialysis. The vapor 
diffusion technique is typically performed by formulating a 1:1 mixture of a 
solution comprising the polypeptide of interest and a solution containing the 

20 precipitant at the final concentration that is to be achieved after vapor 
equilibration. The drop containing the 1:1 mixture of protein and precipitant is 
then suspended and sealed over the well solution, which contains the 
precipitant at the target concentration, as either a hanging or sitting drop. 
Vapor diffusion can be used to screen a large number of crystallization 

25 conditions or when small amounts of polypeptide are available. For screening, 
drop sizes of 1 to 2 jnl can be used. Once preliminary crystallization conditions 
have been determined, drop sizes such as 10 can be used. Notably, results 
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from hanging drops may be improved with agarose gels (see K. Provost and 
M.-C. Robert, 1991, J. Cryst. Growth. 110:258-264). 

Free interface diffusion is performed by layering of a low density 
solution onto one of higher density, usually in the form of concentrated protein 
5 onto concentrated salt. Since the solute to be crystallized must be 
concentrated, this method typically requires relatively large amounts of protein. 
However, the method can be adapted to work with small amounts of protein. 
In a representative experiment, 2 to 5 u.l of sample is pipetted into one end of 
a 20 jliI microcapillary pipet. Next, 2 to 5 ut of precipitant is pipetted into the 
10 capillary without introducing an air bubble, and the ends of the pipet are 
sealed. With sufficient amounts of protein, this method can be used to obtain 
relatively large crystals (see, e.g., S.M. Althoff et al., 1988, J. Mol. Biol. 
199:665-666). 

The batch technique is performed by mixing concentrated polypeptide 

15 with concentrated precipitant to produce a final concentration that is 
supersaturated for the solute macromolecule. Notably, this method can 
employ relatively large amounts of solution (e.g., milliliter quantities), and can 
produce large crystals. For that reason, the batch technique is not 
recommended for screening initial crystallization conditions. 

20 The dialysis technique is performed by diffusing precipitant molecules 

through a semipermeable membrane to slowly increase the concentration of 
the solute inside the membrane. Dialysis tubing can be used to dialyze milliliter 
quantities of sample, whereas dialysis buttons can be used to dialyze microliter 
quantities (e.g., 7-200 uJ). Dialysis buttons may be constructed out of glass, 

25 perspex, or Teflon™ (see, e.g., Cambridge Repetition Engineers Ltd., Greens 
Road, Cambridge CB4 3EQ, UK; Hampton Research). Using this method, the 
precipitating solution can be varied by moving the entire dialysis button or sack 
into a different solution. In this way, polypeptides can be "reused" until the 
correct conditions for crystallization are found (see, e.g., C.W. Carter, Jr. et al., 

30 1988, J. Cryst Growth. 90:60-73). However, this method is not recommended 
for precipitants comprising concentrated PEG solutions. 
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Various strategies have been designed to screen crystallization 
conditions, including 1) pi screening; 2) grid screening; 3) factorials; 4) 
solubility assays; 5) perturbation; and 6) sparse matrices. In accordance with 
the pi screening method, the pi of a polypeptide is presumed to be its 
5 crystallization point. Screening at the pi can be performed by dialysis against 
low concentrations of buffer (less than 20 mM) at the appropriate pH, or by use 
of conventional precipitants. 

The grid screening method can be performed on two-dimensional 
matrices. Typically, the precipitant concentration is plotted against pH. The 

10 optimal conditions can be determined for each axis, and then combined. At 
that point, additional factors can be tested (e.g., temperature, additives). This 
method works best with fast-forming crystals, and can be readily automated 
(see MJ. Cox and P.C. Weber, 1988, J. Cryst Growth. 90:318-324). Grid 
screens are commercially available for popular precipitants such as ammonium 

15 sulphate, PEG 6000, MPD, PEG/LiCI, and NaCl (see, e.g., Hamilton 
Research). 

The incomplete factorial method can be performed by 1 ) selecting a set 
of -20 conditions; 2) randomly assigning combinations of these conditions; 3) 
grading the success of the results of each experiment using an objective scale; 

20 and 4) statistically evaluating the effects of each of the conditions on crystal 
formation (see, e.g., C.W. Carter, Jr. et a!., 1988, J. Cryst Growth. 90:60-73). 
In particular, conditions such as pH, temperature, precipitating agent, and 
cations can be tested. Dialysis buttons are preferably used with this method. 
Typically, optimal conditions/combinations can be determined within 35 tests. 

25 Similar approaches, such as "footprinting" conditions, may also be employed 
(see, e.g., E.A. Stura et al., 1991, J. Cryst. Growth. 110:1-2). 

The perturbation approach can be performed by altering crystallization 
conditions by introducing a series of additives designed to test the effects of 
altering the structure of bulk solvent and the solvent dielectric on crystal 

30 formation (see, e.g., Whitaker et al., 1995, Biochem. 34:8221-8226). Additives 
for increasing the solvent dialectric include, but are not limited to, NaCl, KCI, 
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or LiCI (e.g., 200 mM); Na formate (e.g., 200 mM); Na 2 HP0 4 or K 2 HP0 4 (e.g., 
200 mM); urea, triachloroacetate, guanidium HCI, or KSCN (e.g., 20-50 mM). 
A non-limiting list of additives for decreasing the solvent dialectric include 
methanol, ethanol, isopropanol, or tert-butanol (e.g., 1-5%); MPD (e.g., 1%); 
5 PEG 400, PEG 600, or PEG 1000 (e.g., 1-4%); PEG MME (monomethylether) 
550, PEG MME 750, PEG MME 2000 (e.g., 1-4%). 

As an alternative to the above-screening methods, the sparse matrix 
approach can be used (see, e.g., J. Jancarik and S.-H.J. Kim, 1991, Appf. 
Cryst 24:409-411; A. McPherson, 1992, J. Cryst Growth. 122:161-167; B. 

10 Cudney et aL, 1994, Acta. Cryst. 050:414-423). Sparse matrix screens are 
commercially available (see, e.g., Hampton Research; Molecular Dimensions, 
Inc., Apopka, FL; Emerald Biostructures, Inc., Lemont, IL). Notably, data from 
Hampton Research sparse matrix screens can be stored and analyzed using 
ASPRUN software (Douglas Instruments). 

1 5 Exemplary conditions for an initial screen are shown below (see Berry, 



1995). 

TABLE 1 

Tray 1: 



PEG 8000 (wells 1-6) 


Ammonium sulfate (wells 7-12) 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


20% 


20% 


20% 


35% 


35% 


35% 


2.0 M 


2.0 M 


2.0 M 


2.5 M 


2.5 M 


2.5 M 


pH 5.0 


pH 7.0 


pH 8.6 


PH5.0 


pH 7.0 


pH 8.6 


pH 5.0 


PH7.0 


pH 8.8 


pH 5.0 


pH 7.0 


pH 8.8 


MPD (weNs 13-16) 


Na Citrate (we\\s 17-20) 


Na/K Phosphate (wells 21-24) 


13 


14 


15 


16 


17 


18 


19 


20 


21 


22 


23 


24 


30% 


30% 


50% 


50% 


1.3 M 


1.3 M 


1.5 M 


1.5 M 


2.0 M 


2.0 M 


2.5 M 


2.5 M 


pH 5.8 


pH 7.6 


PH5.8 


pH 7.6 


pH 5.8 


pH 7.5 


pH 5.8 


pH7.5 


pH 6.0 


pH 7.4 


pH 6.0 


pH 7.4 



20 Tray 2: 



PEG 2000 


MME/0.2 M Ammon. sulfate (wells 25-30) 


25 


26 


27 


28 


29 


30 


25% 
pH 5.5 


25% 
pH 7.0 


25% 
pH 8.5 


40% 
pH5.5 


40% 
pH7.0 


40% 
pH 8.5 


Random for wells 31 to 48 



The initial screen can be used with hanging or sitting drops. To 
conserve the sample, tray 2 can be set up several weeks following tray 1 . 
Wells 31-48 of tray 2 can comprise a random set of solutions. Alternatively, 
25 solutions can be formulated using sparse methods. Preferably, test solutions 
cover a broad range of precipitants, additives, and pH (especially pH 5.0-9.0). 
Seeding can be used to trigger nucleation and crystal growth (Stura and 
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Wilson, 1990, J. Cryst Growth. 110:270-282; C. Thaller et al., 1981, J. Mol. 
Biol. 147:465-469; A. McPherson and P. Schlichta, 1988, J. Cryst Growth. 
90:47-50). In general, seeding can performed by transferring crystal seeds into 
a polypeptide solution to allow polypeptide molecules to deposit on the surface 
5 of the seeds and produce crystals. Two seeding methods can be used: 
microseeding and macroseeding. For microseeding, a crystal can be ground 
into tiny pieces and transferred into the protein solution. Alternatively, seeds 
can be transferred by adding 1-2 \i\ of the seed solution directly to the 
equilibrated protein solution. In another approach, seeds can be transferred 

10 by dipping a hair in the seed solution and then streaking the hair across the 
surface of the drop (streak seeding; see Stura and Wilson, supra). For 
macroseeding, an intact crystal can be transferred into the protein solution 
(see, e.g., C. Thaller et al., 1981, J. Mol. Biol. 147:465-469). Preferably, the 
surface of the crystal seed is washed to regenerate the growing surface prior 

15 to being transferred. Optimally, the protein solution for crystallization is close 
to saturation and the crystal seed is not completely dissolved upon transfer. 
Antibodies 

An isolated Gene 216 polypeptide or a portion or fragment thereof, can 
be used as an immunogen to generate anti-Gene 216 antibodies using 

20 standard techniques for polyclonal and monoclonal antibody preparation. The 
full-length Gene 216 polypeptide can be used or, alternatively, the invention 
provides antigenic peptide fragments of Gene 216 for use as immunogens. 
The antigenic peptide of Gene 216 comprises at least 5 amino acid residues 
of the amino acid sequence shown in SEQ ID NO:4, and encompasses an 

25 epitope of Gene 21 6 such that an antibody raised against the peptide forms a 
specific immune complex with Gene 216 amino acid sequence. 

Accordingly, another aspect of the invention pertains to anti-Gene 216 
antibodies. The invention provides polyclonal and monoclonal antibodies that 
bind Gene 216 polypeptides or peptides. The term "monoclonal antibody" or 

30 "monoclonal antibody composition", as used herein, refers to a population of 
antibody molecules that contain only one species of an antigen binding site 
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capable of immune-reacting with a particular epitope of a Gene 216 polypeptide 
or peptide. A monoclonal antibody composition thus typically displays a single 
binding affinity for a particular Gene 216 polypeptide or peptide with which it 
immunoreacts. 

5 A Gene 216 immunogen typically is used to prepare antibodies by 

immunizing a suitable subject, (e.g., rabbit, goat, mouse, or other non-human 
mammal) with the immunogen. An appropriate immunogenic preparation can 
contain, for example, recombinantly expressed Gene 216 polypeptide or a 
chemically synthesized Gene 216 polypeptide, or fragments thereof. The 

10 preparation can further include an adjuvant, such as Freund's complete or 
incomplete adjuvant, or similar immunostimulatory agent. Immunization of a 
suitable subject with an immunogenic Gene 216 preparation induces a 
polyclonal anti-Gene 216 antibody response. 

A number of adjuvants are known and used by those skilled in the art. 

15 Non-limiting examples of suitable adjuvants include incomplete Freund's 
adjuvant, mineral gels such as alum, aluminum phosphate, aluminum 
hydroxide, aluminum silica, and surface-active substances such as lysolecithin, 
pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, and dinitrophenol. Further examples of adjuvants include N- 

20 acetyl-muramy!-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L- 
alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP), N-acetylmuramyl- 
Lalanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3 
hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE), and 
RIBI, which contains three components extracted from bacteria, 

25 monophosphoryl lipid A, trehalose dimycolate and cell wall skeleton 
(MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion. A particularly useful 
adjuvant comprises 5% (wt/vol) squalene, 2.5% Pluronic L121 polymer and 
0.2% polysorbate in phosphate buffered saline (Kwak et aL, 1992, New Eng. 
J. Med. 327:1209-1215). Preferred adjuvants include complete BCG, Detox, 

30 (RIBI, Immunochem Research Inc.), ISCOMS, and aluminum hydroxide 
adjuvant (Superphos, Biosector). The effectiveness of an adjuvant may be 
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determined by measuring the amount of antibodies directed against the 
immunogenic peptide. 

Polyclonal anti-Gene 216 antibodies can be prepared as described 
above by immunizing a suitable subject with a Gene 216 immunogen. The 
5 anti-Gene 216 antibody titer in the immunized subject can be monitored over 
time by standard techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized Gene 216. If desired, the antibody molecules 
directed against Gene 216 can be isolated from the mammal (e.g., from the 
blood) and further purified by well-known techniques, such as protein A 

10 chromatography to obtain the IgG fraction. 

At an appropriate time after immunization, e.g., when the anti-Gene 216 
antibody titers are highest, antibody-producing cells can be obtained from the 
subject and used to prepare monoclonal antibodies by standard techniques, 
such as the hybridoma technique (see Kohler and Milstein, 1975, Nature 

15 256:495-497; Brown et al., 1981 , J. Immunol. 127:539-46; Brown et al. T 1980, 
J. Biol Chem. 255:4980-83; Yeh et al., 1976, PNAS 76:2927-31; and Yeh et 
al., 1982, Int. J. Cancer 29:269-75), the human B cell hybridoma technique 
(Kozbor et al., 1983, Immunol. Today 4:72), the EBV-hybridoma technique 
(Cole et al., 1985, Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 

20 Inc., pp. 77-96) or trioma techniques. 

The technology for producing hybridomas is well-known (see generally 
R. H. Kenneth, 1980, Monoclonal Antibodies: A New Dimension In Biological 
Analyses, Plenum Publishing Corp., New York, NY; E.A. Lerner, 1981, Yale J. 
Biol. Med., 54:387-402; M.L. Gefter et al., 1977, Somatic Cell Genet 3:231- 

25 36). In general, an immortal cell line (typically a myeloma) is fused to 
lymphocytes (typically splenocytes) from a mammal immunized with a Gene 
216 immunogen as described above, and the culture supernatants of the 
resulting hybridoma cells are screened to identify a hybridoma producing a 
monoclonal antibody that binds Gene 216 polypeptides or peptides. 

30 Any of the many well known protocols used for fusing lymphocytes and 

immortalized cell lines can be applied for the purpose of generating an anti- 
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Gene 216 monoclonal antibody (see, e.g., G. Galfre et a!., 1977, Nature 
266:55052; Getter et aL, 1977; Lerner, 1981; Kenneth, 1980). Moreover, the 
ordinarily skilled worker will appreciate that there are many variations of such 
methods. Typically, the immortal eel! line (e.g., a myeloma cell line) is derived 
5 from the same mammalian species as the lymphocytes. For example, murine 
hybridomas can be made by fusing lymphocytes from a mouse immunized with 
an immunogenic preparation of the present invention with an immortalized 
mouse cell line. Preferred immortal cell lines are mouse myeloma cell lines 
that are sensitive to culture medium containing hypoxanthine, aminopterin, and 

1 0 thymidine (HAT medium). Any of a number of myeloma cell lines can be used 
as a fusion partner according to standard techniques, e.g., the P3-NS1/1-Ag4- 
1 , P3-x63-Ag8.653, or Sp2/0-Ag14 myeloma lines. These myeloma lines are 
available from ATCC (American Type Culture Collection, Manassas, VA). 
Typically, HAT-sensitive mouse myeloma ceils are fused to mouse splenocytes 

15 using polyethylene glycol (PEG). Hybridoma cells resulting from the fusion arc 
then selected using HAT medium, which kills unfused and unproductively fused 
myeloma cells (unfused splenocytes die after several days because they are 
not transformed). Hybridoma cells producing a monoclonal antibody of the 
invention are detected by screening the hybridoma culture supernatants for 

20 antibodies that bind Gene 216 polypeptides or peptides, e.g., using a standard 
ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal anti-Gene 216 antibody can be identified and isolated by screening 
a recombinant combinatorial immunoglobulin library (e.g., an antibody phage 

25 display library) with Gene 216 to thereby isolate immunoglobulin library 
members that bind Gene 216. Kits for generating and screening phage display 
libraries are commercially available (e.g., the Pharmacia Recombinant Phage 
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ 
Phage Display Kit, Catalog No. 240612). 

30 Additionally, examples of methods and reagents particularly amenable 

for use in generating and screening antibody display library can be found in, 
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for example, Ladner et ai. U.S. Pat. No. 5,223,409; Kang et ai. PCT 
International Publication No. WO 92/18619; Dower et al. PCT International 
Publication No. WO 91/17271 ; Winter et al. PCT international Publication WO 
92/20791; Markland et al. PCT International Publication No. WO 92/15679; 
5 Breitling et al. PCT International Publication WO 93/01288; McCafferty et al. 
PCT International Publication No. WO 92/01047; Garrard et al. PCT 
International Publication No. WO 92/09690; Ladner et al. PCT International 
Publication No. WO 90/02809; Fuchs et al., 1991, Bio/Technology 9:1 370- 
1372; Hay et al., 1992, Hum. Antibod. Hybridomas 3:81-85; Huse et al., 1989, 

10 Science 246:1275-1281; Griffiths et al., 1993, EMBO J 12:725-734; Hawkins 
et al., 1992, J. Mol. Biol. 226:889-896; Clarkson et al., 1991 , Nature 352:624- 
628; Gram et al., 1992, PNAS 89:3576-3580; Garrad et al., 1991, 
Bio/Technology 9:1373-1377; Hoogenboom et all, 1991, Nuc. Acid Res. 
19:4133-4137; Barbas et al., 1991 , PNAS 88:7978-7982; and McCafferty et al., 

15 1990, Nature 348:552-55. 

Additionally, recombinant anti-Gene 216 antibodies, such as chimeric 
and humanized monoclonal antibodies, comprising both human and non- 
human portions, which can be made using standard recombinant DNA 
techniques, are within the scope of the invention. Such chimeric and 

20 humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in Robinson 
et al. International Application No. PCT/US86/02269; Akira, et al. European 
Patent Application 184,187; Taniguchi, M., European Patent Application 
171,496; Morrison et ai. European Patent Application 173,494; Neuberger et 

25 al. PCT International Publication No. WO 86/01 533; Cabiily et al. U.S. Pat. No. 
4,816,567; Cabiily et al. European Patent Application 125,023; Better et al., 
1988, Science 240:1041-1043; Liu et al., 1987, PNAS 84:3439-3443; Liu et al., 
1987, J. Immunol. 139:3521-3526; Sun et al., 1987, PNAS 84:214-218; 
Nishimura et al., 1987, Cane. Res. 47:999-1005; Wood et ai., 1985, Nature 

30 314:446-449; and Shaw et al., 1988, J. NatL Cancer Inst. 80:1553-1559; S.L. 
Morrison, 1985, Science 229:1202-1207; Oi etal., 1986, BioTechniques 4:214; 
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Winter U.S. Pat. No. 5,225,539; Jones et al., 1986, Nature 321:552-525; 
Verhoeyan et al., 1988, Science 239:1534; and Bcidler et a!., 1988, J. 
Immunol. 141:4053-4060. 

An anti-Gene 216 antibody (e.g., monoclonal antibody) can be used to 
5 isolate Gene 216 by standard techniques, such as affinity chromatography or 
immunoprecipitation. An anti-Gene 216 antibody can also facilitate the 
purification of natural Gene 216 polypeptide from cells and of recombinantly 
produced Gene 216 polypeptides or peptides expressed in host cells. Further, 
an anti-Gene 216 antibody can be used to detect Gene 216 protein (e.g., in a 

10 cellular lysate or cell supernatant) in order to evaluate the abundance and 
pattern of expression of the Gene 216 protein. Anti-Gene 216 antibodies can 
be used diagnostically to monitor protein levels in tissue as part of a clinical 
testing procedure, e.g., to, for example, determine the efficacy of a given 
treatment regimen as described in detail herein. In addition, and anti-Gene 

15 216 antibody can be used as therapeutics for the treatment of diseases related 
to abnormal Gene 216 expression or function, e.g., asthma. 
Ligands 

The Gene 216 polypeptides, polynucleotides, variants, or fragments 
thereof, can be used to screen for ligands (e.g., agonists, antagonists, or 

20 inhibitors) that modulate the levels or activity of the Gene 216 polypeptide. In 
addition, these Gene 216 molecules can be used to identify endogenous 
ligands that bind to Gene 216 polypeptides or polynucleotides in the cell. In 
one aspect of the present invention, the full-length Gene 216 polypeptide (e.g., 
SEQ ID NO:4) is used to identify ligands. Alternatively, variants or fragments 

25 of a Gene 216 polypeptide are used. Such fragments may comprise, for 
example, one or more domains of the Gene 216 polypeptide (e.g., the pre-, 
pro-, catalytic, cysteine-rich, disintegrin, EGF, transmembrane, and cytoplasmic 
domains) disclosed herein. Of particular interest are screening assays that 
identify agents that have relatively low levels of toxicity in human cells. A wide 

30 variety of assays may be used for this purpose, including in vitro protein-protein 
binding assays, electrophoretic mobility shift assays, immunoassays, and the 
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like. 

The term "ligand" as used herein describes any molecule, protein, 
peptide, or compound with the capability of directly or indirectly altering the 
physiological function, stability, or levels of the Gene 216 polypeptide. Ligands 
5 that bind to the Gene 216 polypeptides or polynucleotides of the invention are 
potentially useful in diagnostic applications and/or pharmaceutical 
compositions, as described in detail herein. Ligands may encompass 
numerous chemical classes, though typically they are organic molecules, 
preferably small organic compounds having a molecular weight of more than 

10 50 and less than about 2,500 daltons. Such ligands can comprise functional 
groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl 
group, preferably at least two of the functional chemical groups. Ligands often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or 

1 5 polyaromatic structures substituted with one or more of the above functional 
groups. Ligands can also comprise biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural 
analogs, or combinations thereof. 

Ligands may include, for example, 1) peptides such as soluble peptides, 

20 including Ig-tailed fusion peptides and members of random peptide libraries 
(see, e.g., Lam et al., 1991, Nature 354:82-84; Houghten et al., 1991, Nature 
354:84-86) and combinatorial chemistry-derived molecular libraries made of D- 
and/or L-configu ration amino acids; 2) phosphopeptides (e.g., members of 
random and partially degenerate, directed phosphopeptide libraries, see, e.g., 

25 Songyang et a(, 1993, Cell 72:767-778); 3) antibodies (e.g., polyclonal, 
monoclonal, humanized, anti-idiotypic, chimeric, and single chain antibodies 
as well as Fab, F(ab') 2 , Fab expression library fragments, and epitope-binding 
fragments of antibodies); and 4) small organic and inorganic molecules. 

Ligands can be obtained from a wide variety of sources including 

30 libraries of synthetic or natural compounds. Synthetic compound libraries are 
commercially available from, for example, Maybridge Chemical Co. (Trevillet, 



-61 - 



WO 02/083077 



PCT/US02/12063 



Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates (Merrimack, 
NH), and Microsource (New Milford, CT). A rare chemical library is available 
from Aldrich Chemical Company, Inc. (Milwaukee, Wl). Natural compound 
libraries comprising bacterial, fungal, plant or animal extracts are available 
5 from, for example, Pan Laboratories (Bothell, WA). In addition, numerous 
means are available for random and directed synthesis of a wide variety of 
organic compounds and biomolecules, including expression of randomized 
oligonucleotides. 

Alternatively, libraries of natural compounds in the form of bacterial, 

10 fungal, plant and animal extracts can be readily produced. Methods for the 
synthesis of molecular libraries are readily available (see, e.g., DeWitt et al., 
1993, Proc. Natl. Acad. Sci. USA 90:6909; Erb et al., 1994, Proc. Natl. Acad. 
Sci. USA 91:1 1422; Zuckermann et al., 1994, J. Med. Chem. 37:2678; Cho et 
al., 1993, Science 261:1303; Carell et al., 1994, Angew. Chem. Int. Ed. Engl. 

15 33:2059; Carell et al., 1994, Angew. Chem. Int. Ed. Engl. 33:2061; and in 
Gallop et al., 1994, J. Med, Chem. 37:1233). In addition, natural or synthetic 
compound libraries and compounds can be readily modified through 
conventional chemical, physical and biochemical means (see, e.g., Blondelle 
et al., 1996, Trends in Biotech. 14:60), and may be used to produce 

20 combinatorial libraries. In another approach, previously identified 
pharmacological agents can be subjected to directed or random chemical 
modifications, such as acylation, alkylation, esterification, amidification, and the 
analogs can be screened for Gene 216-moduIating activity. 

Numerous methods for producing combinatorial libraries are known in 

25 the art, including those involving biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the 'one-bead one-compound' library method; and 
synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to polypeptide libraries, while the other four 

30 approaches are applicable to polypeptide, non-peptide oligomer, or small 
molecule libraries of compounds (K. S. Lam, 1997, Anticancer Drug Des. 
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12:145). 

Libraries may be screened in solution (e.g., Houghten, 1992, 
Biotechniques 13:412-421 ), or on beads (Lam, 1991 , Nature 354:82-84), chips 
(Fodor, 1993, Nature 364:555-556), bacteria or spores (Ladner U.S. Pat. No. 
5 5,223,409), plasmids (Cull et al., 1992, Proc. Natl. Acad. Sci. USA 89:1865- 
1869), or on phage (Scott and Smith, 1990, Science 249:386-390; Devlin, 
1990, Science 249:404-406; Cwirla et al., 1990, Proc. Natl. Acad. Sci. USA 
97:6378-6382; Felici, 1991, J. Mol. Biol. 222:301-310; Ladner, supra). 

Where the screening assay is a binding assay, a Gene 216 polypeptide, 

1 0 polynucleotide, analog, or fragment thereof, may be joined to a label, where the 
label can directly or indirectly provide a detectable signal. Various labels 
include radioisotopes, fluorescers, chemiluminescers, enzymes, specific 
binding molecules, particles, e.g. magnetic particles, and the like. Specific 
binding molecules include pairs, such as biotin and streptavidin, digoxin and 

15 antidigoxin, etc. For the specific binding members, the complementary 
member would normally be labeled with a molecule that provides for detection, 
in accordance with known procedures. 

A variety of other reagents may be included in the screening assay. 
These include reagents like salts, neutral proteins, e.g. albumin, detergents, 

20 etc., that are used to facilitate optimal protein-protein binding and/or reduce 
non-specific or background interactions. Reagents that improve the efficiency 
of the assay, such as protease inhibitors, nuclease inhibitors, anti-microbial 
agents, etc., may be used. The components are added in any order that 
produces the requisite binding. Incubations are performed at any temperature 

25 that facilitates optimal activity, typically between 4° and 40°C. Incubation 
periods are selected for optimum activity, but may also be optimized to 
facilitate rapid high-throughput screening. Normally, between 0.1 and 1 hr will 
be sufficient. In general, a plurality of assay mixtures is run in parallel with 
different agent concentrations to obtain a differential response to these 

30 concentrations. Typically, one of these concentrations serves as a negative 
control, i.e. at zero concentration or below the level of detection. 
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To perform cell-free ligand screening assays, it may be desirable to 
immobilize either the Gene 216 polypeptide, polynucleotide, or fragment to a 
surface to facilitate identification of ligands that bind to these molecules, as 
well as to accommodate automation of the assay. For example, a fusion 
5 protein comprising a Gene 216 polypeptide and an affinity tag can be 
produced. In one embodiment, a glutathione-S-transferase/phosphodiesterase 
fusion protein comprising a Gene 216 polypeptide is adsorbed onto glutathione 
sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione-derivatized 
microtiter plates. Cell lysates (e.g., containing 35 S-Iabeled polypeptides) are 

10 added to the Gene 216-coated beads under conditions to allow complex 
formation (e.g., at physiological conditions for salt and pH). Following 
incubation, the Gene 216-coated beads are washed to remove any unbound 
polypeptides, and the amount of immobilized radiolabel is determined. 
Alternatively, the complex is dissociated and the radiolabel present in the 

15 supernatant is determined, in another approach, the beads are analyzed by 
SDS-PAGE to identify Gene 216-binding polypeptides. 

Ligand-binding assays can be used to identify agonist or antagonists 
that alter the function or levels of the Gene 216 polypeptide. Such assays are 
designed to detect the interaction of test agents with Gene 21 6 polypeptides, 

20 polynucleotides, analogs, or fragments thereof. Interactions may be detected 
by direct measurement of binding. Alternatively, interactions may be detected 
by indirect indicators of binding, such as stabilization/destabilization of protein 
structure, or activation/inhibition of biological function. Non-limiting examples 
of useful ligand-binding assays are detailed below. 

25 Ligands that bind to Gene 216 polypeptides, polynucleotides, analogs, 

or fragments thereof, can be identified using real-time Bimolecular Interaction 
Analysis (BIA; Sjolander et al., 1 991 , Anal. Chem. 63:2338-2345; Szabo et al., 
1995, Curr. Opin. Struct Biol. 5:699-705). BIA-based technology (e.g., 
BIAcore™; LKB Pharmacia, Sweden) allows study of biospecific interactions 

30 in real time, without labeling. In BIA, changes in the optical phenomenon 
surface plasmon resonance (SPR) is used determine real-time interactions of 
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biological molecules. 

Ligands can also be identified by scintillation proximity assays (SPA, 
described in U.S. Patent No. 4,568,649). In a modification of this assay that 
is currently undergoing development, chaperonins are used to distinguish 
5 folded and unfolded proteins. A tagged protein Is attached to SPA beads, and 
test agents are added. The bead is then subjected to mild denaturing 
conditions (such as, e.g., heat, exposure to SDS, etc.) and a purified labeled 
chaperonin is added. If a test agent binds to a target, the labeled chaperonin 
will not bind; conversely, if no test agent binds, the protein will undergo some 

10 degree of denaturation and the chaperonin will bind. 

Ligands can also be identified using a binding assay based on 
mitochondrial targeting signals (Hurt et al., 1985, EMBO J. 4:2061-2068; Eilers 
and Schatz, 1986, Nature 322:228-231). In a mitochondrial import assay, 
expression vectors are constructed in which nucleic acids encoding particular 

1 5 target proteins are inserted downstream of sequences encoding mitochondrial 
import signals. The chimeric proteins are synthesized and tested for their 
ability to be imported into isolated mitochondria in the absence and presence 
of test compounds. A test compound that binds to the target protein should 
inhibit its uptake into isolated mitochondria in vitro. 

20 The ligand-binding assay described in Fodor et al., 1991, Science 

251:767-773, which involves testing the binding affinfty of test compounds for 
a plurality of defined polymers synthesized on a solid substrate, can also be 
used. 

Ligands that bind to Gene 216 polypeptides or peptides can be 
25 identified using two-hybrid assays (see, e.g., U.S. Pat. No. 5,283,317; Zervos 
et al., 1993, Ce// 72:223-232; Madura et al., 1993, J. Biol. Chem. 268:12046- 
12054; Bartel et al., 1993, Biotechniques 14:920-924; Iwabuchi et al., 1993, 
Oncogene 8:1693-1696; and Brent WO 94/10300). The two-hybrid system 
relies on the reconstitution of transcription activation activity by association of 
30 the DNA-binding and transcription activation domains of a transcriptional 
activator through protein -protein interaction. The yeast GAL4 transcriptional 
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activator may be used in this way, although other transcription factors have 
been used and are well known in the art. To carryout the two-hybrid assay, the 
GAL4 DNA-binding domain, and the GAL4 transcription activation domain are 
expressed, separately, as fusions to potential interacting polypeptides. 
5 In one embodiment, the "bait" protein comprises a Gene 216 

polypeptide fused to the GAL4 DNA-binding domain. The "fish" protein 
comprises, for example, a human cDNA library encoded polypeptide fused to 
the GAL4 transcription activation domain. If the two, coexpressed fusion 
proteins interact in the nucleus of a hqst cell, a reporter gene (e.g. LacZ) is 

10 activated to produce a detectable phenotype. The host cells that show two- 
hybrid interactions can be used to isolate the containing plasmids containing 
the cDNA library sequences. These plasmids can be analyzed to determine 
the nucleic acid sequence and predicted polypeptide sequence of the 
candidate ligand. Alternatively, methods such as the three-hybrid (Licitra et al., 

15 1996, Proa Natl. Acad. ScL USA 93:12817-12821), and reverse two-hybrid 
(Vidal et al., 1996, Proc. Natl. Acad. ScL USA 93:10315-10320) systems may 
be used. Commercially available two-hybrid systems such as the CLONTECH 
Matchmaker™ systems and protocols (CLONTECH Laboratories, Inc., Palo 
Alto, CA) may be also be used (see also, A.R. Mendelsohn et al., 1994, Cum 

20 Op. Biotech. 5:482; E.M. Phizicky et al., 1995, Microbiological Rev. 59:94; M. 
Yang et al., 1995, Nucleic Acids Res. 23:1152; S. Fields et al., 1994, Trends 
Genet 10:286; and U.S. Patent No. 6,283,173 and 5,468,614). 

Several methods of automated assays have been developed in recent 
years so as to permit screening of tens of thousands of test agents in a short 

25 period of time. High-throughput screening methods are particularly preferred 
for use with the present invention. The ligand-binding assays described herein 
can be adapted for high-throughput screens, or alternative screens may be 
employed. For example, continuous format high throughput screens (CF-HTS) 
using at least one porous matrix allows the researcher to test large numbers 

30 of test agents for a wide range of biological or biochemical activity (see United 
States Patent No. 5,976,813 to Beutel et al.). Moreover, CF-HTS can be used 
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to perform multi-step assays. 
Diagnostics 

As discussed herein, chromosomal region 20p13-p12 has been 
genetically linked to a variety of diseases and disorders, including asthma. The 
5 present invention provides nucleic acids and antibodies that can be useful in 
diagnosing individuals with aberrant Gene 216 expression. In particular, the 
disclosed SNPs, alleles, and haplotypes can be used to diagnose 
chromosomal abnormalities linked to these diseases. 

Antibody-based diagnostic methods : In a further embodiment of the 
10 present invention, antibodies which specifically bind to the Gene 216 
polypeptide may be used for the diagnosis of conditions or diseases 
characterized by underexpression or overexpression of the Gene 216 
polynucleotide or polypeptide, or in assays to monitor patients being treated 
with a Gene 216 polypeptide or peptide, or a Gene 216 agonist, antagonist, or 
15 inhibitor. 

The antibodies useful for diagnostic purposes may be prepared in the 
same manner as those for use in therapeutic methods, described herein. 
Antibodies may be raised to the full-length Gene 216 polypeptide sequence 
(e.g., SEQ ID NO:4). Alternatively, the antibodies may be raised to fragments 

20 or variants of the Gene 216 polypeptide. In one aspect of the invention, 
antibodies are prepared to bind to a Gene 216 polypeptide fragment 
comprising one or more domains of the Gene 216 polypeptide (e.g., pre-, pro-, 
catalytic, disintegrin, cysteine-rich, EGF, transmembrane, and cytoplasmic 
domains) described herein. 

25 Diagnostic assays for the Gene 21 6 polypeptide include methods that 

utilize the antibody and a label to detect the protein in biological samples (e.g., 
human body fluids, cells, tissues, or extracts of cells or tissues). The 
antibodies may be used with or without modification, and may be labeled by 
joining them, either covalently or non-covalently, with a reporter molecule. A 

30 wide variety of reporter molecules that are known in the art may be used, 
several of which are described herein. 
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The invention provides methods for detecting disease-associated 
antigenic components in a biological sample, which methods comprise the 
steps of: 1) contacting a sample suspected to contain a disease-associated 
antigenic component with an antibody specific for an disease-associated 
5 antigen, extracellular or intracellular, under conditions in which an antigen- 
antibody complex can form between the antibody and disease-associated 
antigenic components in the sample; and 2) detecting any antigen-antibody 
complex formed in step (1) using any suitable means known in the art, wherein 
the detection of a complex indicates the presence of disease-associated 

10 antigenic components in the sample. It will be understood that assays that 
utilize antibodies directed against altered Gene 216 amino acid sequences 
(i.e., epitopes encoded by SNP-related alleles or haplotypes, or mutations, or 
other variants) are within the scope of the invention. 

Many immunoassay formats are known in the art, and the particular 

15 format used is determined by the desired application. An immunoassay can 
use, for example, a monoclonal antibody directed against a single disease- 
associated epitope, a combination of monoclonal antibodies directed against 
different epitopes of a single disease-associated antigenic component, 
monoclonal antibodies directed towards epitopes of different disease- 

20 associated antigens, polyclonal antibodies directed towards the same disease- 
associated antigen, or polyclonal antibodies directed towards different disease- 
associated antigens. Protocols can also, for example, use solid supports, or 
may involve immunoprecipitation. 

In accordance with the present invention, "competitive" (U.S. Pat. Nos. 

25 3,654,090 and 3,850,752), "sandwich" (U.S. Pat No. 4,016,043), and "double 
antibody," or "DASP" assays may be used. Several procedures for measuring 
the Gene 216 polypeptide (e.g., ELISA, RIA, and FACS) are known in the art 
and provide a basis for diagnosing altered or abnormal levels of Gene 216 
polypeptide expression. Normal or standard values for Gene 216 polypeptide 

30 expression are established by incubating biological samples taken from normal 
subjects, preferably human, with antibody to the Gene polypeptide under 
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conditions suitable for complex formation. The amount of standard complex 
formation may be quantified by various methods; photometric means are 
preferred. Levels of the Gene 216 polypeptide expressed in the subject 
sample, negative control (normal) sample, and positive control (disease) 
5 sample are compared with the standard values. Deviation between standard 
and subject values establishes the parameters for diagnosing disease. 

Typically, immunoassays use either a labeled antibody or a labeled 
antigenic component (e.g., that competes with the antigen in the sample for 
binding to the antibody). A number of fluorescent materials are known and can 

10 be utilized as labels for antibodies or polypeptides. These include, for 
example, Cy3, Cy5, Alexa, BODIPY, fluorescein (e.g., FluorX, DTAF, and 
FITC), rhodamine (e.g., TR1TC), auramine, Texas Red, AMCA blue, and 
Lucifer Yellow. Antibodies or polypeptides can also be labeled with a 
radioactive element or with an enzyme. Preferred isotopes include 3 H, 14 C, 

15 32 P, 35 S, 36 CI, 51 Cr, 57 Co, 58 Co, 59 Fe, 90 Y, 125 1, 131 I, and 186 Re. Preferred 
enzymes include peroxidase, ^-glucuronidase, p-D-glucosidase, p-D- 
galactosidase, urease, glucose oxidase plus peroxidase, and alkaline 
phosphatase (see, e.g., U.S. Pat Nos. 3,654,090; 3,850,752 and 4,016,043). 
Enzymes can be conjugated by reaction with bridging molecules such as 

20 carbodiimides, diisocyanates, glutaraldehyde, and the like. Enzyme labels can 
be detected visually, or measured by calorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric, or gasometric techniques. Other 
labeling systems, such as avidin/biotin, Tyramide Signal Amplification (TSA™), 
are known in the art, and are commercially available (see, e.g., ABC kit, Vector 

25 Laboratories, Inc., Burlingame, CA; NEN® Life Science Products, Inc., Boston, 
MA). 

Kits suitable for antibody-based diagnostic applications typically include 
one or more of the following components: 

(1) Antibodies: The antibodies may be pre-labeled; alternatively, the 
30 antibody may be unlabeled and the ingredients for labeling may be included in 
the kit in separate containers, or a secondary, labeled antibody is provided; 
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and 

(2) Reaction components: The kit may aiso contain other suitably 
packaged reagents and materials needed for the particular immunoassay 
protocol, including solid-phase matrices, if applicable, and standards. 
5 The kits referred to above may include instructions for conducting the 

test. Furthermore, in preferred embodiments, the diagnostic kits are adaptable 
to high-throughput and/or automated operation. 

Nucleic-acid-based diagnostic methods : The invention provides 
methods for altered levels or sequences of Gene 216 nucleic acids in a 

10 sample, such as in a biological sample, which methods comprise the steps of: 
1 ) contacting a sample suspected to contain a disease-associated nucleic acid 
with one or more disease-associated nucleic acid probes under conditions in 
which hybrids can form between any of the probes and disease-associated 
nucleic acid in the sample; and 2) detecting any hybrids formed in step (1) 

15 using any suitable means known in the art, wherein the detection of hybrids 
indicates the presence of the disease-associated nucleic acid in the sample. 
To detect disease-associated nucleic acids present in low levels in biological 
samples, it may be necessary to amplify the disease-associated sequences or 
the hybridization signal as part of the diagnostic assay. Techniques for 

20 amplification are known to those of skill in the art. 

The presence of Gene 216 polynucleotide sequences can be detected 
by DNA-DNA or DNA-RNA hybridization, or by amplification using probes or 
primers comprising at least a portion of a Gene 216 polynucleotide, or a 
sequence complementary thereto. In particular, nucleic acid amplification- 

25 based assays can use Gene 216 oligonucleotides or oligomers to detect 
transformants containing Gene 216 DNA or RNA. Gene 216 nucleic acids 
useful as probes in diagnostic methods include oligonucleotides at least 1 5 
nucleotides in length, preferably at least 20 nucleotides in length, and most 
preferably at least 25-55 nucleotides in length, that hybridize specifically with 

30 Gene 216 nucleic acids. 

Several methods can be used to produce specific probes for Gene 216 
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polynucleotides. For example, labeled probes can be produced by oligo- 
labeling, nick translation, end-labeling, or PCR amplification using a labeled 
nucleotide. Alternatively, Gene 216 polynucleotide sequences (e.g., SEQ ID 
NO:1 or SEQ ID NO:6), or any portions or fragments thereof, may be cloned 
5 into a vector for the production of an mRNA probe. Such vectors are known 
in the art, are commercially available, and may be used to synthesize RNA 
probes in vitro by addition of an appropriate RNA polymerase, such as T7, T3, 
or SP(6) and labeled nucleotides. These procedures may be conducted using 
a variety of commercially available kits (e.g., from Amersham-Pharmacia; 

10 Promega Corp.; and U.S. Biochemical Corp., Cleveland, OH). Suitable 
reporter molecules or labels which may be used include radio nucleotides, 
enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as 
substrates, cofactors, inhibitors, magnetic particles, and the like. 

A sample to be analyzed, such as, for example, a tissue sample (e.g., 

15 hair or buccal cavity) or body fluid sample (e.g., blood or saliva), may be 
contacted directly with the nucleic acid probes. Alternatively, the sample may 
be treated to extract the nucleic acids contained therein. It will be understood 
that the particular method used to extract DNA will depend on the nature of the 
biological sample. The resulting nucleic acid from the sample may be 

20 subjected to gel electrophoresis or other size separation techniques, or, the 
nucleic acid sample may be immobilized on an appropriate solid matrix without 
size separation. 

Kits suitable for nucleic acid-based diagnostic applications typically 
include the following components: 
25 (1 ) Probe DNA; The probe DNA may be prelabeled; alternatively, 

the probe DNA may be unlabeled and the ingredients for labeling may be 
included in the kit in separate containers; and 

(2) Hybridization reagents: The kit may also contain other suitably 
packaged reagents and materials needed for the particular hybridization 
30 protocol, including solid-phase matrices, if applicable, and standards. 

In cases where a disease condition is suspected to involve an alteration 



-71 - 



WO 02/083077 



PCT/US02/12063 



of the Gene 216 nucleotide sequence, specific oligonucleotides may be 
constructed and used to assess the level of disease mRNA in cells affected or 
other tissue affected by the disease. For example, PCR can be used to test 
whether a person has a disease-related polymorphism (i.e., mutation). 
5 For PCR analysis, Gene 216 oligonucleotides may be chemically 

synthesized, generated enzymatically, or produced from a recombinant source. 
Oligomers will preferably comprise two nucleotide sequences, one with a 
sense orientation (5* -» 3 f ) and another with an antisense orientation (3' -> 5'), 
employed under optimized conditions for identification of a specific gene or 

10 condition. The same two oligomers, nested sets of oligomers, or even a 
degenerate pool of oligomers may be employed under less stringent conditions 
for detection and/or quantification of closely related DNA or RNA sequences. 

In accordance with PCR analysis, two oligonucleotides are synthesized 
by standard methods or are obtained from a commercial supplier of custom- 

15 made oligonucleotides. The length and base composition are determined by 
standard criteria using the Oligo 4.0 primer Picking program (W. Rychlik, 1992; 
available from Molecular Biology Insights, Inc., Cascade, CO). One of the 
oligonucleotides is designed so that it will hybridize only to the disease gene 
DNA under the PCR conditions used. The other oligonucleotide is designed 

20 to hybridize a segment of genomic DNA such that amplification of DNA using 
these oligonucleotide primers produces a conveniently identified DNA 
fragment. Samples may be obtained from hair follicles, whole blood, or the 
buccal cavity. The DNA fragment generated by this procedure is sequenced 
by standard techniques. 

25 In one particular aspect, Gene 216 oligonucleotides can be used to 

perform Genetic Bit Analysis (GBA) of Gene 216 in accordance with published 
methods (T.T. Nikiforov et al., 1994, Nucleic Acids Res. 22(20):4 167-75; T.T. 
Nikiforov TT et al., 1994, PCR Methods AppL 3(5):285-91). In PCR-based 
GBA, specific fragments of genomic DNA containing the polymorphic site(s) 

30 are first amplified by PCR using one unmodified and one phosphorothioate- 
modified primer. The double-stranded PCR product is rendered single- 
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stranded and then hybridized to immobilized oligonucleotide primer in wells of 
a multi-well plate. The primer is designed to anneal immediately adjacent to 
the polymorphic site of interest. The 3' end of the primer is extended using a 
mixture of individually labeled dideoxy nucleoside triphosphates. The label on 
5 the extended base is then determined. Preferably, GBA is performed using 
semi-automated ELISA or biochip formats (see, e.g., S.R. Head et al., 1997, 
Nucleic Acids Res. 25(24):5065-71 ; T.T. Nikiforov et al., 1 994, Nucleic Acids 
Res. 22(20):41 67-75). 

Other amplification techniques besides PCR may be used as 

1 0 alternatives, such as ligation-mediated PCR or techniques involving Q-beta 
replicase (Cahill et al., 1991, Clin. Chem., 37(9): 1482-5). Products of 
amplification can be detected by agarose gel electrophoresis, quantitative 
hybridization, or equivalent techniques for nucleic acid detection known to one 
skilled in the art of molecular biology (Sambrook et al., 1 989). Other alterations 

15 in the disease gene may be diagnosed by the same type of amplification- 
detection procedures, by using oligonucleotides designed to contain and 
specifically identify those alterations. 

Gene 216 polynucleotides may also be used to detect and quantify 
levels of Gene 216 mRNA in biological samples in which altered expression of 

20 Gene 216 polynucleotide may be correlated with disease. These diagnostic 
assays may be used to distinguish between the absence, presence, increase, 
and decrease of Gene 216 mRNA levels, and to monitor regulation of Gene 
216 polynucleotide levels during therapeutic treatment or intervention. For 
example, Gene 216 polynucleotide sequences, or fragments, or 

25 complementary sequences thereof, can be used in Southern or Northern 
analysis, dot blot, or other membrane-based technologies; in PCR 
technologies; or in dip stick, pin, ELISA or biochip assays utilizing fluids or 
tissues from patient biopsies to detect the status of, e.g., levels or 
overexpression of Gene 216, or to detect altered Gene 216 expression. Such 

30 qualitative or quantitative methods are well known in the art (G.H. Keller and 
M.M. Manak, 1993, DNA Probes, 2 nd Ed, Macmillan Publishers Ltd., England; 
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D.W. Dieffenbach and G. S. Dveksler, 1995, PCR Primer: A Laboratory 
Manual, Cold Spring Harbor Press, Plainview, NY; B.D. Hames and S.J. 
Higgins, 1985, Gene Probes 1, 2, IRL Press at Oxford University Press, 
Oxford, England). 

5 Methods suitable for quantifying the expression of Gene 216 include 

radiolabeling or biotinylating nucleotides, co-amplification of a control nucleic 
acid, and standard curves onto which the experimental results are interpolated 
(P.C. Melby et al., 1993, J. Immunol. Methods 159:235-244; and C. Duplaa et 
al., 1993, Anal. Biochem. 229-236). The speed of quantifying multiple samples 

10 may be accelerated by running the assay in an ELISA format where the 
oligomer of interest is presented in various dilutions and a spectrophoto metric 
or colorimetric response gives rapid quantification. 

In accordance with these methods, the specificity of the probe, i.e., 
whether it is made from a highly specific region (e.g., at least 8 to 10 or 12 or 

15 15 contiguous nucleotides in the 5' regulatory region), or a less specific region 
(e.g., especially in the 3' coding region), and the stringency of the hybridization 
or amplification (e.g., high, intermediate, or low) will determine whether the 
probe identifies only naturally occurring sequences encoding the Gene 216 
polypeptide, alleles thereof, or related sequences. 

20 In a particular aspect, a Gene 216 nucleic acid sequence, or a 

sequence complementary thereto, or fragment thereof, may be useful in 
assays that detect Gene 21 6-related diseases such as asthma. The Gene 216 
polynucleotide can be labeled by standard methods, and added to a biological 
sample from a subject under conditions suitable for the formation of 

25 hybridization complexes. After a suitable incubation period, the sample can be 
washed and the signal is quantified and compared with a standard value. If the 
amount of signal in the test sample is significantly altered from that of a 
comparable negative control (normal) sample, the altered levels of Gene 216 
nucleotide sequence can be correlated with the presence of the associated 

30 disease. Such assays may also be used to evaluate the efficacy of a particular 
prophylactic or therapeutic regimen in animal studies, in clinical trials, or for an 
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individual patient. 

To provide a basis for the diagnosis of a disease associated with altered 
expression of Gene 216, a normal or standard profile for expression is 
established. This may be accomplished by incubating biological samples taken 
5 from normal subjects, either animal or human, with a sequence complementary 
to the Gene 216 polynucleotide, or a fragment thereof, under conditions 
suitable for hybridization or amplification. Standard hybridization may be 
quantified by comparing the values obtained from normal subjects with those 
from an experiment where a known amount of a substantially purified 

10 polynucleotide is used. Standard values obtained from normal samples may 
be compared with values obtained from samples from patients who are 
symptomatic for the disease. Deviation between standard and subject (patient) 
values is used to establish the presence of the condition. 

Once the disease is diagnosed and a treatment protocol is initiated, 

15 hybridization assays may be repeated on a regular basis to evaluate whether 
the level of expression in the patient begins to approximate that which is 
observed in a normal individual. The results obtained from successive assays 
may be used to show the efficacy of treatment over a period ranging from 
several days to months. 

20 With respect to diseases such as asthma, the presence of an abnormal 

amount of Gene 216 transcript in a biological sample (e.g., body fluid, cells, 
tissues, or cell or tissue extracts) from an individual may indicate a 
predisposition for the development of the disease, or may provide a means for 
detecting the disease prior to the appearance of actual clinical symptoms. A 

25 more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier, thereby preventing the 
development or further progression of the disease. 

Microarravs : In another embodiment of the present invention, 
oligonucleotides, or longer fragments derived from the Gene 216 

30 polynucleotide sequence described herein may be used as targets in a 
microarray (e.g., biochip) system. The microarray can be used to monitor the 
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expression level of large numbers of genes simultaneously (to produce a 
transcript image), and to identify genetic variants, mutations, and 
polymorphisms. This information may be used to determine gene function, to 
understand the genetic basis of a disease, to diagnose disease, and to develop 
5 and monitor the activities of therapeutic or prophylactic agents. Preparation 
and use of microarrays have been described in WO 95/1 1995 to Chee et aL; 
D.J. Lockhart et aL, 1996, Nature Biotechnology 14:1675-1680; M. Schena et 
aL, 1996, Proc. Natl. Acad. ScL USA 93:10614-10619; U.S. Patent No. 
6,015,702 to P. Lai et al; J. Worley et aL, 2000, Microarray Biochip 
10 Technology, M. Schena, ed., Biotechniques Book, Natick, MA, pp. 65-86; Y.H. 
Rogers et aL, 1999, Anal. Biochem. 266(1 ):23-30; S.R. Head et aL, 1999, Mol. 
Cell. Probes. 13(2):81-7; S.J. Watson et aL, 2000, Biol. Psychiatry 
48(1 2): 1147-56. 

In one application of the present invention, microarrays containing 

15 arrays of Gene 216 polynucleotide sequences can be used to measure the 
expression levels of Gene 216 in an individual. In particular, to diagnose an 
individual with a Gene 216-related condition or disease, a sample from a 
human or animal (containing nucleic acids, e.g., mRNA) can be used as a 
probe on a biochip containing an array of Gene 216 polynucleotides (e.g., 

20 DNA) in decreasing concentrations (e.g., 1 ng, 0.1 ng, 0.01 ng, etc.). The test 
sample can be compared to samples from diseased and normal samples. 
Biochips can also be used to identify Gene 216 mutations or polymorphisms 
in a population, including but not limited to, deletions, insertions, and 
mismatches. For example, mutations can be identified by: 1) placing Gene 

25 216 polynucleotides of this invention onto a biochip; 2) taking a test sample 
(containing, e.g., mRNA) and adding the sample to the biochip; 3) determining 
if the test samples hybridize to the Gene 216 polynucleotides attached to the 
chip under various hybridization conditions (see, e.g., V.R. Chechetkin et aL, 
2000, J. BiomoL Struct Dyn. 18(1):83-101). Alternatively microarray 

30 sequencing can be performed (see, e.g., E.P. Diamandis, 2000, Clin. Chem. 
46(10):1 523-5). 
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Chromosome mapping : In another application of this invention, the 
Gene 216 nucleic acid sequence, or a complementary sequence, or fragment 
thereof, can be used as probes which are useful for mapping the naturally 
occurring genomic sequence. The sequences may be mapped to a particular 
5 chromosome, to a specific region of a chromosome, or to human artificial 
chromosome constructions (HACs), yeast artificial chromosomes (YACs), 
bacterial artificial chromosomes (BACs), bacterial PI constructions, or single 
chromosome cDNA libraries (see CM. Price, 1993, Blood Rev., 7:127-134 and 
by B.J. Trask, 1991, Trends Genet 7:149-154). 

10 In another of its aspects, the invention relates to a diagnostic kit for 

detecting Gene 216 polynucleotide or polypeptide as it relates to a disease or 
susceptibility to a disease, particularly asthma. Also related is a diagnostic kit 
that can be used to detect or assess asthma conditions. Such kits comprise 
one or more of the following: 

15 (a) a Gene 21 6 polynucleotide, preferably the nucleotide sequence 

of SEQ ID NO:1 or SEQ ID NO:6, or a fragment thereof; or 

(b) a nucleotide sequence complementary to that of (a); or 

(c) a Gene 216 polypeptide, preferably the polypeptide of SEQ ID 
NO:4, or a fragment thereof; or 

20 (d) an antibody to a Gene 216 polypeptide, preferably to the 

polypeptide of SEQ ID NO:4, or an antibody bindable fragment thereof. It will 
be appreciated that in any such kits, (a), (b), (c), or (d) may comprise a 
substantial component and that instructions for use can be included. The kits 
may also contain peripheral reagents such as buffers, stabilizers, etc. 

25 The present invention also includes a test kit for genetic screening that 

can be utilized to identify mutations in Gene 216. By identifying patients with 
mutated Gene 216 DNA and comparing the mutation to a database that 
contains known mutations in Gene 216 and a particular condition or disease, 
identification and/or confirmation of, a particular condition or disease can be 

30 made. Accordingly, such a kit would comprise a PCR-based test that would 
involve transcribing the patients mRNA with a specific primer, and amplifying 
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the resulting cDNA using another set of primers. The amplified product would 
be detectable by gel electrophoresis and could be compared with known 
standards for Gene 216. Preferably, this kit would utilize a patient's blood, 
serum, or saliva sample, and the DNA would be extracted using standard 
5 techniques. Primers flanking a known mutation would then be used to amplify 
a fragment of Gene 216. The amplified piece would then be sequenced to 
determine the presence of a mutation. 

Genomic Screening : The use of polymorphic genetic markers linked to 
the Gene 216 gene is very useful in predicting susceptibility to the diseases 

10 genetically linked to 20p13-p12. Similarly, the identification of polymorphic 
genetic markers within the Gene 216 gene will allow the identification of 
specific allelic variants that are in linkage disequilibrium with other genetic 
lesions that affect one of the disease states discussed herein including 
respiratory disorders, obesity, and inflammatory bowel disease. SSCP (see 

15 below) allows the identification of polymorphisms within the genomic and 
coding region of the disclosed gene. The present invention provides 
sequences for primers that can be used identify exons that contain SNPs and 
the corresponding alleles, as well as sequences for primers that can be used 
to identify the sequence change. This information can be used to identify 

20 additional SNPs, alleles, and haplotypes in accordance with the methods 
disclosed herein. Suitable methods for genomic screening have also been 
described by, e.g., Sheffield et a!., 1995, Genet, 4:1837-1844; LeBlanc- 
Straceski etal., 1994, Genomics, 19:341-9; Chen etal., 1995, Genomics, 25:1- 
8. In employing these methods, the disclosed reagents can be used to predict 

25 the risk for disease (e.g., respiratory disorders, obesity, and inflammatory 
bowel disease) in a population or individual. 
Therapeutics 

The present invention provides methods of screening for drugs 
comprising contacting such an agent with a novel protein of this invention or 
30 fragment thereof and assaying 1 ) for the presence of a complex between the 
agent and the protein or fragment, or 2) for the presence of a complex between 
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the protein or fragment and a ligand, by methods well known in the art. In such 
competitive binding assays the novel protein or fragment is typically labeled. 
Free protein or fragment is separated from that present in a protein: protein 
complex, and the amount of free (i.e., uncomplexed) label is a measure of the 
5 binding of the agent being tested to Gene 216 protein or its interference with 
protein ligand binding, respectively. 

This invention also contemplates the use of competitive drug screening 
assays in which neutralizing antibodies capable of specifically binding the Gene 
216 protein compete with a test compound for binding to the Gene 216 protein 

10 or fragments thereof. In this manner, the antibodies can be used to detect the 
presence of any peptide that shares one or more antigenic determinants of a 
Gene 216 protein. 

The goal of rational drug design is to produce structural analogs of 
biologically active proteins of interest or of small molecules with which they 

1 5 interact (e.g., agonists, antagonists, inhibitors) in order to fashion drugs which 
are, for example, more active or stable forms of the protein, or which, e.g., 
enhance or interfere with the function of a protein in vivo (see, e.g., Hodgson, 
1991, Bio/Technology, 9:19-21). In one approach, one first determines the 
three-dimensional structure of a protein of interest or, for example, of the Gene 

20 216 receptor or ligand complex, by x-ray crystallography, by computer 
modeling or most typically, by a combination of approaches. Less often, useful 
information regarding the structure of a protein may be gained by modeling 
based on the structure of homologous proteins. An example of rational drug 
design is the development of HIV protease inhibitors (Erickson et al., 1990, 

25 Science, 249:527-533). In addition, peptides (e.g., Gene 216 protein) are 
analyzed by an alanine scan (Wells, 1991 , Methods in Enzymol., 202:390-41 1 ). 
in this technique, an amino acid residue is replaced by Ala, and its effect on 
the peptide's activity is determined. Each of the amino acid residues of the 
peptide is analyzed in this manner to determine the important regions of the 

30 peptide. 

It is also possible to isolate a target-specific antibody, selected by a 
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functional assay, and then to solve its crystal structure. In principle, this 
approach yields a pharmacore upon which subsequent drug design can be 
based. It is possible to bypass protein crystallography altogether by generating 
anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active 
5 antibody. As a mirror image of a mirror image, the binding site of the anti-ids 
would be expected to be an analog of the original Gene 216 protein. The anti- 
id could then be used to identify and isolate peptides from banks of chemically 
or biologically produced banks of peptides. Selected peptides would then act 
as the pharmacore. 

1 0 Thus, one may design drugs which result in, for example, altered Gene 

216 protein activity or stability or which act as inhibitors, agonists, antagonists, 
etc. of Gene 216 protein activity. By virtue of the availability of cloned Gene 
216 gene sequences, sufficient amounts of the Gene 216 protein may be made 
available to perform such analytical studies as x-ray crystallography. In 

1 5 addition, the knowledge of the Gene 21 6 polypeptide sequence will guide those 
employing computer-modeling techniques in place of, or in addition to x-ray 
crystallography. 

In another aspect of the present invention, cells and animals that carry 
the Gene 216 gene or an analog thereof can be used as model systems to 

20 study and test for substances that have potential as therapeutic agents. After 
a test substance is administered to animals or applied to the cells, the 
phenotype of the animals/cells can be determined. 

In yet another aspect of this invention, antibodies that specifically react 
with Gene 216 polypeptide of peptides derived therefrom can be used as 

25 therapeutics. In particular, anti-Gene 216 antibodies can be used to block the 
Gene 216 activity. Anti-Gene 216 antibodies or fragments thereof can be 
formulated as pharmaceutical compositions and administered to a subject. It 
is noted that antibody-based therapeutics produced from non-human sources 
can cause an undesired immune response in human subjects. To minimize 

30 this problem, chimeric antibody derivatives can be produced. Chimeric 
antibodies combine a non-human animal variable region with a human 
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constant region. Chimeric antibodies can be constructed according to methods 
known in the art (see Morrison et a!., 1985, Proc. Natl. Acad. Sci. USA 
81:6851; Takeda et al., 1985, Nature 314:452; U.S. Patent No. 4,816,567 of 
Cabilly et al.; U.S. Patent No. 4,816,397 of Boss et aL; European Patent 
5 Publication EP 171496; EP 0173494; United Kingdom Patent GB 2177096B). 
In addition, antibodies can be further "humanized" by any of the techniques 
known in the art, (e.g., Teng et al., 1983, Proc. Natl. Acad. Sci. USA 80:7308- 
7312; Kozbor et ai., 1983, Immunology Today 4: 7279; Olsson et al., 1982, 
Meth. Enzymol. 92:3-16; international Patent Application WO92/06193; EP 
10 0239400). Humanized antibodies can also be obtained from commercial 
sources (e.g., Scotgen Limited, Middlesex, Great Britain). Immunotherapy with 
a humanized antibody may resuft in increased long-term effectiveness for the 
treatment of chronic disease situations or situations requiring repeated 
antibody treatments. 

15 In one embodiment, compositions (e.g., pharmaceutical compositions) 

for use with the present invention comprise metal loprotease inhibitors, or 
analogs or derivatives thereof. Non-limiting examples of metalloprotease 
inhibitors include: 1) naturally occurring inhibitors, e.g., oprin (J.J. Catanese 
and L.F. Kress, 1992, Biochemistry 31:410-418; HSF (Y. Yamakawa and T. 

20 Omori-Satoh, 1992, J. Biochem. 112:583-589); erinacin (D. Mebs et al., 1996, 
Toxicon 34:1313-1316; Omori-Satoh et al., 2000, Toxicon 38:1561-1580); 
DM40 and DM43 (A.G. Neves-Ferreira et al., 2000, Biochem. Biophys. Acta. 
1473:309-320); citrate (B. Francis et al., 1992, Toxicon 30:1239-1246); TIMP-1 
and TIMP-2 (R.V. Ward et al., 1991, Biochem J. 278, Pt 1:179-873); 

25 pyrophosphate (G.S. Makowski and M.L Ramsby, 1999, \nflammation 23:333- 
360); proglutamyl peptides such as pyroGlu-Asn-Trp-OH and pyroGlu-Glu-Trp- 
OH (A. Robeva et al., 1991, Biomed. Biochem. Acta. 50:769-773); 2) peptide 
analogs and derivatives, e.g., 2-distereomeric furan-2-carbonylamino-3- 
oxohexahydroindolizino[8,7-b]indole carboxylates (S. D'Alessio et al., 2001, 

30 Eur, J. Med. Chem. 36:43-53); phosphonate and carboxyiate derivatives of 
pyroGlu-Asn-Trp-OH (D'Alessio et al., 2001); POL 647 and POL 656 (F.X. 
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Gomis-Ruth et al., 1998, Prot. Sci. 7:283-292); cysteine-switches (K. Nomura 
and N. Suzuki, 1993, FEBS Lett. 321:84-88); 3) hydroxamate compounds, e.g., 
batimastat/BB-94 (see, e.g., G.F. Beattie et a!., 1998, Clin. Cancer Res. 
8:1899-1902); prinomastat/AG3340 (see, e.g., R. Scatena, 2000, Expert Opin. 
5 Investig. Drugs 9:2159-2165); and 4) other inhibitors, e.g., ortho-substituted 
macrocyclic lactams (G.M. Ksander, 1997, J. Med. Chem. 40:495-505); 
diketopiperazine (DKP) (A.K. Szardenings et al., 1998, J. Med. Chem. 
41 (13):2 194-200; alendronate/PCP (Makowski and Ramsby, 1999); and 
CT1746 (Z. An et al., 1997, Clin. Exp. Metastasis 15:184-195). 

10 In particular, the determined structures of metalloproteases and 

metaitoprotease inhibitors can be used to devise Gene 216-targeted inhibitors 
(i.e., by rational drug design; see Szardenings et al, 1998). Structural 
information can be found in, e.g., C. Oefner et al., 2000, J. Mol. Biol. 
296(2):341-9; B. Wu et al., 2000, J. MoL Biol. 295(2):257-68; L. Chen et al., 

1 5 1 999, J. MoL Biol. 293(3):545-57; C. Fernandez-Catalanet al., 1 998, EMBO J. 
17(17):5238-48; S. Arumugam et aL, 1998, Biochemistry 37(27):9650-7; 
Gohlke et al., 1996, FEBS Lett. 378:126-130; Gomis-Ruth et al., 1998; F.X. 
Gomis-Ruth etal, 1993, EMBO J. 12:4151-4157; F.X. Gomis-Ruth etal, 1996, 
J. Mol. Biol. 264:556-566; K. Maskos et al., 1998, Proc. Natl. Acad. Sci. USA 

20 95(7):3408-12; F.X. Gomis-Ruth et al, 1997, Nature 389:77-80; M. Betz et al., 
1997, Eur. J. Biochem. 247(1 ):356-63; B. Lovejoy et al., 1994, Biochemistry 
33(27):8207-17. Structures of zinc metalloproteases are also found in 
Molecular Modeling DataBase (MMDB) at the NCBI website (hypertext transfer 
protocol on the world wide web at ncbi.nlm.nih.gov:80/Structure/MMDB/ 

25 mmdb.shtml; e.g. Accession Nos. 1D5J, 1D8F, 1D7X, 1BSK, 2TLX, 1TLX, 
1BUD, 1BSW, 1UEA, 4AIG, 3AIG, 2AIG, 1KUH, 1DTH, 1UMS, 1UMT, 7TLN, 
6TMN, 5TMN, 5TLN, 4TMN, 4TLN, 3TMN, 2TMN, 1TMN, 1TLP, 1IAG, 1HYT, 
1AST, 8TLN, 1THL). In an alternative approach, the binding specificity of 
TIMP proteins can be engineered to produce inhibitors that specifically 

30 inactivate Gene 216 polypeptide (see, e.g., H. Nagase et al., 1999, Ann. NY 
Acad. Sci. 878:1-1 1 ; G.S. Butler et al., 1999, J. Biol. Chem. 274(29):20391- 
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20396). 

In another embodiment of the present invention, compositions (e.g., 
pharmaceutical compositions) for use with the present invention comprise 
disintegrin agonists, or analogs or derivatives thereof. The determined 
5 structures of disintegrin proteins and domains can be used to devise Gene 21 6 
disintegrin-targeted agonists (i.e., by rational drug design). Such structural 
information can be found in R.A. Atkinson et a!., 1994, Int. J. Pept Protein Res. 
43:563-72; V. Saudek et al., 1991 , Eur. J. Biochem. 202:329-38; H. Minoux et 
al., 2000, J. Comput. Aided Mol. Des. 14:317-27. 

10 The present invention contemplates compositions comprising a Gene 

216 polynucleotide, polypeptide, antibody, ligand (e.g., agonist, antagonist, or 
inhibitor), or fragments, variants, or analogs thereof, and a physiologically 
acceptable carrier, excipient, or diluent as described in detail herein. The 
present invention further contemplates pharmaceutical compositions useful in 

15 practicing the therapeutic methods of this invention. Preferably, a 
pharmaceutical composition includes, in admixture, a pharmaceutically 
acceptable excipient (carrier) and one or more of a Gene 216 polypeptide, 
polynucleotide, ligand, antibody, or fragment or variant thereof, as described 
herein, as an active ingredient. The preparation of pharmaceutical 

20 compositions that contain Gene 216-reIated reagents as active ingredients is 
well understood in the art. Typically, such compositions are prepared as 
injectables, either as liquid solutions or suspensions, however, solid forms 
suitable for solution in, or suspension in, liquid prior to injection can also be 
prepared. The preparation can also be emulsified. The active therapeutic 

25 ingredient is often mixed with excipients that are pharmaceutically acceptable 
and compatible with the active ingredient. Suitable excipients are, for example, 
water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. 
In addition, if desired, the composition can contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH-buffering agents, which 

30 enhance the effectiveness of the active ingredient. 

A Gene 216 polypeptide, polynucleotide, ligand, antibody, or variant or 
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fragment thereof can be formulated into the pharmaceutical composition as 
neutralized physiologically acceptable salt forms. Suitable salts include the 
acid addition salts (i.e., formed with the free amino groups of the polypeptide 
or antibody molecule) and which are formed with inorganic acids such as, for 
5 example, hydrochloric or phosphoric acids, or such organic acids as acetic, 
oxalic, tartaric, mandelic, and the like. Salts formed from the free carboxyl 
groups can also be derived from inorganic bases such as, for example, 
sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic 
bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, 

10 procaine, and the like. 

The pharmaceutical compositions can be administered systemically by 
oral or parenteral routes. Non-limiting parenteral routes of administration 
include subcutaneous, intramuscular, intraperitoneal, intravenous, transdermal, 
inhalation, intranasal, intra-arterial, intrathecal, enteral, sublingual, or rectal. 

15 Intravenous administration, for example, can be performed by injection of a 
unit dose. The term "unit dose" when used in reference to a pharmaceutical 
composition of the present invention refers to physically discrete units suitable 
as unitary dosage for humans, each unit containing a predetermined quantity 
of active material calculated to produce the desired therapeutic effect in 

20 association with the required diluent; i.e., carrier, or vehicle. 

In one particular embodiment of the present invention, the disclosed 
pharmaceutical compositions are administered via mucoactive aerosol therapy 
(see, e.g., M. Fuloria and B.K. Rubin, 2000, Respir Care 45:868-873; I. 
Gonda, 2000, J. Pharm. Sci. 89:940-945; R. Dhand, 2000, Curr. Opin. Pulm. 

25 Med. 6(1 ):59-70; B.K. Rubin, 2000, Respir. Care 45(6):684-94; S. Suarez and 
A.J. Hickey, 2000, Respir. Care. 45(6):652-66). 

Pharmaceutical compositions are administered in a manner compatible 
with the dosage formulation, and in a therapeutically effective amount. The 
quantity to be administered depends on the subject to be treated, capacity of 

30 the subject's immune system to utilize the active ingredient, and degree of 
modulation of Gene 216 activity desired. Precise amounts of active ingredient 
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required to be administered depend on the judgment of the practitioner and are 
specific for each individual. However, suitable dosages may range from about 
0.1 to 20, preferably about 0.5 to about 10, and more preferably one to several, 
milligrams of active ingredient per kilogram body weight of individual per day 
5 and depend on the route of administration. Suitable regimes for initial 
administration and booster shots are also variable, but are typified by an initial 
administration followed by repeated doses at one or more hour intervals by a 
subsequent injection or other administration. Alternatively, continuous 
intravenous infusions sufficient to maintain concentrations of 1 0 nM to 1 0 p.M 

10 in the blood are contemplated. An exemplary pharmaceutical formulation 
comprises: Gene 216 antagonist or inhibitor (5.0 mg/ml); sodium bisulfite USP 
(3.2 mg/ml); disodium edetate USP (0.1 mg/ml); and water for injection q.s.a.d. 
(1 .0 ml). As used herein, "pg" means picogram, "ng" means nanogram, "ng" 
means microgram, "mg" means milligram, "jil" means microliter, "ml" means 

1 5 milliliter, and "I" means L. 

For further guidance in preparing pharmaceutical formulations, see, 
e.g., Gilman et al. (eds), 1990, Goodman and Gilman's: The Pharmacological 
Basis of Therapeutics, 8th ed., Pergamon Press; and Remington's 
Pharmaceutical Sciences, 17th ed., 1990, Mack Publishing Co., Easton, PA; 

20 Avis et al. (eds), 1993, Pharmaceutical Dosage Forms: Parenteral 
Medications, Dekker, New York; Lieberman etal. (eds), 1990, Pharmaceutical 
Dosage Forms: Disperse Systems, Dekker, New York. 

Pharmacogenetics : The Gene 216 polypeptides and polynucleotides 
are also useful in pharmacogenetic analysis (i.e., the study of the relationship 

25 between an individual's genotype and that individual's response to a 
therapeutic composition or drug). See, e.g., M. Eichelbaum, 1996, Clin. Exp. 
Pharmacol. Physiol. 23(10-1 1):983~985, and M.W. Linder, 1997, Clin. Chem. 
43(2):254-266. The genotype of the individual can determine the way a 
therapeutic acts on the body or the way the body metabolizes the therapeutic. 

30 Further, the activity of drug metabolizing enzymes affects both the intensity 
and duration of therapeutic activity. Differences in the activity or metabolism 
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of therapeutics can lead to severe toxicity or therapeutic failure. Accordingly, 
a physician or clinician may consider applying knowledge obtained in relevant 
pharmacogenetic studies in determining whether to administer a Gene 216 
polypeptide, polynucleotide, analog, antagonist, inhibitor, or modulator, as well 
5 as tailoring the dosage and/or therapeutic or prophylactic treatment regimen. 

In general, two types of pharmacogenetic conditions can be 
differentiated. Genetic conditions can be due to a single factor that alters the 
way the drug act on the body (altered drug action), or a factor that alters the 
way the body metabolizes the drug (altered drug metabolism). These 

10 conditions can occur either as rare genetic defects or as naturally-occurring 
polymorphisms. For example, glucose-6-phosphate dehydrogenase deficiency 
(G6PD) is a common inherited enzymopathy which results in haemolysis after 
ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

1 5 The discovery of genetic polymorphisms of drug metabolizing enzymes 

(e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 enzymes CYP2D6 
and CYP2C19) has provided an explanation as to why some patients do not 
obtain the expected drug effects or show exaggerated drug response and 
serious toxicity after taking the standard and safe dose of a drug. These 

20 polymorphisms are expressed in two phenotypes in the population, the 
extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM 
is different among different populations. The gene coding for CYP2D6 is highly 
polymorphic and several mutations have been identified in PM, which all lead 
to the absence of functional CYP2D6. Poor metabolizers quite frequently 

25 experience exaggerated drug response and side effects when they receive 
standard doses. If a metabolite is the active therapeutic moiety, PM show no 
therapeutic response. This has been demonstrated for the analgesic effect of 
codeine mediated by its CYP2D6-formed metabolite morphine. At the other 
extreme, ultra-rapid metabolizers fail to respond to standard doses. Recent 

30 studies have determined that ultra-rapid metabolism is attributable to CYP2D6 
gene amplification. 



-86- 



WO 02/083077 



PCT/US02/12063 



By analogy, genetic polymorphism or mutation may lead to allelic 
variants of Gene 216 in the population which have different levels of activity. 
The Gene 216 polypeptides or polynucleotides thereby allow a clinician to 
ascertain a genetic predisposition that can affect treatment modality. In 
5 addition, genetic mutation or variants at other genes may potentiate or diminish 
the activity of Gene 216-targeted drugs. Thus, in a Gene 216-based treatment, 
polymorphism or mutation may give rise to individuals that are more or less 
responsive to treatment. Accordingly, dosage would necessarily be modified 
to maximize the therapeutic effect within a given population containing the 

10 polymorphism. As an alternative to genotyping, specific polymorphic 
polypeptides or polynucleotides can be identified. 

To identify genes that modify Gene 21 6-targeted drug response, several 
pharmacogenetic methods can be used. One pharmacogenomics approach, 
"genome-wide association", relies primarily on a high-resolution map of the 

15 human genome. This high-resolution map shows previously identified gene- 
related markers (e.g., a "bi-allelic" gene marker map which consists of 60,000- 
100,000 polymorphic or variable sites on the human genome, each of which 
has two variants). A high-resolution genetic map can then be compared to a 
map of the genome of each of a statistically significant number of patients 

20 taking part in a Phase ll/lll drug trial to identify markers associated with a 
particular observed drug response or side effect. Alternatively, a high- 
resolution map can be generated from a combination of some ten million 
known single nucleotide polymorphisms (SNPs) in the human genome. Given 
a genetic map based on the occurrence of such SNPs, individuals can be 

25 grouped into genetic categories depending on a particular pattern of SNPs in 
their individual genome. In this way, treatment regimens can be tailored to 
groups of genetically similar individuals, taking into account traits that may be 
common among such genetically similar individuals (see, e.g., D.R. Pfost et al., 
2000, Trends BiotechnoL 18(8):334-8). 

30 As another example, the "candidate gene approach", can be used. 

According to this method, if a gene that encodes a drug target is known, all 
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common variants of that gene can be fairly easily identified in the population 
and it can be determined if having one version of the gene versus another is 
associated with a particular drug response. 

As yet another example, a "gene expression profiling approach", can be 
5 used. This method involves testing the gene expression of an animal treated 
with a drug (e.g., a Gene 216 polypeptide, polynucleotide, analog, or 
modulator) to determine whether gene pathways related to toxicity have been 
turned on. 

Information obtained from one of the approaches described herein can 

10 be used to establish a pharmacogenetic profile, which can be used to 
determine appropriate dosage and treatment regimens for prophylactic or 
therapeutic treatment an individual. A pharmacogenetic profile, when applied 
to dosing or drug selection, can be used to avoid adverse reactions or 
therapeutic failure and thus enhance therapeutic or prophylactic efficiency 

15 when treating a subject with a Gene 216 polypeptide, polynucleotide, analog, 
antagonist, inhibitor, or modulator. 

Gene 216 polypeptides or polynucleotides are also useful for monitoring 
therapeutic effects during clinical trials and other treatment Thus, the 
therapeutic effectiveness of an agent that is designed to increase or decrease 

20 gene expression, polypeptide levels, or activity can be monitored over the 
course of treatment using the Gene 216 compositions or modulators. For 
example, monitoring can be performed by: 1) obtaining a p re-administration 
sample from a subject prior to administration of the agent; 2) detecting the level 
of expression or activity of the protein in the p re-ad ministration sample; 3) 

25 obtaining one or more post-administration samples from the subject; 4) 
detecting the level of expression or activity of the polypeptide in the post- 
administration samples; 5) comparing the level of expression or activity of the 
polypeptide in the pre-ad ministration sample with the polypeptide in the post- 
administration sample or samples; and 6) increasing or decreasing the 

30 administration of the agent to the subject accordingly. 

Gene Therapy : In recent years, significant technological advances have 
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been made in the area of gene therapy for both genetic and acquired diseases 
(Kay et al., 1997, Proc. Natl. Acad. Sci. USA, 94:12744-12746). Gene therapy 
can be defined as the transfer of DNA for therapeutic purposes. Improvement 
in gene transfer methods has allowed for development of gene therapy 
5 protocols for the treatment of diverse types of diseases. Gene therapy has 
also taken advantage of recent advances in the identification of new 
therapeutic genes, improvement in both viral and non-viral gene delivery 
systems, better understanding of gene regulation, and improvement in cell 
isolation and transplantation. Gene therapy would be carried out according to 
10 generally accepted methods as described by, for example, Friedman, 1991, 
Therapy for Genetic Diseases, Friedman, Ed., Oxford University Press, pages 
105-121. 

Vectors for introduction of genes both for recombination and for 
extrachromosomal maintenance are known in the art, and any suitable vector 

1 5 may be used. Methods for introducing DNA into cells such as electroporation, 
calcium phosphate co-precipitation, and viral transduction are known in the art, 
and the choice of method is within the competence of one skilled in the art 
(Robbins (ed), 1997, Gene Therapy Protocols, Human Press, NJ). Cells 
transformed with a Gene 216 gene can be used as model systems to study 

20 chromosome 20 disorders and to identify drug treatments for the treatment of 
such disorders. 

Gene transfer systems known in the art may be useful in the practice of 
the gene therapy methods of the present invention. These include viral and 
non-viral transfer methods. A number of viruses have been used as gene 

25 transfer vectors, including polyoma, i.e., SV40 (Madzak et al., 1992, J. Gen. 
Virol., 73:1533-1536), adenovirus (Berkner, 1992, Curr. Top. Microbiol. 
Immunol., 158:39-6; Berkner et al., 1988, Bio Techniques, 6:616-629; Gorziglia 
et al., 1 992, J. Virol., 66:4407-4412; Quantin et al., 1992, Proc. Natl. Acad. Sci. 
USA, 89:2581-2584; Rosenfeld et al., 1992, Cell, 68:143-155; Wilkinson et al., 

30 1 992, NucL Acids Res., 20:2233-2239; Stratford-Perricaudet et al., 1 990, Hum. 
Gene Ther., 1:241-256), vaccinia virus (Mackett et al., 1992, Biotechnology, 
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24:495-499), adeno-associated virus (Muzyczka, 1992, Curr. Top. Microbiol. 
Immunol., 158:91- 123; Ohi et aL, 1990, Gene, 89:279-282), herpes viruses 
including HSV and EBV (Margolskee, 1992, Curr. Top. Microbiol. Immunol., 
158:67-90; Johnson et aL, 1992, J. Virol., 66:2952-2965; Fink et aL, 1992, 
5 Hum. Gene Ther., 3:11-19; Breakfield et aL, 1987, Mol. NeurobioL, 1:337-371; 
Fresse et aL, 1990, Biochem. Pharmacol., 40:2189-2199), and retroviruses of 
avian (Brandyopadhyay et aL, 1984, Mol. Cell Biol., 4:749-754; Petropouplos 
etaL, 1992, J. Virol., 66:3391-3397), murine (Miller, 1992, Curr. Top. Microbiol. 
Immunol., 158:1-24; Miller et aL, 1985, Mol. Cell Biol., 5:431-437; Sorge et aL, 

10 1984, Mol. Cell Biol., 4:1730-1737; Mann et aL, 1985, J. ViroL, 54:401-407), 
and human origin (Page et aL, 1990, J. ViroL, 64:5370-5276; Buchschalcher 
et aL, 1992, J. Virol., 66:2731-2739). Most human gene therapy protocols 
have been based on disabled murine retroviruses. 

Non-viral gene transfer methods known in the art include chemical 

15 techniques such as calcium phosphate coprecipitation (Graham et aL, 1973, 
Virology, 52:456-467; Pellicer et aL, 1980, Science, 209:1414-1422), 
mechanical techniques, for example microinjection (Anderson et aL, 1980, 
Proc. Natl. Acad. Sci. USA, 77:5399-5403; Gordon et aL, 1980, Proc. Natl. 
Acad. Sci. USA, 77:7380-7384; Brinster et aL, 1981, Cell, 27:223-231; 

20 Constantini et aL, 1981, Nature, 294:92-94), membrane fusion-mediated 
transfer via liposomes (Feigner et aL, 1987, Proc. Natl. Acad. Sci. USA, 
84:7413-7417; Wang et aL, 1989, Biochemistry, 28:9508-9514; Kaneda et aL, 
1989, J. Biol. Chem., 264:12126-12129; Stewart et aL, 1992, Hum. Gene 
Ther., 3:267-275; Nabel et aL, 1990, Science, 249:1285-1288; Lim et aL, 1992, 

25 Circulation, 83:2007-2011), and direct DNA uptake and receptor-mediated 
DNA transfer (Wolff et aL, 1990, Science, 247:1465-1468; Wu et aL, 1991, 
BioTechniques, 11:474-485; Zenke et aL, 1990, Proc. Natl. Acad. Sci. USA, 
87:3655-3659; Wu et aL, 1989, J. Biol. Chem., 264:16985-16987; Wolff etaL, 
1991, BioTechniques, 11:474-485; Wagner et aL, 1991, Proc. Natl. Acad. Sci. 

30 USA, 88:4255-4259; Cotten et aL, 1 990, Proc. Natl. Acad. Sci. USA, 87:4033- 
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4037; Curie! etai., 1991, Proc. Natl. Acad. ScL USA, 88:8850-8854; Curiel et 
al., 1991, Hum. Gene Then, 3:147-154). 

In one approach, plasmid DNA is complexed with a polylysine- 
conjugated antibody specific to the adenovirus hexon protein, and the resulting 
5 complex is bound to an adenovirus vector. The trimolecular complex is then 
used to infect cells. The adenovirus vector permits efficient binding, 
internalization, and degradation of the endosome before the coupled DNA is 
damaged. 

In another approach, liposome/DNA is used to mediate direct in vivo 

10 gene transfer. While in standard liposome preparations the gene transfer 
process is non-specific, localized in vivo uptake and expression have been 
reported in tumor deposits, for example, following direct in situ administration 
(Nabel, 1992, Hum. Gene Ther. t 3:399-410). 

Suitable gene transfer vectors possess a promoter sequence, preferably 

15 a promoter that is cell-specific and placed upstream of the sequence to be 
expressed. The vectors may also contain, optionally, one or more expressible 
marker genes for expression as an indication of successful transfection and 
expression of the nucleic acid sequences contained in the vector. In addition, 
vectors can be optimized to minimize undesired immunogenicity and maximize 

20 long-term expression of the desired gene product(s) (see Nabe, 1999, Proc. 
Natf. Acad. ScL USA 96:324-326). Moreover, vectors can be chosen based on 
cell-type that is targeted for treatment Notably, gene transfer therapies have 
been initiated for the treatment of various pulmonary diseases (see, e.g., M.J. 
Welsh, 1999, J. Clin. Invest. 104(9): 1165-6; D.L. Ennist, 1999, Trends 

25 Pharmacol. ScL 20:260-266; S.M. Albelda et aL, 2000, Ann. Intern. Med. 
132:649-660; E. Alton and C. Kitson C, 2000, Expert Opin. Investig. Drugs. 
9(7): 1523-35). 

Illustrative examples of vehicles or vector constructs for transfection or 
infection of the host cells include replication-defective viral vectors, DNA virus 
30 or RNA virus (retrovirus) vectors, such as adenovirus, herpes simplex virus and 
adeno-associated viral vectors. Adeno-associated virus vectors are single 
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stranded and allow the efficient delivery of multiple copies of nucleic acid to the 
cell's nucleus. Preferred are adenovirus vectors. The vectors will normally be 
substantially free of any prokaryotic DNA and may comprise a number of 
different functional nucleic acid sequences. An example of such functional 
5 sequences may be a DNA region comprising transcriptional and translational 
initiation and termination regulatory sequences, including promoters (e.g., 
strong promoters, inducible promoters, and the like) and enhancers which are 
active in the host cells. Also included as part of the functional sequences is an 
open reading frame (polynucleotide sequence) encoding a protein of interest. 

10 Flanking sequences may also be included for site-directed integration. In 
some situations, the 5'-flanking sequence will allow homologous 
recombination, thus changing the nature of the transcriptional initiation region, 
so as to provide for inducible or non-inducible transcription to increase or 
decrease the level of transcription, as an example. 

15 In general, the encoded and expressed Gene 216 polypeptide may be 

intracellular, i.e., retained in the cytoplasm, nucleus, or in an organelle, or may 
be secreted by the cell. For secretion, the natural signal sequence present in 
Gene 216 may be retained. When the polypeptide or peptide is a fragment of 
a Gene 216 protein, a signal sequence may be provided so that, upon 

20 secretion and processing at the processing site, the desired protein will have 
the natural sequence. Specific examples of coding sequences of interest for 
use in accordance with the present invention include the Gene polypeptide 
coding sequences, e.g., SEQ ID NO:4. 

As previously mentioned, a marker may be present for selection of cells 

25 containing the vector construct The marker may be an inducible or non- 
inducible gene and will generally allow for positive selection under induction, 
or without induction, respectively. Examples of marker genes include 
neomycin, dihydrofolate reductase, glutamine synthetase, and the like. The 
vector employed will generally also include an origin of replication and other 

30 genes that are necessary for replication in the host cells, as routinely employed 
by those having skill in the art. As an example, the replication system 
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comprising the origin of replication and any proteins associated with replication 
encoded by a particular virus may be included as part of the construct. The 
replication system must be selected so that the genes encoding products 
necessary for replication do not ultimately transform the cells. Such replication 
5 systems are represented by replication-defective adenovirus (see G. Acsadi et 
ah, 1994, Hum. Mol. Genet. 3:579-584) and by Epstein-Barr virus. Examples 
of replication defective vectors, particularly, retroviral vectors that are 
replication defective, are BAG, (see Price et al., 1987, Proc. Natl. Acad. Sci. 
USA, 84:156; Sanes et al., 1986, EMBO J., 5:3133). It will be understood that 

10 the final gene construct may contain one or more genes of interest, for 
example, a gene encoding a bioactive metabolic molecule. In addition, cDNA, 
synthetically produced DNA or chromosomal DNA may be employed utilizing 
methods and protocols known and practiced by those having skill in the art. 
According to one approach for gene therapy, a vector encoding a Gene 

15 216 polypeptide is directly injected into the recipient cells {in vivo gene 
therapy). Alternatively, cells from the intended recipients are explanted, 
genetically modified to encode a Gene 216 polypeptide, and reimplanted into 
the donor (ex vivo gene therapy). An ex vivo approach provides the advantage 
of efficient viral gene transfer, which is superior to in vivo gene transfer 

20 approaches. In accordance with ex vivo gene therapy, the host cells are first 
transfected with engineered vectors containing at least one gene encoding a 
Gene 216 polypeptide, suspended in a physiologically acceptable carrier or 
excipient such as saline or phosphate buffered saline, and the like, and then 
administered to the host. The desired gene product is expressed by the 

25 injected cells, which thus introduce the gene product into the host. The 
introduced gene products can thereby be utilized to treat or ameliorate a 
disorder that is related to altered levels of Gene 216 (e.g., asthma). 
Animal Models 

Gene 216 polynucleotides can be used to generate genetically altered 
30 non-human animals or human cell lines. Any non-human animal can be used; 
however typical animals are rodents, such as mice, rats, or guinea pigs. 
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Genetically engineered animals or cell lines can carry a gene that has been 
altered to contain deletions, substitutions, insertions, or modifications of the 
polynucleotide sequence (e.g., exon sequence). Such alterations may render 
the gene nonfunctional, (i.e., a null mutation) producing a "knockout" animal or 
5 cell line. In addition, genetically engineered animals can carry one or more 
exogenous or non-naturally occurring genes, i.e., "transgenes", that are derived 
from different organisms (e.g., humans), or produced by synthetic or 
recombinant methods. Genetically altered animals or cell lines can be used to 
study Gene 216 function, regulation, and treatments for Gene 216-related 

10 diseases. In particular, knockout animals and cell lines can be used to 
establish animal models and in vitro models for Gene 216-related illnesses, 
respectively. In addition, transgenic animals expressing human Gene 216 can 
be used in drug discovery efforts. 

A "transgenic animal" is any animal containing one or more cells 

15 bearing genetic information altered or received, directly or indirectly, by 
deliberate genetic manipulation at a subcellular level, such as by targeted 
recombination or microinjection or infection with recombinant virus. The term 
"transgenic animal" is not intended to encompass classical cross-breeding or 
in vitro fertilization, but rather is meant to encompass animals in which one or 

20 more cells are altered by, or receive, a recombinant DNA molecule. This 
recombinant DNA molecule may be specifically targeted to a defined genetic 
locus, may be randomly integrated within a chromosome, or it may be 
extrachromosomally replicating DNA. 

Transgenic animals can be selected after treatment of germline cells or 

25 zygotes. For example, expression of an exogenous Gene 216 gene or a 
variant can be achieved by operably linking the gene to a promoter and 
optionally an enhancer, and then microinjecting the construct into a zygote 
(see, e.g., Hogan et al., Manipulating the Mouse Embryo, A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY). Such 

30 treatments include insertion of the exogenous gene and disrupted homologous 
genes. Alternatively, the gene(s) of the animals may be disrupted by insertion 
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or deletion mutation of other genetic alterations using conventional techniques 
(see, e.g., Capecchi, 1989, Science, 244:1288; Valancuis et al., 1991, MoL Cell 
Biol., 11:1402; Hasty et al., 1991, Nature, 350:243; Shinkai et al., 1992, Cell, 
68:855; Mombaerts et aL, 1992, Cell, 68:869; Philpott et al., 1992, Science, 
5 256:1448; Snouwaert et al., 1 992, Science, 257:1083; Donehower et aL, 1 992, 
Nature, 356:215). 

In one aspect of the invention, Gene 216 knockout mice can be 
produced in accordance with well-known methods (see, e.g., M.R. Capecchi, 
1989, Science, 244:1288-1292; P. Li etal., 1995, Cell 80:401-411; LA. Galli- 

10 Taiiadoros et al., 1995, J. Immunol. Methods 181(1):1-15; C.H. Westphal et al., 
1997, Curr. Biol. 7(7):530-3; S.S. Cheah et al., 2000, Methods MoL Biol. 
136:455-63). The disclosed murine Gene 216 genomic clone can be used to 
prepare a Gene 216 targeting construct that can disrupt Gene 216 in the 
mouse by homologous recombination at the Gene 216 chromosomal locus. 

15 The targeting construct can comprise a disrupted or deleted Gene 216 
sequence that inserts in place of the functioning portion of the native mouse 
gene. For example, the construct can contain an insertion in the Gene 216 
protein-coding region. 

Preferably, the targeting construct contains markers for both positive 

20 and negative selection. The positive selection marker allows the selective 
elimination of cells that lack the marker, while the negative selection marker 
allows the elimination of cells that carry the marker. In particular, the positive 
selectable marker can be an antibiotic resistance gene, such as the neomycin 
resistance gene, which can be placed within the coding sequence of Gene 21 6 

25 to render it non-functional, while at the same time rendering the construct 
selectable. The herpes simplex virus thymidine kinase (HSV tk) gene is an 
example of a negative selectable marker that can be used as a second marker 
to eliminate cells that carry it. Cells with the HSV tk gene are selectively killed 
in the presence of gangcyclovir. As an example, a positive selection marker 

30 can be positioned on a targeting construct within the region of the construct 
that integrates at the Gene 216 locus. The negative selection marker can be 
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positioned on the targeting construct outside the region that integrates at the 
Gene 216 locus. Thus, if the entire construct is present in the cell, both 
positive and negative selection markers will be present, if the construct has 
integrated into the genome, the positive selection marker will be present, but 
5 the negative selection marker will be lost. 

The targeting construct can be employed, for example, in embryonal 
stem cell (ES). ES cells may be obtained from pre-implantation embryos 
cultured in vitro (M.J. Evans et ai., 1981 , Nature 292:154-156; M.O. Bradley et 
al., 1984, Nature 309:255-258; Gossler et a!., 1986, Proc. Natl. Acad. Sci. USA 

1 0 83:9065-9069; Robertson et al., 1986, Nature 322:445-448; S. A. Wood et al., 
1 993, Proc. NatL Acad. Sci. USA 90:4582-4584). Targeting constructs can be 
efficiently introduced into the ES cells by standard techniques such as DNA 
transfection or by retrovirus-mediated transduction. Following this, the 
transformed ES cells can be combined with blastocysts from a non-human 

15 animal. The introduced ES cells colonize the embryo and contribute to the 
germ line of the resulting chimeric animal (R. Jaenisch, 1988, Science 
240:1468-1474). The use of gene-targeted ES cells in the generation of gene- 
targeted transgenic mice has been previously described (Thomas et al., 1987, 
Cell 51:503-512) and is reviewed elsewhere (Frohman et al., 1989, Cell 

20 56:145-147; Capecchi, 1989, Trends in Genet. 5:70-76; Baribault et al., 1989, 
Mol. Biol. Med. 6:481-492; Wagner, 1990, EMBO J. 9:3025-3032; Bradley et 
al., 1992, Bio/Technology 10: 534-539). 

Several methods can be used to select homologously recombined 
murine ES cells. One method employs PCR to screen pools of transform ant 

25 cells for homologous insertion, followed by screening individual clones (Kim et 
al., 1988, Nucleic Acids Res. 16:8887-8903; Kim et al., 1991, Gene 103:227- 
233). Another method employs a marker gene is constructed which will only 
be active if homologous insertion occurs, allowing these recombinants to be 
selected directly (Sedivy et al., 1989, Proc. Natl. Acad. Sci. USA 86:227-231). 

30 For example, the positive-negative selection (PNS) method can be used as 
described above (see, e.g., Mansour et al., 1988, Nature 336:348-352; 
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Capecchi, 1989, Science 244:1288-1292; Capecchi, 1989, Trends in Genet. 
5:70-76). In particular, the PNS method is useful for targeting genes that are 
expressed at low levels. 

The absence of functional Gene 216 in the knockout mice can be 
5 confirmed, for example, by RNA analysis, protein expression analysis, and 
functional studies. For RNA analysis, RNA samples are prepared from 
different organs of the knockout mice and the Gene 216 transcript is detected 
in Northern blots using oligonucleotide probes specific for the transcript. For 
protein expression detection, antibodies that are specific for the Gene 216 

10 polypeptide are used, for example, in flow cytometric analysis, 
immunohistochemical staining, and activity assays. Alternatively, functional 
assays are performed using preparations of different cell types collected from 
the knockout mice. 

Several approaches can be used to produce transgenic mice. In one 

15 approach, a targeting vector is integrated into ES cell by homologous 
recombination, an intrachromosomal recombination event is used to eliminate 
the selectable markers, and only the transgene is left behind (A.L. Joyner et al., 
1989, Nature 338(6211): 153-6; P. Hasty et al., 1991, Nature 350(631 5):243~6; 
V. Valancius and O. Smithies, 1991, MoL Cell Biol. 11(3):1402-8; S. Fiering et 

20 al., 1993, Proc. Natl. Acad. Sci. USA 90(18):8469-73). In an alternative 
approach, two or more strains are created; one strain contains the gene 
knocked-out by homologous recombination, while one or more strains contain 
transgenes. The knockout strain is crossed with the transgenic strain to 
produce new line of animals in which the original wild-type allele has been 

25 replaced (although not at the same site) with a transgene. Notably, knockout 
and transgenic animals can be produced by commercial facilities (e.g., The 
Lerner Research Institute, Cleveland, OH; B&K Universal, Inc., Fremont, CA; 
DNX Transgenic Sciences, Cranbury, NJ; Incyte Genomics, Inc., St. Louis, 
MO). 

30 Transgenic animals (e.g., mice) containing a nucleic acid molecule 

which encodes human Gene 216, may be used as in vivo models to study the 
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overexpression of Gene 216. Such animals can also be used in drug 
evaluation and discovery efforts to find compounds effective to inhibit or 
modulate the activity of Gene 216, such as for example compounds for treating 
respiratory disorders, diseases, or conditions. One having ordinary skill in the 
5 art can use standard techniques to produce transgenic animals which produce 
human Gene 216 polypeptide, and use the animals in drug evaluation and 
discovery projects (see, e.g., U.S. Patent No. 4,873,191 to Wagner; U.S. 
Patent No. 4,736,866 to Leder). 

In another embodiment of the present invention, the transgenic animal 

10 can comprise a recombinant expression vector in which the nucleotide 
sequence that encodes human Gene 216 is operably linked to a tissue specific 
promoter whereby the coding sequence is only expressed in that specific 
tissue. For example, the tissue specific promoter can be a mammary cell 
specific promoter and the recombinant protein so expressed is recovered from 

15 the animal's milk. 

In yet another embodiment of the present invention, a Gene 216 
"knockout" can be produced by administering to the animal antibodies (e.g., 
neutralizing antibodies) that specifically recognize an endogenous Gene 216 
polypeptide. The antibodies can act to disrupt function of the endogenous 

20 Gene 216 polypeptide, and thereby produce a null phenotype. In one specific 
example, an orthologous mouse Gene 216 polypeptide (e.g., SEQ ID NO:366) 
or peptide can be used to generate antibodies. These antibodies can be given 
to a mouse to knockout the function of the mouse Gene 216 ortholog. 

In addition, non-mammalian organisms may be used to study Gene 216 

25 and Gene 216-related diseases. For example, model organisms such as C. 
elegans, D. melanogaster, and S. cerevisiae may be used. Gene 216 
homologues can be identified in these model organisms, and mutated or 
deleted to produce a Gene 216-deficient strain. Human Gene 216 can then be 
tested for the ability to "complement" the Gene 216-deficient strain. Gene 216- 

30 deficient strains can also be used for drug screening. The study of Gene 216 
homologs can facilitate the understanding of human Gene 216 biological 
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function, and assist in the identification of binding proteins (e.g., agonists and 

antagonists). 

Gene Identification 

To identify genes in the region on 20p13-p12, a set of bacterial artificial 
5 chromosome(BAC) clones containing this chromosomal region was identified 
in accordance with the methods described herein. The BAC clones served as 
a template for genomic DNA sequencing and served as reagents for identifying 
coding sequences by direct cDNA selection. Genomic sequencing and direct 
cDNA selection methods were used to characterize DNA from 20p13~p12. 

10 When one or more genes have been genetically localized to a specific 

chromosomal region, the gene(s) can be characterized at the molecular level 
by a series of steps that include: 1) cloning the entire region of DNA in a set 
of overlapping clones (physical mapping); 2) characterizing the gene(s) 
encoded by these clones by a combination of direct cDNA selection, exon 

15 trapping and DNA sequencing (gene identification); and 3) identifying 
mutations (i.e., SNPs) in the gene(s) by comparative DNA sequencing of 
affected and unaffected members of the kindred and/or in unrelated affected 
individuals and unrelated unaffected controls (mutation analysis). 

Physical mapping is accomplished by screening libraries of human DNA 

20 cloned in vectors that are propagated in a host such as E. co//, using 
hybridization or PGR assays from unique molecular landmarks in the 
chromosomal region of interest. In accordance with the present invention, a 
physical map of the disorder region was generated by screening a library of 
human DNA cloned in BACs with a set overgo markers that had been 

25 previously mapped to chromosome 20p13-p12 by the efforts of the Human 
Genome Project. Overgos are unique molecular landmarks in the human 
genome that can be assayed by hybridization. The location of thousands of 
overgos on the twenty-two autosomes and two sex chromosomes has been 
determined through the efforts of the Human Genome Project. For a positional 

30 cloning effort, the physical map is tied to the genetic map because the markers 
used for genetic mapping can also be used as overgos for physical mapping. 
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. By screening a BAC library with a combination of overgos derived from genetic 
markers, genes, and random DNA fragments, a physical map comprised of 
overlapping clones representing all of the DNA in a chromosomal region of 
interest can be assembled. 
5 BACs are cloning vectors for large (80 kilobase to 200 kilobase) 

segments of human or other DNA that are propagated in E. coli To construct 
a physical map using BACs, a library of BAC clones is screened so that 
individual clones harboring the DNA sequence corresponding to a given overgo 
or set of overgos are identified. Throughout most of the human genome, the 

10 overgo markers are spaced approximately 20 to 50 kilobases apart, so that an 
individual BAC clone typically contains at least two overgo markers. In 
addition, the BAC libraries that were screened contain enough cloned DNA to 
cover the human genome twelve times over. An individual overgo typically 
identifies more than one BAC clone. By screening a twelve-fold coverage BAC 

15 library with a series of overgo markers spaced approximately 50 kilobases 
apart, a physical map consisting of a series of overlapping contiguous BAC 
clones, i.e., BAC "contigs," can be assembled for any region of the human 
genome. This map is closely tied to the genetic map because many of the 
overgo markers used to prepare the physical map are also genetic markers. 

20 When constructing a physical map, it often happens that there are gaps 

in the overgo map of the genome that result in the inability to identify BAC 
clones that are overlapping in a given location. Typically, the physical map is 
first constructed from a set of overgos identified through the publicly available 
literature and World Wide Web resources. The initial map consists of several 

25 separate BAC contigs that are separated by gaps of unknown molecular 
distance. To identify BAC clones that fill these gaps, it is necessary to develop 
new overgo markers from the ends of the clones on either side of the gap. 
This is done by sequencing the terminal 200 to 300 base pairs of the BACs 
flanking the gap, and developing a PCR or hybridization based assay. If the 

30 terminal sequences are demonstrated to be unique within the human genome, 
then the new overgo can be used to screen the BAC library to identify 
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additional BACs that contain the DNA from the gap in the physical map. To 
assemble a BAC contig that covers a region the size of the disorder region 
(6,000,000 or more base pairs), it is necessary to develop new overgo markers 
from the ends of a number of clones. 
5 After building a BAC contig, this set of overlapping clones serves as a 

template for identifying the genes encoded in the chromosomal region. Gene 
identification can be accomplished by many methods. Three methods are 
commonly used: 1) a set of BACs selected from the BAC contig to represent 
the entire chromosomal region are sequenced, and computational methods are 

10 used to identify all of the genes; 2) the BACs from the BAC contig are used as 
a reagent to clone cDNAs corresponding to the genes encoded in the region 
by a method termed direct cDNA selection; or 3) the BACs from the BAC contig 
are used to identify coding sequences by selecting for specific DNA sequence 
motifs in a procedure called exon trapping. Gene 216 was identified by 

1 5 methods (1 ) and (2) in accordance with the techniques disclosed herein. 

To sequence the entire BAC contig representing the disorder region, a 
set of BACs can be chosen for subcloning into plasmid vectors and subsequent 
DNA sequencing of these subclones. Since the DNA cfoned in the BACs 
represents genomic DNA, this sequencing is referred to as genomic 

20 sequencing to distinguish it from cDNA sequencing. To initiate the genomic 
sequencing for a chromosomal region of interest, several non-overlapping BAC 
clones are chosen. DNA for each BAC clone is prepared, and the clones are 
sheared into random small fragments that are subsequently cloned into 
standard plasmid vectors such as pUC18. The plasmid clones are then grown 

25 to propagate the smaller fragments, and these are the templates for 
sequencing. To ensure adequate coverage and sequence quality for the BAC 
DNA sequence, sufficient plasmid clones are sequenced to yield three-fold 
coverage of the BAC clone. For example, if the BAC is 100 kilobases long, 
then phagemids are sequenced to yield 300 kilobases of sequence. Since the 

30 BAC DNA is randomly sheared prior to cloning in the phagemid vector, the 300 
kilobases of raw DNA sequence can be assembled by computational methods 
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into overlapping DNA sequences termed sequence contigs. For the purposes 
of initial gene identification by computational methods, three-fold coverage of 
each BAC is sufficient to yield twenty to forty sequence contigs of 1000 base 
pairs to 20,000 base pairs. 
5 In accordance with the present invention, the "seed" BACs from the 

BAC contig in the disorder region were sequenced. The sequence of the 
"seed" BACs was then used to identify minimally overlapping BACs from the 
contig, and these were subsequently sequenced. In this manner, the entire 
candidate region can be sequenced, with several small sequence gaps left in 

10 each BAC. This sequence serves as the template for computational gene 
identification. In one approach, genes can be identified by comparing the 
sequence of BAC contig to publicly available databases of cDNA and genomic 
sequences, e.g. UniGene, dbEST, EMBL nucleotide database, GenBank, and 
the DNA Database of Japan (DDBJ). The BAC DNA sequence can also be 

1 5 translated into protein sequence, and the protein sequence can be used to 
search publicly available protein databases, e.g., GenPept, EMBL protein 
database, Protein Information Resource (PIR), Protein Data Bank (PDB), and 
SWISS-PROT. These comparisons are typically done using the BLAST family 
of computer algorithms and programs (Altschul et al., 1990, J. Mol. Biol., 

20 215:403-410; Altschul et al, 1997, Nucl. Acids Res., 25:3389-3402). 

For nucleotide queries, BLASTN, BLASTX, and TBLASTX can be used. 
BLASTN compares a nucleotide query sequence with a nucleotide sequence 
database; BLASTX compares a nucleotide query sequence translated in all 
reading frames against a protein sequence database; TBLASTX compares the 

25 six-frame translations of a nucleotide query sequence against the six-frame 
translations of a nucleotide sequence database. For protein queries, BLASTP 
and TBLASTN can be used. BLASTP compares a protein query sequence 
with a protein sequence database; TBLASTN compares a protein query 
sequence against a nucleotide sequence database dynamically translated in 

30 all reading frames. 

Additionally, computer algorithms such as MZEF (Zhang, 1997, Proc. 



- 102- 



WO 02/083077 



PCT/US02/12063 



Natl. Acad. Sci. USA 94:565-568), GRAIL (Uberbacher et al., 1996, Methods 
EnzymoL, 266:259-281), and Genscan (Burge and Karlin, 1997, J. MoL Biol., 
268:78-94) can be used to predict the location of exons in the sequence based 
on the presence of specific DNA sequence motifs that are common to all 
5 exons, as well as the presence of codon usage typical of human protein 
encoding sequences. 

In addition to identifying genes by computational methods, genes can 
be identified by direct cDNA selection (Del Mastro and Lovett, 1996, Methods 
In Molecular Biology, Humana Press Inc., NJ). In direct cDNA selection, cDNA 

10 pools from tissues of interest are prepared, and BACs from the candidate 
region are used in a liquid hybridization assay to capture the cDNAs which 
base pair to coding regions in the BAC. In the methods described herein, the 
cDNA pools were created from several different tissues by random priming and 
oligo dT priming the first strand cDNA from poly A + RNA, synthesizing the 

1 5 second-strand cDNA by standard methods, and adding linkers to the ends of 
the cDNA fragments. In this approach, the linkers are used to amplify the 
cDNA pools of BAC clones from the disorder region identified by screening a 
BAC library. The amplified products are then used as a template for initiating 
DNA synthesis to create a biotin labeled copy of BAC DNA. Following this, the 

20 biotin labeled copy of the BAC DNA is denatured and incubated with an excess 
of the PCR amplified, linkered cDNA pools which have also been denatured. 
The BAC DNA and cDNA are allowed to anneal in solution, and 
heteroduplexes between the BAC and the cDNA are isolated using streptavidin 
coated magnetic beads. The cDNAs that are captured by the BAC are then 

25 amplified using primers complimentary to the linker sequences, and the 
hybridization/selection process is repeated for a second round. After two 
rounds of direct cDNA selection, the cDNA fragments are cloned, and a library 
of these direct selected fragments is created. 

The cDNA clones isolated by direct selection are analyzed by two 

30 methods. Where the genomic target DNA sequence is obtained from a pool 
of BACs from the disorder region, the cDNAs are mapped to BAC genomic 
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clones to verify their chromosomal location. This is accomplished by arraying 
the cDNAs in microtiter dishes, and replicating their DNA in high-density grids. 
Individual genomic clones known to map to the region are then hybridized to 
the grid to identify direct selected cDNAs mapping to that region. cDNA clones 
5 that are confirmed to correspond to individual BACs are sequenced. To 
determine whether the cDNA clones isolated by direct selection share 
sequence identity or similarity to previously identified genes, the DNA and 
protein coding sequences are compared to publicly available databases using 
the BLAST family of programs described above. 

10 The combination of genomic DNA sequence and cDNA sequence 

provided by BAC sequencing and by direct cDNA selection yields an initial list 
of putative genes in the region. In the present invention, the genes in the 
region were candidates for the asthma locus. To further characterize each 
gene, Northern blots were performed to determine the size of the transcript 

15 corresponding to each gene, and to determine which putative exons were 
transcribed together to make an individual gene. For Northern blot analysis of 
each gene, probes are prepared from direct selected cDNA clones or by PCR 
amplifying specific fragments from genomic DNA, cDNA or from the BAC 
encoding the putative gene of interest. The Northern blot analysis is used to 

20 determine the size of the transcript and the tissues in which it is expressed. 
For transcripts that are not highly expressed, it is sometimes necessary to 
perform a reverse transcription PCR assay using RNA from the tissues of 
interest as a template for the reaction. 

Gene identification by computational methods and by direct cDNA 

25 selection provides unique information about the genes in a region of a 
chromosome. Once genes are identified, it is possible to examine subjects for 
sequence variants. Variant sequences can be inherited as allelic differences 
or can arise from spontaneous mutations. 

Inherited alleles can be analyzed for linkage to a disease susceptibility 

30 locus. Linkage analysis is possible because of the nature of inheritance of 
chromosomes from parents to offspring. During meiosis, the two parental 
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homologs pair to guide their proper separation to daughter cells. While they 
are paired, the two homologs exchange pieces of the chromosomes, in an 
event called "crossing over" or "recombination." The resulting chromosomes 
contain parts that originate from both parental homologs. The closer together 
5 two sequences are on the chromosome, the less likely that a recombination 
event will occur between them, and the more closely linked they are. 

In the present invention, data obtained from the different families were 
combined and analyzed together by a computer using statistical methods 
described herein. The results were then used as evidence for linkage between 

10 the genetic markers used and an asthma susceptibility locus. 

In general, a recombination frequency of 1% is equivalent to 
approximately 1 map unit, a relationship that holds up to frequencies of about 
20% or 20 cM. One centimorgan (cM) is roughly equivalent to 1,000 kb of 
DNA. The entire human genome is 3,300 cM long. In order to find an 

15 unknown disease gene within 5-10 cM of a marker locus, the whole human 
genome can be searched with roughly 330 informative marker loci spaced at 
approximately 10 cM intervals (Botstein et al., 1980, Am. J. Hum. Genet, 
32:314-331). 

The reliability of linkage results is established by using a number of 
20 statistical methods. The methods most commonly used for the detection by 
linkage analysis of oligogenes involved in the etiology of a complex trait are 
non-parametric or model-free methods which have been implemented into the 
computer programs MAPMAKER/SIBS (L. Krugiyak and E.S. Lander, 1995, 
Am. J. Hum. Genet 57:439-454) and GENEHUNTER (L. Krugiyak et al., 1 996, 
25 Am. J. Hum. Genet 58:1347-1363). Typically, linkage analysis is performed 
by typing members of families with multiple affected individuals at a given 
marker locus and evaluating if the affected members (excluding parent- 
offspring pairs) share alleles at the marker locus that are identical by descent 
(IBD) more often than expected by chance alone. 
30 As a result of the rapid advances in mapping the human genome over 

the last few years, and concomitant improvements in computer methodology, 
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it has become feasible to carry out linkage analyses using multi-point data. 
Multi-point analysis provides a simultaneous analysis of linkage between the 
trait and several linked genetic markers, when the recombination distance 
among the markers is known. A LOD score statistic is computed at multiple 
5 locations along a chromosome to measure the evidence that a susceptibility 
locus is located nearby. A LOD score is the logarithm base 10 of the ratio of 
the likelihood that a susceptibility locus exists at a given location to the 
likelihood that no susceptibility locus is located there. By convention, when 
testing a single marker, a total LOD score greater than +3.0 (that is, odds of 

10 linkage being 1 ,000 times greater than odds of no linkage) is considered to be 
significant evidence for linkage. 

Multi-point analysis is advantageous for two reasons. First, the 
informativeness of the pedigrees is usually increased. Each pedigree has a 
certain amount of potential information, dependent on the number of parents 

15 heterozygous for the marker loci and the number of affected individuals in the 
family. However, few markers are sufficiently polymorphic as to be informative 
in all those individuals. If multiple markers are considered simultaneously, then 
the probability of an individual being heterozygous for at least one of the 
markers is greatly increased. Second, an indication of the position of the 

20 disease gene among the markers may be determined. This allows identification 
of flanking markers, and thus eventually allows identification of a small region 
in which the disease gene resides. 
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EXAMPLES 

The examples as set forth herein are meant to exemplify the various 
aspects of the present invention and are not intended to limit the invention in 
any way. 

5 EXAMPLE 1: Family Collection 

Asthma is a complex disorder that is influenced by a variety of factors, 
including both genetic and environmental effects. Complex disorders are 
typically caused by multiple interacting genes, some contributing to disease 
development and some conferring a protective effect. The success of linkage 

10 analyses in identifying chromosomes with significant LOD scores is achieved 
in part as a result of an experimental design tailored to the detection of 
susceptibility genes in complex diseases, even in the presence of epistasis and 
genetic heterogeneity. Also important are rigorous efforts in ascertaining 
asthmatic families that meet strict guidelines, and collecting accurate clinical 

15 information. 

Given the complex nature of the asthma phenotype, non-parametric 
affected sib pair analyses were used to analyze the genetic data. This 
approach does not require parameter specifications such as mode of 
inheritance, disease allele frequency, penetrance of the disorder, or phenocopy 

20 rates. Instead, it determines whether the inheritance pattern of a chromosomal 
region is consistent with random segregation. Where segregation is not 
random, affected sibs inherit identical copies of alleles more often than 
expected by chance. Because no models for inheritance are assumed, allele- 
sharing methods tend to be more robust than parametric methods when 

25 analyzing complex disorders. They do, however, require larger sample sizes 
to reach statistically significant results. 

At the outset of the program, the goal was to collect 400 affected sib- 
pair families for the linkage analyses. Based on a genome scan with markers 
spaced -10 cM apart, this number of families was predicted to provide > 95% 

30 power to detect an asthma susceptibility gene that caused an increased risk 
to first-degree relatives of 3-fold or greater. The assumed relative risk of 3-fold 
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was consistent with epidemiological studies in the literature that suggest an 
increased risk ranging from 3- to 7~fold. The relative risk was based on 
gender, different classifications of the asthma phenotype (i.e. bronchial hyper- 
responsiveness versus physician's diagnosis) and, in the case of offspring, 
5 whether one or both parents were asthmatic. 

The family collection efforts exceeded the initial goal of 400, obtaining 
a total of 444 affected sibling pair (ASP) families, with 342 families from the UK 
and 102 families from the US. The ASP families in the US collection were 
Caucasian with a minimum of two affected siblings that were identified through 

10 both private practice and community physicians as well as through advertising. 
A total of 102 families were collected in Kansas, Nebraska, and Southern 
California. In the UK collection, Caucasian families with a minimum of two 
affected siblings were identified through physicians' registers in a region 
surrounding Southampton and including the Isle of Wight. In both the US and 

15 UK collections, additional affected and unaffected sibs were collected 
whenever possible. An additional 39 families from the United Kingdom were 
utilized from an earlier collection effort with different ascertainment criteria. 
These families were recruited either: 1) without reference to asthma and 
atopy; or 2) by having at least one family member or at least two family 

20 members affected with asthma. The randomly ascertained samples were 
identified from general practitioner registers in the Southampton area. For 
families with affected members, the probands (i.e., the initial affected 
individuals identified) were recruited from hospital based clinics in 
Southampton. Seven pedigrees extended beyond a single nuclear family. 

25 Families were included in the study if they met all of the following 

criteria: 1) the biological mother and biological father were Caucasian and 
agreed to participate in the study; 2) at least two biological siblings were alive, 
each with a current physician diagnosis of asthma, and were 5 to 21 years of 
age; and 3) the two siblings were currently taking asthma medications on a 

30 regular basis. This included regular, intermittent use of inhaled or oral 
bronchodilators and regular use of cromolyn, theophylline, or steroids. 
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Families were excluded from the study if they met any one of the 
following criteria: 1) both parents were affected (i.e., with a current diagnosis 
of asthma, having asthma symptoms, or on asthma medications at the time of 
the study); 2) any asthmatic family member to be included in the study was 
5 taking beta-blockers at the time of the study, 3) any family member to be 
included in the study had congenital or acquired pulmonary disease at birth 
(e.g. cystic fibrosis), a history of serious cardiac disease (myocardial infarction) 
or any history of serious pulmonary disease (e.g. emphysema); or 4) any family 
member to be included in the study was pregnant. 

10 An extensive clinical instrument was designed and data from all 

participating family members were collected. The case report form (CRF) 
included questions on demographics, medical history including medications, 
a health survey on the incidence and frequency of asthma, wheeze, eczema, 
hay fever, nasal problems, smoking, and questions on home environment. 

1 5 Data from a video questionnaire designed to show various examples of wheeze 
and asthmatic attacks were also included in the CRF. Clinical data, including 
skin prick tests to 8 common allergens, total and specific IgE levels, and 
bronchial hyper-responsiveness following a methacholine challenge, were also 
collected from all participating family members. All data were entered into a 

20 SAS dataset (Statistical Analysis Software, Cary, NC) by IMTCI (International 
Medical Technical Consultants, Inc.) a Clinical Research Organization; either 
by double data entry or scanning followed by on-screen visual validation. An 
extensive automated review of the data was performed on a routine basis and 
a full audit at the conclusion of the data entry was completed to verify the 

25 accuracy of the dataset. 

EXAMPLE 2: Genome Scan 

In order to identify chromosomal regions linked to asthma, the 
inheritance pattern of alleles from genetic markers spanning the genome was 
assessed using the collected family resources. As described above, combining 

30 these results with the segregation of the asthma phenotype in these families 
allowed the identification of genetic markers that were tightly linked to asthma. 
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In turn, this provided an indication of the location of genes predisposing 
affected individuals to asthma. The genotyping strategy was twofold: 1) to 
conduct a genome wide scan using markers spaced at approximately 10 cM 
intervals; and 2) to target ten chromosomal regions for high density genetic 
5 mapping. The initial candidate regions for high-density mapping were chosen 
based on suggestions of linkage to these regions by other investigators. 

Genotypes of PCR amplified simple sequence microsatellite genetic 
linkage markers were determined using ABI model 377 Automated Sequencers 
(Applied Biosystems, Inc.; Foster City, CA). Microsatellite markers were 

10 obtained from Research Genetics Inc. (Huntsville, AL) in the fluorescent dye- 
conjugated form (see Dubovsky et al. 5 1995, Hum. Mol. Genet 4(3):449-452). 
The markers comprised a variation of a human linkage mapping panel as 
released from the Cooperative Human Linkage Center (CHLC), also known as 
the Weber lab screening set version 8. The variation of the Weber 8 screening 

1 5 set consisted of 535 markers with an average spacing of 6.8 cM (autosomes 
only) and 6.9 cM (all chromosomes). Eighty-nine percent of the markers 
consisted of either tri- ortetra-nucleotide microsatellites. There were no gaps 
present in chromosomal coverage greater than 17.5 cM. 

Study subject genomic DNA (5 4.5 ng/pJ) was amplified in a 10 pi 

20 PCR reaction using AmpliTaq Gold DNA polymerase (0.225 U); 1 X PCR buffer 
(80 mM (NH 4 ) 2 S0 4 ; 30 mM Tris-HC! (pH 8.8); 0.5% Tween-20); 200 pM each 
dATP, dCTP, dGTP and dTTP; 1.5-3.5 pM MgCI 2 ; and 250 pM forward and 
reverse PCR primers. PCR reactions were set up in 192 well plates (Corning 
Costar, Acton, MA) using a Tecan Genesis 150 robotic workstation equipped 

25 with a refrigerated deck (Tecan Genesis, Durham, NC). PCR reactions were 
overlaid with 20 pi mineral oil, and thermocycled on an MJ Research Tetrad 
DNA Engine (MJ Research, Waltham, MA) equipped with four 192 well heads 
using the following conditions: 92°C for 3 min; 6 cycles of 92°C for 30 sec, 
56°C for 1 min, 72°C for 45 sec; followed by 20 cycles of 92°C for 30 sec, 55°C 

30 for 1 min, 72°C for 45 sec; and a 6 min incubation at 72°C. 

PCR products of 8-12 microsatellite markers were subsequently pooled 
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into two 96-well microtitre plates. This included 2.0 pi PCR product from TET 
and FAM labeled markers, 3.0 pi HEX labeled markers) using a Tecan Genesis 
200 robotic workstation and brought to a final volume of 25 pi with H 2 0. 
Following this, 1 .9 jjJ of pooled PCR product was transferred to a loading plate 
5 and combined with 3.0 ]x\ loading buffer. Loading buffer included 2.5 pi 
formamide/blue dextran (9.0 mg/ml) and 0.5 \x\ GS-500 TAMRA labeled size 
standard (ABI, Foster City, CA). Samples were denatured in the loading plate 
for 4 min at 95°C, placed on ice for 2 min, and elect rophoresed on a 5% 
denaturing polyacrylamide gel (BioWhittaker Molecular Applications, Rockland, 

10 ME) on the ABI 377XL). Samples (0.8 jjJ) were loaded onto the gel using an 
8 channel Hamilton Syringe pipettor. 

Each gel consisted of 62 study subjects and 2 control subjects (CEPH; 
Centre d'Etude du Polymorphisme Humain) parents ID #1331-01 and 1331-02, 
Corlell Cell Repository, Camden, NJ). Genotyping gels were scored in 

15 duplicate by investigators blind to patient identity and affection status using 
GENOTYPER analysis software V 1.1.12 (ABI; PE Applied Biosystems). 
Nuclear families were loaded onto the gel with the parents flanking the siblings 
to facilitate error detection. The final tables obtained from the GENOTYPER 
output for each gel analyzed were imported into a SYBASE Database (Dublin, 

20 CA). 

Allele calling (binning) was performed using the SYBASE version of the 
ABAS software (Ghosh et al., 1997, Genome Research 7:165-178). Offsize 
bins were checked manually and incorrect calls were corrected or blanked. 
The binned alleles were then imported into the program MENDEL (Lange et al., 

25 1988, Genetic Epidemiology, 5:471) for inheritance checking using the 
USERM 13 subroutine (Boehnke et al., 1991, Am. J. Hum. Genet. 48:22-25). 
Non-inheritance was investigated by examining the genotyping traces and, 
once all discrepancies were resolved, the subroutine USERM13 (Boehnke et 
al., 1991, Am. J. Hum. Genet. 48:22-25) was used to estimate allele 

30 frequencies. 
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EXAMPLE 3: Linkage Analysis 

Chromosomal regions harboring asthma susceptibility genes by linkage 
analysis of genotyping data and three separate phenotypes (asthma, bronchial 
hyper-responsiveness, and atopic status) were identified as follows. 
5 1. Asthma Phenotype : For the initial linkage analysis, the 

phenotype and asthma affection status were defined by a patient who 
answered the following questions in the affirmative: i) have you ever had 
asthma; ii) do you have a current physician's diagnosis of asthma; and iii) are 
you currently taking asthma medications? Medications included inhaled or oral 

10 bronchodilators, cromolyn, theophylline, or steroids. Multipoint linkage 
analyses of allele sharing in affected individuals were performed using the 
MAPMAKER/SIBS analysis program (L. Kruglyak and E.S. Lander, 1995, Am. 
J. Hum. Genet 57:439-454). The map location and distances between 
markers were obtained from the genetic maps published online by the 

15 Marshfield Medical Research Foundation, Marshfield, Wl {hypertext transfer 
protocol on the world wide web at marshmed.org/genetics). Ambiguous 
ordering of markers in the Marshfield map was resolved using the program 
MULTIMAP (T.C. Matise et al., 1994, Nature Genet. 6:384-390). 

Families with fewer than two genotyped asthmatic offspring were 

20 eliminated. Such families were due, for example, to non-paternity, sample mix- 
up, or DNA contamination. In the end, 460 pedigrees, containing 462 nuclear 
families each with at least one affected sib pair, were retained for analysis. 
Using the discrete phenotype of asthma (yes/no), a candidate region was 
identified on chromosome 20 with a LOD score of 2.94, based on the full set 

25 of 462 nuclear families. Figure 1 displays the multipoint LOD score against the 
map location of the markers along chromosome 20. A Maximum LOD Score 
(MLS) of 2.94 was obtained at location 7.9 cM, 0.3 cM proximal to marker 
D20S906. A second MLS of 2.94 was obtained at marker D20S482 at location 
12.1 cM. An excess sharing by descent (Identity By Descent (IBD) = 2) of 0.31 

30 was observed at both maximum LOD scores. Table 2 lists the single and 
multipoint LOD scores at each marker. Analyses were done using a 
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conservative approach by weighting down multiple sibling pairs within a sibship. 
When affected sib pairs were utilized in the linkage analyses without 
weighting, the LOD score on chromosome 20 maximized at D20S482 with a 
value of 3.19. These data provided strong evidence for the presence of an 
5 asthma susceptibility gene in this region of chromosome 20. 

TABLE 2 



Marker 


Distance 


Single- 
point 


Multipoint 


D20S502 


0.5 


0.7 


2.4 


D20S103 


2.1 


2.4 


2.3 


D20S117 


2.8 


1.2 


2.0 


GTC4ATG 


6.3 


2.4 


2.5 


GTC3CA 


6.6 


1.3 


2.7 


D20S906 


7.6 


2.9 


2.9 


D20S842 


9.0 


1.3 


2.5 


D20S181 


9.5 


1.8 


2.6 


D20S193 


9.5 


2.5 


2.5 


D20S889 


11.2 


1.6 


2.6 


D20S482 


12.1 


1.9 


2.9 


D20S849 


14.0 


0.8 


2.0 


D20S835 


15.1 


0.5 


1.8 


D20S448 


18.8 


1.4 


1.4 


D20S602 


21.2 


1.1 


1.1 


D20S851 


24.7 


1.0 


0.8 


D20S604 


32.9 


0.0 


0.1 


D20S470 


39.3 


0.0 


0.1 


D20S477 


47.5 


0.0 


0.0 


D20S478 


54.1 


0.0 


0.0 


D20S481 


62.3 


0.0 


0.0 


D20S480 


79.9 


0.0 


0.0 


D20S171 


95.7 


0.4 


0.1 



2. Phenotypic Subgroups : Nuclear families were ascertained by the 
presence of at least two affected siblings with a current physician's diagnosis 

10 of asthma, as well as the use of asthma medication. In the initial analysis (see 
above), the evidence was examined for linkage based on the dichotomous 
phenotype (asthma - yes/no). To further characterize the linkage signals, 
additional quantitative traits were measured in the clinical protocol. Since 
quantitative trait loci (QTL) analysis tools with correction for ascertainment 

1 5 were not available, the following approach was taken to refine the linkage and 
association analyses: 
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i. Phenotypic subgroups that could be indicative of an 
underlying genotypic heterogeneity were identified. Asthma subgroups were 
defined according to 1) bronchial hyper-responsiveness (BHR)to methacholrne 
challenge; or 2) to atopic status using quantitative measures like total serum 

5 IgE and specific IgE to common allergens. 

ii. Non-parametric linkage analyses were performed on 
subgroups to test for the presence of a more homogeneous sub-sample. If 
genetic heterogeneity was present in the sample, the amount of allele sharing 
among phenotypically similar siblings was expected to increase in the 

10 appropriate subgroup in comparison to the full sample. A narrower region of 
significant increased allele sharing was also expected to result unless the 
overall LOD score decreased as a consequence of having a smaller sample 
size and of using an approximate partitioning of the data. 

iii. Alternatively, allele sharing probabilities were 
15 parameterized as a function of the quantitative trait value of each child in a 

given sib pair, as advocated by N. Morton and implemented in his program 
BETA (N. Morton, 1996, Proc. Natl. Acad. ScL USA 93:3471-3476). This 
approach alleviated the need to dichotomize a quantitative trait. However, the 
program did not correct for the use of non-independent sib pairs in sibship of 

20 size 3 or larger. As such, it did not provide an accurate measure of the 
significance of a linkage finding, but was used to corroborate the localization 
of the linkage signal. 

3. Results for BHR and IgE : PC 20 , the concentration of 
methacholine resulting in a 20% drop in FEV-I (forced expiratory volume), was 

25 polychotomized in four groups. Analyses were performed on the subsets of 
asthmatic children with mild to severe BHR (PC 2 o ^ 4 mg/ml) or PC 2 o(4), as well 
as on the broader subset with borderline to severe BHR (PC 20 < 16 mg/ml) or 
PC 20 (16). As shown in the LOD plot in Figure 2, the MLS for the subset of 127 
nuclear families with at least two PC 20 (4) affected sibs was 2.97 at 1 1.8 cM. 

30 This was 0.3 cM from D20S482, with an excess sharing by descent of 0.37. 
As shown in Figure 3, for the 218 nuclear families with at least two PC 20 (16), 
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the MLS was 3.93 at D20S482 with an excess sharing of 0.36. Both PC 20 (4) 
and PC 20 (16) strongly implicated the region of chromosome 20 under the 
second peak around marker D20S482. When considering the more extreme 
phenotype, PC 2 o(4), a higher proportion of families was linked to the region. 
5 However, the increase in LOD score for the PC 20 (16) phenotype indicated that 
families concordant for the milder BHR phenotype also contributed to the 
linkage signal and would provide a larger pool of linked families. 

Total IgE was dichotomized using an age specific cutoff for elevated 
levels (one standard deviation above the mean). Similarly, a dichotomous 

10 variable was created using specific IgE to common allergens. An individual 
was assigned a high specific IgE value if his/her level was positive (grass or 
tree) or elevated (> 0.35 KU/L for cat, dog, mite A, mite B t alternaria, or 
ragweed) for at least one such measure. In linkage analyses, the subset of 
asthmatic children with high total IgE (274 families) was given a maximum LOD 

15 score of 2.3 at 11.6 cM (Figure 4). The subset with high specific IgE (288 
families) was given a LOD score of 1.87 at 12.1 cM (Figure 5). Similar to the 
BHR results, analyses based on IgE implicated the region under the second 
peak around marker D20S482 The substantially lower LOD scores using the 
subset of affected sibs concordant for atopy indicated the presence of groups 

20 with fewer linked families. Thus, atopy in asthmatic individuals was not the 
primary phenotype associated with the linkage signal on chromosome 20. 

The BETA program (Morton, 1996) was used on two scales for PC 20 . 
Individuals that did not drop 20% by the last dose administered (16 mg/ml) 
were assigned an arbitrary value of 32 mg/ml. First, a (0,1 )-severity scale was 

25 constructed by applying a linear transformation to PC 20 where 0 mg/ml 
received a score of 1 and 32 mg/ml received a score of 0. For this scale, 
individuals that did not drop 20% in their FEVi did not contribute to the LOD 
score. A maximum LOD score of 3.43 was achieved at 12.1 cM with marker 
D20S482. Second, a linear transformation of PC 20 was used where 0 mg/ml 

30 received a score of 1 and 32 mg/ml a score of -1 . In other words, in addition 
to the high concordant pairs, discordant pairs and concordant pairs that did not 
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drop would also contribute to the LOD score. In contrast, individuals with PC 2 o 
close to 16 mg/ml would have little impact on the LOD score. A maximum LOD 
score of 2.08 was again achieved at 12.1 cM. 

Accordingly, a consistent pattern of evidence by linkage analysis pointed 
5 to the existence of an asthma susceptibility locus in the vicinity of marker 
D20S482. This was supported by the initial analysis of the asthma (yes/no) 
phenotype and by analyses of BHR in asthmatic individuals. Localization in the 
region of marker D20S482 was obtained using both BHR and IgE phenotypes. 
EXAMPLE 4: Physical Mapping 

10 The linkage results for chromosome 20 described above were used to 

delineate a candidate region for a disorder-associated gene located on 
chromosome 20. Gene discovery efforts were initiated in a 25 cM interval from 
the 20p telomere (marker D20S502) to marker D20S851 . This represented a 
>98% confidence interval. All genes known to map to this interval were 

15 considered as candidates. Intensive physical mapping (BAG contig 
construction) focused on a 90% confidence interval between markers D20S103 
and D20S916, a 15 cM interval. The discovery of novel genes using direct 
cDNA selection focused on a 95% confidence interval between markers 
D20S502 (20p telomere) and D20S916, a 17 cM region. 

20 The following section describes the generation of cloned coverage of the 

disorder gene region on chromosome 20, i.e., the construction of a BAG contig 
spanning the region. There were two primary reasons for using this approach: 
1) to provide genomic clones for DNA sequencing (analysis of this sequence 
would provide information about the gene content of the region); and 2) to 

25 provide reagents for direct cDNA selection (this would provide additional 
information about novel genes mapping to the interval). The physical map 
consisted of an ordered set of molecular landmarks, and a set of bacterial 
artificial chromosome clones (BACs; U.-J. Kim et aL, 1996, Genomics 34:213- 
218; H. Shizuya et al., 1992, Proc. Natl. Acad. ScL USA 89:8794-8797) that 

30 contained the disorder gene region from human chromosome 20p13-p12. 

Figure 6 depicts the BAC/STS (sequence tagged site) content contig 
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map of human chromosome 20p13-p12. Markers used to screen the RPCI-1 1 
BAC library (P. deJong, Roswell Park Cancer Institute (RPCI)) are shown in the 
top row. Markers that were present in the Genome Database website (GDB; 
hypertext transfer protocol on the world wide web at gdb.org; GDB, Toronto, 
5 Canada) are represented by GDB nomenclature. The BAC clones are shown 
below the markers as horizontal lines. BAC RPCl-1 1_1098L22 is labeled, and 
the location of Gene 216, described herein, is indicated at the top of the figure. 

1 . Map Integration . Various publicly available mapping resources 
were utilized to identify existing STS markers (Olson et al., 1989, Science, 

10 245:1434-1435) in the 20p13-p12 region. Online resources included the GDB 
website, the Genethon website (hypertext transfer protocol on the world wide 
web at the site genethon.fr/genethon_en.html), the Marshfield Center for 
Medical Genetics website (hypertext transfer protocol on the world wide web 
at marshmed.org/genetics), the Whitehead Institute Genome Center website 

15 (hypertext transfer protocol on the world wide web at genome.win.mit.edu; 
Whitehead Institute, Cambridge, MA), GeneMap98, dbSTS and dbEST (NCBI), 
the Sanger Center website (hypertext transfer protocol on the world wide web 
at sanger.ac.uk; Sanger Center, Hinxton, England), and the Stanford Human 
Genome Center website (hypertext transfer protocol on the world wide web at 

20 shgc.stanford.edu; Stanford HGC, Stanford, CA). Maps were integrated 
manually to identify markers mapping to the disorder region. A list of the 
markers is provided in Table 3. 

2. Marker Development : Sequences for existing STSs were 
obtained from the GDB website, RHDB website (Radiation Hybrid Database, 

25 hypertext transfer protocol on the world wide web at ebi.ac.uk/RHdb; RHDB, 
Hinxton, England), or NCBI, and were used to pick primer pairs (overgos; see 
Table 3) for BAC library screening. Novel markers were developed either from 
publicly available genomic sequences, proprietary cDNA sequences, or from 
sequences derived from BAC insert ends (described below). Primers were 

30 chosen using a script that automatically performed vector and repetitive 
sequence masking using CROSSMATCH (P. Green, University of 
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Washington). Subsequent primer selection was performed using a customized 
online Filemaker Pro database (hypertext transfer protocol on the world wide 
web at filemaker.com; Filemaker Pro, Santa Clara, CA). Primers for use in 
PCR-based clone confirmation or radiation hybrid mapping (described below) 
5 were chosen using the program Primer3 (S. Rozen, H. Skaletsky, 2000, Mol. 
Biol. 132:365-86; hypertext transfer protocol on the world wide web at 
genome.wi.mit.edu/genome_software/other/primer3.html). 



-118- 



WO 02/083077 



PCT/US02/12063 



3 



3 



CO 



8> 



CO 



WO 02/083077 



PCT/US02/12063 



3. Radiation Hybrid (RH) Mapping : Radiation hybrid mapping was 
performed against the Genebridge4 panel (Gyapay et al. t 1996, Hum. Mol. 
Genet. 5:339-46) purchased from Research Genetics, in order to refine the 

5 chromosomal localization of genetic markers used in genotyping. Mapping was 
also performed to identify, confirm, and refine localizations of markers from 
proprietary sequences. Standard PCR procedures were used for typing the RH 
panel with markers of interest. Briefly, 10 ul PCR reactions contained 25 ng 
DNA of each of the 93 Genebridge4 RH samples. PCR products were 

10 electrophoresed on 2% agarose gels (Sigma, St. Louis, MO) containing 0.5 
ug/ml ethidium bromide in 1 XTBE at 150 volts for 45 min. 

For electrophoresis, Model A3-1 systems were used (Owl Scientific 
Products, Portsmouth, NH). Typically, gels contained 10 tiers of lanes with 50 
wells/tier. Molecular weight markers (100 bp ladder, GibcoBRL, Rockville, MD) 

15 were loaded at both ends of the gel. Images of the gels were captured with a 
Kodak DC40 CCD camera and processed with Kodak 1D software (Kodak, 
Rochester, NY). The gel data were exported as tab delimited text files; names 
of the files included information about the panel screened, the gel image files 
and the marker screened. These data were automatically imported using a 

20 customized Perl script into Filemaker databases for data storage and analysis. 
The data were then automatically formatted and submitted to an internal 
server for linkage analysis to create a radiation hybrid map using RHMAPPER 
(L. Stein et aL, 1995; available from Whitehead Institute/MIT Center for 
Genome Research, at hypertext transfer protocol on the world wide web at 

25 genome.wi.mit.edu /ftp/pub/software/rhmapper; Whitehead Institute, 
Cambridge, MA; and via anonymous ftp to ftp.genome.wi.mit.edu, in the 
directory/pub/software/rhmapper). 

4. BAC Library Screening : The protocol used for BAC library 
screening was based on the "overgo" method, originally developed by John 

30 McPherson at Washington University in St. Louis (W-W. Cai et al., 1998, 
Genomics 54:387-397). This method involved filling in the overhangs 
generated after annealing two primers. Each primer was 22 nucleotides in 
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length, and overlapped by 8 nucleotides. The resulting labeled 36 bp product 
was used in hybridization-based screening of high density grids derived from 
the RPCI-1 1 BAC library (deJong, supra). Typically, 15 probes were pooled 
together to hybridize 12 filters (13.5 genome equivalents). 
5 Stock solutions (2 uM) of combined complementary oligos (Table 3) 

were heated at 80°C for 5 min, placed at 37°C for 10 min, and then stored on 
ice. Labeling reactions included the following: 1.0 ul H 2 0; 5 ul mixed oligos 
(2 uM each); 0.5 ul BSA (2 mg/ml); 2 ul OLB (Overgo Labeling Buffer) Solution 
(see below); 0.5 ul 32 P-dATP (3000 Ci/mmol); 0.5 ul 32 P-dCTP (3000 Ci/mmol); 

10 and 0.5 ul Klenow fragment (5 U/ul). The reaction was incubated at room 
temperature for 1 hr, and unincorporated nucleotides were removed using 
Sephadex G50 spin columns (Pharmacia, Piscataway, NJ). Solution O: 1.25 
M Tris-HCL, pH 8, 125 M MgCI 2 ; Solution A: 1 ml Solution O, 18 ul 2- 
mercaptoethanol, 5ul 0.1 M dTTP, 5ul 0.1 M dGTP; Solution B: 2 M HEPES- 

15 NaOH, pH 6.6; Solution C: 3 mM Tris-HCI, pH 7.4, 0.2 mM EDTA; Solution 
OLB: Solutions A, B, and C were combined to a final ratio of 1:2.5:1.5, and 
aliquots were stored at -20°C. 

High-density BAC library membranes were pre-wetted in 2 X SSC at 
58°C. Filters were then drained slightly and placed in hybridization solution 

20 (1% BSA; 1 mM EDTA, pH 8.0; 7% SDS; and 0.5 M sodium phosphate), pre- 
warmed to 58°C, and incubated at 58°C for 2-4 hr. Typically, 6 filters were 
hybridized in each container. Ten milliliters of pre-hybridization solution was 
removed, combined with the denatured overgo probes, and added back to the 
filters. Hybridization was performed overnight at 58°C. The hybridization 

25 solution was removed and filters were washed once in 2 X SSC, 0.1% SDS, 
followed by a 30 min wash in the same solution at 58°C. Filters were then 
washed in: 1 ) 1 .5 X SSC and 0. 1 % SDS at 58°C for 30 min; 2) 0.5 X SSC and 
0.1% SDS at 58°C for 30 min; and finally in 3) 0.1 X SSC and 0.1% SDS at 
58°C for 30 min. Filters were then wrapped in Saran Wrap® and exposed to 

30 film overnight. To remove bound probe, filters were treated in 0.1 X SSC and 
0.1% SDS pre-warmed to 95°C and cooled room temperature. Clone 
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addresses were determined as described by instructions supplied by RPCL 

To recover clonal BAC cultures from the library, a sample from the 
appropriate library well was plated by streaking onto LB agar (T. Maniatis et a!., 
1982, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 
5 Cold Spring Harbor, NY) containing 12.5 ug/ml chloramphenicol (Sigma). 
Plates were incubated overnight at 37°C. A single colony and a portion of the 
initial streak quadrant were inoculated into 400 ul LB plus chloramphenicol in 
wells of a 96 well plate. Cultures were grown overnight at 37°C. For storage, 
100 ul of 80% glycerol was added and the plates placed at -80°C. 

10 To determine the marker content of clones, aliquots of the 96 well plate 

cultures were transferred to the surface of nylon filters (GeneScreen Plus, 
NEN) placed on LB/chloramphenicol Petri plates. Colonies were grown 
overnight at 37°C and colony lysis was performed by placing filters on pools of: 
1 ) 10% SDS for 3 min; 2) 0.5 N NaOH and 1 .5 M NaCI for 5 min; and 3) 0.5 

15 M Tris-HCl, pH 7.5, and 1 M NaCI for 5 min. Filters were then air-dried and 
washed free of debris in 2 X SSC for 1 hr. The filters were air-dried for at least 
1 hr and DNA was crosslinked linked to the membrane using standard 
conditions. Probe hybridization and filter washing were performed as 
described above for the primary library screening. Confirmed clones were 

20 stored in LB containing 15% glycerol. 

In certain cases, polymerase chain reaction (PCR) was used to confirm 
the marker content of clones. PCR conditions for each primer pair were initially 
optimized with respect to MgCI 2 concentration. The standard buffer was 10 
mM Tris-HCI (pH 8.3), 50 mM KCl, MgCI 2 , 0.2 mM each dNTP, 0.2 uM each 

25 primer, 2.7 ng/ul human DNA, 0.25 units of AmpliTaq (Perkin Elmer) and MgC! 2 
concentrations of 1.0 mM, 1.5 mM, 2.0 mM or 2.4 mM. Cycling conditions 
included an initial denaturation at 94°C for 2 min; followed by 40 cycles at 94°C 
for 1 5 sec, 55°C for 25 sec, and 72°C for 25 sec; followed by a final extension 
at 72°C for 3 min. Depending on the results from the initial round of 

30 optimization the conditions were further optimized. Variables included 
increasing the annealing temperature to 58°C or 60°C, increasing the cycle 
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number to 42 and the annealing and extension times to 30 sec, and using 
AmpliTaqGold (Perkin Elmer). 

5. BAC DNA Preparation : Several different types of DNA 
preparation methods were used for isolation of BAC DNA. The manual 
5 alkaline lysis miniprep protocol listed below (Maniatis et aL, 1982) was 
successfully used for most applications, i.e., restriction mapping, CHEF gel 
analysis and FISH mapping, but was not reproducibly successful in 
endsequencing. The Autogen protocol was used specifically for BAC DNA 
preparation for endsequencing. 

10 For manual alkaline lysis BAC minipreps, bacteria were grown in 15 ml 

terrific broth (TB) containing 12.5 ug/ml chloramphenicol. Cultures were placed 
in a 50 ml conical tube at 37°C for 20 hr with shaking at 300 rpm. The cultures 
were centrifuged in a Sorvall RT 6000 D (Sorvall, Newton, CT) at 3000 rpm 
(1800 x g) at 4°C for 15 min. The supernatant was then aspirated as 

1 5 completely as possible. In some cases cell pellets were frozen at -20°C at this 
step for up to 2 weeks. The pellet was then vortexed to homogenize the cells 
and minimize dumping. Following this, 250 ul of P1 solution (50 mM glucose, 
1 5 mM Tris-HCI, pH 8, 10 mM EDTA, and 100 ug/ml RNase A) was added, and 
the mixture was pipetted up and down to mix. The mixture was then 

20 transferred to a 2 ml Eppendorf tube. Subsequently, 350 ul of P2 solution (0.2 
N NaOH, 1% SDS) was added, mixed gently, and the mixture was incubated 
for 5 min at room temperature. Then, 350 ul of P3 solution (3 M KOAc, pH 5.5) 
was added and mixed gently until a white precipitate formed. The solution was 
incubated on ice for 5 min and then centrifuged at 4°C in a microfuge for 10 

25 min. 

The supernatant was transferred carefully (avoiding the white 
precipitate) to a fresh 2 ml Eppendorf tube. Then, 0.9 ml of isopropanol was 
added, and the solution was mixed and left on ice for 5 min. The samples 
were centrifuged for 10 min, and the supernatant removed carefully. Pellets 
30 were washed in 70% ethanol and air-dried for 5 min. Pellets were 
resuspended in 200 pi of TE8 (10 mM Tris-HCI, pH 8.0, 1.0 mM EDTA, pH 
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8.0), and RNase (Boehringer Mannheim, Indianapolis, IN; hypertext transfer 
protocol at biochem.boehringer-mannheim.com) added to 100 ug/ml. Samples 
were incubated at 37°C for 30 min. DNA was precipitated by addition of 
NH 4 OAc to 0.5 M and 2 volumes of ethanol. Samples were centrifuged for 10 
5 min, and the pellets were washed with 70% ethanol. The pellets were air-dried 
and dissolved in 50 ul TE8. Typical yields for this DNA prep were 3-5 ug per 
15 ml bacterial culture. Ten to fifteen microliters of DNA was used for EcoRl 
restriction analysis. Five microliters was used for Not\ digestion and clone 
insert sizing by CHEF gel electrophoresis. 

10 Autogen 740 BAC DNA preparations for endsequencing were made by 

dispensing 3 ml of LB media containing 12.5 ug/ml of chloramphenicol into 
autoclaved Autogen tubes. A single tube was used for each clone. For 
inoculation, glycerol stocks were removed from -70°C storage and placed on 
dry ice. A small portion of the glycerol stock was removed from the original 

15 tube with a sterile toothpick and transferred into the Autogen tube. The 
toothpick was left in the Autogen tube for at least two min before discarding. 
After inoculation the tubes were covered with tape to ensure that the seal was 
tight. When all samples were inoculated, the tubes were transferred into an 
Autogen rack holder and placed into a rotary shaker. Cultures were incubated 

20 at 37°C for 16-17 hr at 250 rpm. Following this, standard conditions for BAC 
DNA preparation, as defined by the manufacturer, were used to program the 
Autogen. However, samples were not dissolved in TE8 as part of the program. 
Instead, DNA pellets were left dry. 

When the program was completed, the tubes were removed from the 

25 output tray and 30 pi of sterile distilled and deionized H 2 0 was added directly 
to the bottom of the tube. The tubes were then gently shaken for 2-5 sec and 
then covered with parafilm and incubated at room temperature for 1-3 hr. DNA 
samples were then transferred to an Eppendorf tube and used either directly 
for sequencing or stored at 4°C for later use. 

30 6. BAC Clone Characterization : DNA samples prepared either by 

manual alkaline lysis or the Autogen protocol were digested with EcoRl for 
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analysis of restriction fragment sizes. These data were used to compare the 
extent of overlap among clones. Typically 1-2 ug were used for each reaction. 
Reaction mixtures included: 1 X Buffer 2 (NEB, Beverly, MA); 0.1 mg/ml BSA 
(NEB); 50 ug/ml RNase A (Boehringer Mannheim); and 20 units of EcoRI 
5 (NEB) in a final volume of 25 ul. Digestions were incubated at 37°C for 4-6 hr. 
BAC DNA was also digested with Not\ for estimation of insert size by CHEF 
gel analysis (see below). Reaction conditions were identical to those for 
EcoRI, except that 20 units of Not\ were used. Six microliters of 6 X Ficoll 
loading buffer containing bromophenol blue and xylene cyanol was added prior 

10 to electrophoresis. 

EcoRI digests were analyzed on 0.6% agarose (Seakem, FMC 
Byproducts, Rockland, ME) in 1XTBE containing 0.5 ug/ml ethidium bromide. 
Gels (20 cm x 25 cm) were electrophoresed in a Model A4 electrophoresis unit 
(Owl Scientific) at 50 volts for 20-24 hr. Molecular weight size markers 

15 included undigested lambda DNA, Hind\\\ digested lambda DNA, and Haelll 
digested .X1 74 DNA. Molecular weight markers were heated at 65°C for 2 min 
prior to loading the gel. Images were captured with a Kodak DC40 CCD 
camera and analyzed with Kodak 1 D software. 

Not\ digests were analyzed on a CHEF DRII (Bio-Rad, Hercules, CA) 

20 electrophoresis unit according to the manufacturer's recommendations. Briefly, 
1% agarose gels (Bio-Rad pulsed field grade) were prepared in 0.5 X TBE. 
Gels were equilibrated for 30 min in the electrophoresis unit at 14°C, and 
electrophoresed at 6 volts/cm for 14 hr with circulation. Switching times were 
ramped from 10 sec to 20 sec. Gels were stained after electrophoresis in 0.5 

25 ug/ml ethidium bromide. Molecular weight markers included undigested 
lambda DNA, Hind\\\ digested lambda DNA, lambda ladder PFG ladder, and 
low range PFG marker (all from NEB). 

7. BAC Endseauencinq : The sequence of BAC insert ends utilized 
DNA prepared by either of the two methods described above. The ends of 

30 BAC clones were sequenced for the purpose of filling gaps in the physical map 
and for gene discovery information. The following vector primers specific to the 
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BAC vector pBACe3.6 were used to generate endsequence from BAC clones: 
pBAC 5'-2 (TGT AGG ACT ATA TTG CTC; SEQ ID NO:56) and pBAC 3'-1 
(CGA CAT TTA GGT GAC ACT; SEQ ID NO:57). 

The ABI dye-terminator sequencing protocol was used to set up 
5 sequencing reactions for 96 clones. The BigDye (ABI; PE Applied Biosystems) 
Terminator Ready Reaction Mix with AmpliTaq" FS, Part number 4303151 , was 
used for sequencing with fluorescently labeled dideoxy nucleotides. A master 
sequencing mix was prepared for each primer reaction set including: 1600 ul 
of BigDye terminator mix (ABI; PE Applied Biosystems); 800 ul of 5 X CSA 

1 0 buffer (ABI; PE Applied Biosystems); and 800 ul of primer (either pBAC 5-2 or 
pBAC 3-1 at 3.2 uM). The sequencing cocktail was vortexed to ensure it was 
well-mixed and 32 ul was aliquotted into each PCR tube. Eight microliters of 
the Autogen DNA for each clone was transferred from the DNA source plate 
to a corresponding well of the PCR plate. The PCR plates were sealed tightly 

1 5 and centrifuged briefly to collect all the reagents. Cycling conditions were as 
follows: 1) 95°C for 5 min; 2) 95°C for 30 sec; 3) 50°C for 20 sec; 4) 65°C for 
4 min; 5) steps 2 through 4 were repeated 74 times; and 6) samples were 
stored at 4°C. 

At the end of the sequencing reaction, the plates were removed from the 
20 thermocycler and centrifuged briefly. CentrrSep 96-well plates (Princeton 
Separations Inc., Adelphia, NJ) were then used according to manufacturer's 
recommendations to remove unincorporated nucleotides, salts, and excess 
primers. Each sample was resuspended in 1 .5 ul of loading dye, and 1 .3 pi of 
the mixture was loaded on ABI 377 Fluorescent Sequencers. The resulting 
25 endsequences were then used to develop markers to rescreen the BAC library 
for filling gaps and were also analyzed by BLASTN2 searching for EST or gene 
content in GenBank. 

EXAMPLE 5: Subcloning and Sequencing of BAC RPC1-11 1098L22 

The physical map of the chromosome 20 region provided the location 
30 of the BAC RPCI-1 1_1 098L22 clone that contains Gene 21 6 (see Figure 6). 
The BAC RPCI-1 1_1098L22 clone was deposited as clone RP1 1-1098L22 
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with the American Type Culture Collection (ATCC), 10801 University Blvd., 
Manassas, VA 201 10-2209 USA, under ATCC Designation No. PTA-3171, on 
March 14, 2001 according to the terms of the Budapest Treaty. DNA 
sequencing of BAC RPCI-1 1-1 098L22 from the region was completed. BAC 
5 RPCI-1 1 -1 098L22 DNA, (the "BAC DNA") was isolated according to one of two 
protocols: either a QIAGEN purification (QIAGEN, Inc., Valencia, CA, per 
manufacturer's instructions) or a manual purification using a method which was 
a modification of the standard alkaline lysis/cesium chloride preparation of 
plasmid DNA (see e.g., F.M. Ausubel et al., 1997, Current Protocols in 

1 0 Molecular Biology, John Wiley & Sons, New York, NY). Briefly, for the manual 
protocol, cells were pelleted, resuspended in GTE (50 mM glucose, 25 mM 
Tris-CI (pH 8), 10 mM EDTA) and lysozyme (50 mg/ml solution), followed by 
addition of NaOH/SDS (1% SDS and 0.2 N NaOH) and then an ice-cold 
solution of 3 M KOAc (pH 4.5-4.8). RnaseA was added to the filtered 

1 5 supernatant, followed by treatment with Proteinase K and 20% SDS. The DNA 
was then precipitated with isopropanol, dried, and resuspended in TE (10 mM 
Tris, 1 mM EDTA - pH 8.0). The BAC DNA was further purified by cesium 
chloride density gradient centrifugation (Ausubel et al., 1997). 

Following isolation, the BAC DNA was hyd rodynamical I y sheared using 

20 HPLC (Hengen et al., 1997, Trends in Biochem. Sc/., 22:273-274) to an insert 
size of 2000-3000 bp. After shearing, the DNA was concentrated and 
separated on a standard 1% agarose gel. A single fraction, corresponding to 
the approximate size, was excised from the gel and purified by electroelution 
(Sambrook et al., 1989). 

25 The overhangs of the purified DNA fragments were filled-in using T4 

DNA polymerase. The blunt-ended DNA was ligated to unique BsfXI-linker 
adapters in 100-1000 fold molar excess. The sequence of the adapters was: 
5' GTCTTCACCACGGGG (SEQ ID NO:58) and 5* GTGGTGAAGAC (SEQ ID 
NO:59). The linkers were complimentary to the BsfXI-cut pMPX vectors, but 

30 the overhangs were not self-complimentary. Therefore, it was expected that 
the linkers would not concatemerize, and that the cut-vector would not re-ligate 
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on itself. The linker-adapted inserts were separated from unincorporated 
linkers on a 1% agarose gel and purified using GeneClean (BIO 101, Inc., 
Vista, CA). The linker-adapted insert was then ligated to a modified 
pBlueScript vector to construct a "shotgun" subclone library. The vector 
5 contained an out-of-frame lacZ gene at the cloning site, which became in- 
frame in the event that an adapter-dimer was cloned. Such adapter-dimer 
clones gave rise to blue colonies, which were avoided. 

All subsequent steps were based on sequencing by ABI377 automated 
DNA sequencing methods. Major modifications to the protocols are highlighted 

10 below. Briefly, the library was transformed into DH5a -competent cells 
(GibcoBRL, DH5a -transformation protocol). Transformants were plated onto 
LB plates containing ampicillin (50 ug/ml) and IPTG/X-gal. The plates were 
incubated overnight at 37°C. White colonies were identified and then used to 
plate individual clones for sequencing. The cultures were grown overnight at 

1 5 37°C. DNA was purified using a silica bead DNA preparation method (Ng et 
al., 1996, Nucl. Acids Res., 24:5045-5047). In this manner, 25 ug of DNA was 
obtained per clone. 

These purified DNA samples were sequenced using ABI dye-terminator 
chemistry. The ABI dye terminator sequence reads were run on ABI377 

20 machines and the data were directly transferred to UNIX machines following 
lane tracking of the gels. All reads were assembled using PHRAP (P. Green, 
Abstracts of DOE Human Genome Program Contractor-Grantee Workshop V, 
Jan. 1996, p. 157) with default parameters and quality scores. The assembly 
was done at 8-fold coverage and yielded 1 contig, BAC RPCI-11-1098L22. 

25 SEQ ID NO:5 (Figure 7) comprises a portion of the BAC that includes the 
genomic sequence of Gene 216. 
EXAMPLE 6: Gene Identification 

Any gene or EST mapping to the interval based on public map data or 
proprietary map data was considered a candidate respiratory disease gene. 

30 Public map data were derived from several online sources: the Genome 
Database website (GDB), the Whitehead Institute Genome Center website, 
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GeneMap98, UniGene, OMIM, dbSTS and dbEST (NCBI) the Sanger Center 
website, and the Stanford Human Genome Center website. Proprietary data 
was obtained from sequencing genomic DNA (cloned into BACs) or cDNAs 
(identified by direct selection, screening of cDNA libraries, or full length 
5 sequencing of the IMAGE Consortium cDNA clones available online (hypertext 
transfer protocol on the world wide web at bio. 1 1nl.gov/bbrp/image.html). 

1 . Gene Identification from clustered DNA fragments . DNA 
sequences corresponding to gene fragments in public databases (GenBank 
and human dbEST) and proprietary cDNA sequences (IMAGE consortium and 

10 direct selected cDNAs) were masked for repetitive sequences and clustered 
using the PANGEA Systems (Oakland, CA) EST clustering tool. The clustered 
sequences were then subjected to computational analysis to identify regions 
bearing similarity to known genes. This protocol included the following steps: 
a. The clustered sequences were compared to the publicly available 

15 UniGene database (NCBI) using the BLASTN2 algorithm (Altschul et al., 1997). 
The parameters for this search were: E = 0.05, v = 50, B = 50, where E was 
the expected probability score cutoff, V was the number of database entries 
returned in the reporting of the results, and B was the number of sequence 
alignments returned in the reporting of the results (Altschul et al., 1990). 

20 b. The clustered sequences were compared to the GenBank database 

(NCBI) using BLASTN2 (Altschul et al., 1 997). The parameters for this search 
were E=0.05, V=50, B= 50, where E, V, and B were defined as above. 

c. The clustered sequences were translated into protein sequences for 
all six reading frames, and the protein sequences were compared to a non- 
25 redundant protein database compiled from GenPept Swissprot PIR (NCBI). 

The parameters for this search were E = 0.05, V = 50, B = 50, where E, V, and 
B were defined as above. 

d. The clustered sequences were compared to BAC sequences (see 
below) using BLASTN2 (Altschul et al., 1997). The parameters for this search 

30 were E=0.05, V=50, B=50, where E, V, and B were defined as above. 

2. Gene Identification from BAC Genomic Sequence : Following 
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assembly of the BAC sequences into contigs, the contigs were subjected to 
computational analyses to identify coding regions and regions bearing DNA 
sequence similarity to known genes. This protocol included the following steps: 

a. Contigs were degapped. The sequence contigs often contained 
5 symbols (denoted by a period symbol) that represented locations where the 

individual ABi sequence reads had insertions or deletions. Prior to automated 
computational analysis of the contigs, the periods were removed. The original 
data were maintained for future reference. 

b. BAC vector sequences were "masked" within the sequence by using 
10 the program CROSSMATCH (P. Green, University of Washington). The 

shotgun library construction detailed above left some BAC vector in the 
shotgun libraries. Accordingly, the CROSSMATCH program was used to 
compare the sequence of the BAC contigs to the BAC vector and to mask any 
vector sequence prior to subsequent steps. Masked sequences were marked 
15 by "X" in the sequence files, and remained inert during subsequent analyses. 

c. E. coll sequences contaminating the BAC sequences were masked 
by comparing the BAC contigs to the entire E. coli DNA sequence. 

d. Repetitive elements known to be common in the human genome 
were masked using CROSSMATCH (P. Green, University of Washington). In 

20 this implementation of CROSSMATCH, the BAC sequence was compared to 
a database of human repetitive elements (J. Jerka, Genetic Information 
Research Institute, Palo Alto, CA). The masked repeats were marked by "X" 
and remained inert during subsequent analyses. 

e. The location of exons within the sequence was predicted using the 
25 MZEF computer program (Zhang, 1997, Proc. Natl. Acad. ScL, 94:565-568) 

and GenScan gene prediction program (Burge and Karlin, J. Mol. B/o/. f 268:78- 
94). 

f. The sequence was compared to the publicly available UniGene 
database (NCBI) using the BLASTN2 algorithm (Altschul et al., 1997). The 

30 parameters for this search were: E = 0.05, v = 50, B = 50, where E was the 
expected probability score cutoff, V was the number of database entries 
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returned in the reporting of the results, and B was the number of sequence 
alignments returned in the reporting of the results (Altschul etal., 1990). 

g. The sequence was translated into protein sequences for ad six 
reading frames, and the protein sequences were compared to a non-redundant 

5 protein database compiled from GenPept, Swissprot, and PIR (NCBI), The 
parameters for this search were E = 0.05, V = 50, B = 50, where E, V, and B 
were defined as above. 

h. The BAC DNA sequence was compared to a database of clustered 
sequences using the BLASTN2 algorithm (Altschul et a!., 1997). The 

10 parameters for this search were E = 0.05, V = 50, B = 50, where E, V, and B 
were defined as above. The database of clustered sequences was prepared 
utilizing a proprietary clustering technology (PANGEA Systems, Inc.). The 
database included cDNA clones derived from direct selection experiments 
(described below), human dbEST sequences mapping to the 20p13-p12 

15 region, proprietary cDNAs, GenBank genes, and IMAGE consortium cDNA 
clones. 

i. Using the BLASTN2 algorithm (Altschul et al. v 1997), the BAC 
sequence was compared to the sequences derived from the ends of BACs 
from the region on chromosomes 20. The parameters for this search were E 

20 = 0.05, V = 50, B = 50, where E, V, and B were defined as above. 

j. The BAC sequence was compared to the GenBank database (NCBI) 
using the BLASTN2 algorithm (Altschul et al., 1997). The parameters for this 
search were E = 0.05, V = 50, B = 50, where E, V, and B were defined as 
above. 

25 k. The BAC sequence was compared to the STS division of GenBank 

database (NCBI) using the BLASTN2 algorithm (Altschul et al., 1997). The 
parameters for this search were E=0.05, V=50, B= 50, where E, V, and B were 
defined as above. 

I. The BAC sequence was compared to the Expressed Sequence Tag 

30 (EST) GenBank database (NCBI) using the BLASTN2 algorithm (Altschul et al., 
1 997). The parameters for this search were E = 0.05, V = 50, B = 50, where 
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E, V, and B were defined as above. 
3. Mapping Analysis 

Through mapping analysis, BAC RPCI-1 1_1098L22 (ATCC Designation 
No. PTA-3171) was identified as containing Gene 216. This BAC sequence 
5 (SEQ ID NO:5, Figure 7) included the genomic sequence of Gene 216 (SEQ 
ID NO:6; Figure 29), which corresponded to the cDNA sequence of Gene 216 
(SEQ ID NO:1; Figure 24). 

EXAMPLE 7: Gene 216 cDNA Cloning and Expression Analysis 

1 . Construction and screening of cDNA libraries : Directionally 

10 cloned cDNA libraries from normal lung and bronchial epithelium were 
constructed using standard methods (Scares et al., 1994, Automated DNA 
Sequencing and Analysis, Adams et al. (eds), Academic Press, NY, pp. 110- 
114). Total and cytoplasmic RNAs were extracted from tissue or cells by 
homogenizing samples in the presence of guanidinium thiocyanate-phenol- 

15 chloroform extraction buffer (e.g. Chomczynski and Sacchi, 1987, Anal. 
Biochem., 162:156-159) using a polytron homogenizer (Brinkman Instruments, 
Westbury, NY). Poly(A)+ RNA was isolated from total/cytoplasm ic RNA using 
dynabeads-dT according to the manufacturer's recommendations (Dynal, Inc., 
Lake Success, NY). The double stranded cDNA was then ligated into the 

20 plasmid vector pBluescript II KS+ (Stratagene, La Jolla, CA), and the ligation 
mixture was transformed into E. coli host DH10B or DH12S by electroporation 
(Soares et al., 1994). Transformants were grown at 37°C overnight. DNA was 
recovered from the E. coli colonies after scraping the plates by processing as 
directed for the Mega-prep kit (QIAGEN). The quality of the cDNA libraries was 

25 estimated by 1) counting a portion of the total number of primary 
transformants; 2) determining the average insert size; and 3) calculating the 
percentage of plasmid s with no cDNA insert. Additional cDNA libraries (human 
total brain, heart, kidney, leukocyte, and fetal brain) were purchased from Life 
Technologies (Bethesda, MD). 

30 cDNA libraries were used for isolating cDNA clones mapped within the 

disorder critical region. The libraries were oligo (dT) and random hexamer- 
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primed. Four 10x10 arrays of each of the cDNA libraries were prepared as 
follows. The cDNA libraries were titered to 2.5 x 10 6 cfu using primary 
transformants. The appropriate volume of frozen stock was used to inoculate 
2 L of LB with ampicillin (100 ug/u! final concentration). Four hundred aliquots 
5 containing 4 ml of the inoculated liquid culture were generated. Each tube 
contained about 5000 cfu (colony forming units). The tubes were incubated at 
30°C overnight with shaking until an OD of 0.7-0.9 was obtained. Frozen 
stocks were prepared for each of the cultures by aliquotting 300 ul of culture 
and 100 ul of 80% glycerol. Stocks were frozen in a dry ice/ethanol bath and 

10 stored at -70°C. DNA was isolated from the remaining culture using the 
QIAGEN spin mini-prep kit according to the manufacturer's instructions. The 
DNA from the 400 cultures was pooled to make 40 column pools and 40 row 
pools. For this, 4 boxes were prepared; each box contained 10 rows and 10 
columns of samples to yield a total of 40 rows and 40 columns of samples. 

15 Markers were designed to amplify putative exons from candidate genes. 
Standard PCR conditions were identified, and specific cDNA libraries were 
determined to contain cDNA clones of interest. Then, the markers were used 
to screen the arrayed library. Positive addresses indicating the presence of 
cDNA clones were confirmed by a second PCR using the same markers. 

20 Once a cDNA library was identified as likely to contain cDNA clones 

corresponding to a transcript of interest from the disorder critical region, it was 
used to isolate one or more clones containing cDNA inserts. This was 
accomplished by a modification of the standard "colony screening" method 
(Sambrook et al., 1989). Specifically, twenty 150 mm LB plus ampicillin agar 

25 plates were spread with 20,000 cfu of cDNA library. Colonies were allowed to 
grow overnight at 37°C. Colonies were then transferred to nylon filters 
(Hybond from Amersham-Pharmacia, Piscataway, NJ, or equivalent). 
Duplicates were prepared by pressing two filters together essentially as 
described (Sambrook et al., 1989). The "master" plate was then incubated 

30 another 6-8 hr to allow for additional growth. The DNA from the bacterial 
colonies was then bound to the nylon filters by treating the filters sequentially 
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with denaturing solution (0.5 N NaOH, 1.5 M NaCl)for2 min, and neutralization 
solution (0.5 M Tris-CI pH 8.0, 1.5 M NaCI) for 2 min. This was performed 
twice. The bacterial colonies were removed from the filters by washing the 
filters in a solution of 2 X SSC/2% SDS for 1 min while rubbing with tissue 
5 paper. The filters were air-dried and baked under vacuum at 80°C for 1-2 hr 
to crosslink the DNA to the filters. 

cDNA hybridization probes were prepared by random hexamer labeling 
(Fineberg and Vogelstein, 1983, Anal. Biochem., 132:6-13). For small 
fragments, gene-specific primers were included in the reaction, and random 

10 hexamers were omitted. The colony membranes were then pre-washed in 10 
mM Tris-CI pH 8.0, 1 M NaCI, 1 mM EDTA, and 0.1% SDS for 30 min at 55°C. 
Following the pre-wash, the filters were pre-hybridized at 42°C for 30 min. 
Prehybridization solution (> 2 ml/filter) contained 6 X SSC, 50% deionized 
formamide, 2% SDS, 5 X Denhardt's solution, and 100 mg/ml denatured 

15 salmon sperm DNA. Filters were then transferred to hybridization solution 
containing denatured a- 32 P-dCTP-labeled cDNA probe, and hybridized 
overnight at 42°C. Hybridization solution included 6 X SSC, 2% SDS, 5 X 
Denhardt's, and 100 mg/ml denatured salmon sperm DNA. 

The following morning, the filters were washed in 2 X SSC and 2% SDS 

20 at room temperature for 20 min with constant agitation. Two more washes 
were performed at 65 D C for 15 min each. A fourth wash was performed in 0.5 
X SSC and 0.5% SDS for 1 5 min at 65°C. Filters were then wrapped in plastic 
wrap and exposed to radiographic film. Individual colonies from the plates 
were aligned with the autoradiograph. Positive clones were picked into a 1 ml 

25 solution of LB Broth containing ampicillin. After shaking at 37°C for 1-2 hr, 
aliquots of the solution were plated on 150 mm plates for secondary screening. 
Secondary screening was identical to primary screening (above) except that 
it was performed on plates containing -250 colonies. This allowed individual 
colonies to be clearly identified. Positive cDNA clones were characterized by 

30 restriction endonuclease cleavage, PCR, and direct sequencing to confirm the 
sequence identity between the original probe and the isolated clone. 
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To obtain the full-length cDNA, novel sequence from the 5-end of the 
clone was used to reprobe the library. The sequence of the probes were 
clone-dependent. Reprobing was repeated until the length of the cDNA cloned 
matched that of the mRNA, estimated by Northern analysis. Utilizing this 
5 process, a single uterus clone was isolated as clone Gene 216_CS759. This 
clone was deposited with the American Type Culture Collection (ATCC), 1 0801 
University Blvd., Manassas, VA 20110-2209 USA, under ATCC Designation 
No. PTA-3173, on March 14, 2001, according to the terms of the Budapest 
Treaty. 

10 The uterus clone (SEQ ID NO:3) contained the entire Gene 216 open 

reading frame. Both strands of this clone were completely sequenced and the 
data were compared against the BAC sequence. Any discrepancies were 
flagged, and these regions were resequenced. Final analysis revealed that the 
uterine clone was 3433 bp long and contained the full complement of exons 

15 defining the open reading frame of Gene 216 (SEQ ID NO:3). In addition, the 
uterine clone contained a small portion of the Gene 216 5' untranslated region 
(5 bp), the entire 3' untranslated region with a polyadenylation signal, and a 
poly(A)+ tail of 76 bp in length. The Gene 216 open reading frame was 
determined to be 2436 bp in length and to encode a protein of 812 amino acids 

20 (SEQ ID NO:363) . Analysis of the composition of SNPs across the cDNA 
clone revealed that it contained the most frequent haplotype (Figure 8, see 
below). 

Rapid Amplification of cDNA ends (RACE) was performed following the 
manufacturer's instructions using a Marathon cDNA Amplification Kit. 

25 (CLONTECH). This method was used to clone the 5' and 3' ends of candidate 
genes. cDNA pools were prepared from total RNA by performing first strand 
synthesis. For first strand synthesis, a sample of total RNA sample was mixed 
with a modified oligo (dT) primer, heated to 70°C, and cooled on ice. The 
sample was then incubated with 5 X first strand buffer (CLONTECH), 10 mM 

30 dNTP mix, and AMV Reverse Transcriptase (20 U/ul). The reaction mixture 
was incubated at 42°C for 1 hr, and then placed on ice. 
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For second-strand synthesis, the components were added directly to the 
reaction tube. These included template, 5 X second-strand buffer 
(CLONTECH), 10 mM dNTP mix, sterile water, and 20 X second-strand 
enzyme cocktail (CLONTECH). The reaction mixture was incubated at 16°C 
5 for 1.5 hr. T4 DNA Polymerase was added to the reaction mixture and 
incubated at 16°C for 45 min. The second-strand synthesis was terminated 
with the addition of an EDTA/Glycogen mix. The sample was purified by 
phenol/chloroform extraction and ammonium acetate precipitation. The cDNA 
pools were checked for quality by analyzing on an agarose gel for size 
10 distribution. 

Marathon cDNA adapters(CLONTECH) were then ligated onto the cDNA 
ends using the standard protocol recommended by the manufacturer. The 
specific adapters contained priming sites that allowed for amplification of either 
5 f or 3' ends, and varied depending on the orientation of the gene specific 

1 5 primer (GSP) that was chosen. An aliquot of the double stranded cDNA was 
added to 10 uM Marathon cDNA adapter, 5 X DNA ligation buffer, T4 DNA 
ligase. The reaction was incubated at 16°C overnight and heat treated to 
terminate the reaction. PCR was performed by the addition of the following to 
the diluted double stranded cDNA pool: 10 X cDNA PCR reaction buffer, 10 uM 

20 dNTP mix, 10 uM GSP, 10 uM AP1 primer (kit), 50 X Advantage cDNA 
Polymerase Mix. 

Thermal cycling conditions were carried out at 94° C for 30 sec; followed 
by 5 cycles of 94°C for 5 sec, 72°C for 4 min, 5 cycles of 94°C for 5 sec; 
followed by 70°C for 4 min; followed by 23 cycles of 94°C for 5 sec; 68°C for 

25 4 min. The first round of PCR was performed using the GSP to extend to the 
end of the adapter to create the adapter primer-binding site. Following this, 
exponential amplification of the specific cDNA of interest was performed. 
Usually, a second, nested PCR was performed to provide specificity. The 
RACE product was analyzed on an agarose gel. Following gel excision and 

30 purification (GeneClean, BIO 101 ), the RACE product was cloned into pCTNR 
(General Contractor DNA Cloning System, 5' - 3*, Inc.) and sequenced to verify 
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that the clone was specific to the gene of interest. 

The 5' RACE technique was employed to identify the 5' untranslated 
region of Gene 216. Experiments were performed using lung mRNA and a 
primer that hybridized near the 5" end of the available sequence. The result of 
5 the experiment identified an additional 75 bp 5' of that present in the uterus 
cDNA clone (rt690; SEQ ID NO:351). This sequence was subsequently cloned 
and deposited with the ATCC (American Type Culture Collection, 10801 
University Blvd., Manassas, VA 20110-2209 USA), as clone Gene 216_rt690, 
under ATCC Designation No.PTA-3172 on March 14, 2001, according to the 

1 0 terms of the Budapest Treaty. 

Further attempts to extend the 5' end of Gene 216 by 5' RACE gave 
similar results indicating that the 5' end of the transcript was obtained. 

This sequence in combination with the uterus cDNA clone yielded the 
master consensus sequence containing the 5' to 3' cDNA for Gene 216 (SEQ 

15 ID NO:1; Figure 24). 

2. Identification of Splice Variants : Additional cDNA clones were 
isolated and determined to represent alternatively spliced variants of Gene 
216. To ensure that all splice variants present in lung tissue were identified, 
an RT-PCR-based screening protocol was designed using multiple primer pairs 

20 spanning the entire gene. These amp I icons produced PCR fragments of 
approximately 600 bp and overlapped by approximately 100 bp. The PCR 
products were fractionated on agarose gels and any fragments that were 
different from the expected size were cloned and sequenced. The results are 
summarized in Figures 9 and 10. The availability of the complete genomic 

25 sequence of BAC RPCI-11_1098L22 enabled the intron/exon structure of 
Gene 21 6 (Figure 1 1 ) to be determined. Gene 21 6 was determined to contain 
22 exons that spanned approximately 23.5 kb of genomic DNA. 

Analysis of the sequence surrounding the intron/exon boundaries of 
Gene 216 indicated that the consensus splice sequence GT/AG was used in 

30 all cases (Table 4). However, in several of the cDNA clones, the use of an 
alternative splice site at the intron/exon boundary of exon V was observed. 
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The sequence CAGCAG was observed at the border of intron UV and exon V. 
The CAGCAG sequence represented a duplication of the canonical acceptor 
splice consensus CAG. The CAG sequence is found in approximately 65% of 
all known acceptor splice sites. Where there is a duplication of the CAG 
5 sequence, the splicing machinery can utilize either AG sequence as an 
acceptor site. If the first AG (splice site 1) is used, the resulting sequence 
encodes an alanine. If the second AG (splice site 2) is used, this alanine is 
deleted. Accordingly, use of the first AG in the intron/exon boundary of exon 
V of Gene 216 produces a splice variant that encodes the amino acid 

10 sequence DPQADQVQM (Figure 12) (SEQ ID NO:60). Use of the second AG 
produces a splice variant that encodes the amino acid sequence DPQDQVQM 
(Figure 1 2) (SEQ ID NO:61 ). 

It is noted that the percentage of clones that used splice site 1 or splice 
site 2 could not be accurately determined from the dataset because the 

15 majority of the clones were derived from PCR-based techniques. Typically, 
there is bias in PCR reactions that results in the amplification of one splicing 
product over another. The amplified products, once cloned, may not reflect the 
true percentage of splicing products in the total population. For example, small 
splicing products are preferentially amplified over larger ones, and the loss or 

20 gain of an exon will skew the relative ratio of one splicing product to another. 

TABLE 4 



EXON 


3' INTRON 


5 EXON 


3? EXON 


3 INTRON 


A 






AAG 


GTGAGG 


8 


CAG 




CCG 


GTCAGT 


C 


CAG 


GTC 


CCA 


GTGAGT 


D 


CAG 


CAG 


ACG 


GTGAGA 


D(ALT) 


CAG 


CAG 


GAG 


GTACCC 


B 


TAG 


GAT 


GAG 


GTGAGC 


F 


TAG 


TGG 


AGG 


GTCAGG 


G 


CAG 


GGC 


CTG 


GTGAGG 


H 


CAG 


TTC 


CAG 


GTTGGG 




CAG 


CTT 


CAC 


GTGGGT 


J 


CAG 


GGG 


ACG 


GTGAGC 


K 


CAG 


GAC 


CGG 


GTACGC 


L 


TAG 


GCA 


CAG 


GTT AAG 


M 


CAG 


GAG 


CTG 


GTGAGG 


K 


CAG 


CTG 


CTG 


GTGAGA 


O 


CAG 


GCT 


GAG 


GTAGGG 


P 


CAG 


GGA 


ATG 


GTGAGC 


P (ALT) 


TAG 


ATG 


ATG 


GTGAGC 


□ 


TAG 


GTG 


GGG 


GTGAGA 


R 


CAG 


GTT 


AAA 


GTATGC 


S 


CAG 


ACC 


TGG 


GTAGGC 


T 


CAG 


CCC 


TGG 


GTGAGT 


U 


CAG 


ACC 


AAG 


GTAGGC 


V 


CAG 


CAG 
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3. Promoter Analysis : In order to identify the transcriptional start site 
of Gene 216, multiple 5' RACE products were sequenced from several different 
tissues. In most cases the 5' ends were located 80 bp upstream of the 
translation al start site. The region upstream of this sequence was then 

5 analyzed for potential transcription factor binding sites using GEMS Launcher, 
a promoter analysis program (Genomatix, Munich, Germany). GEMS 
Launcher uses statistically weighted algorithms to identify binding elements 
that comprise a promoter or regulatory module. A stretch of DNA sequence 
spanning 2000 bp upstream of the translational start site was analyzed. The 

10 results indicated that Gene 216 did not possess a TATA or CCAAT box. In 
fact, the first binding element that was identified was a GC box within the 5* 
untranslated region oriented in the opposite direction (Figure 13). This result 
is not unprecedented since 60% of TATA-less genes possess a GC box on the 
opposing strand. Also, this result was in agreement with published data 

15 regarding the promoters of mouse ADAM 1 7 and 19. Other binding elements 
that were identified within 600 bp upstream of the initiator methionine included 
an E-box, one AP2, and three SP1 sites (Figure 13). These types of binding 
elements were also identified in the mouse ADAM 1 7 and 1 9 genes, and may 
represent components of a promoter module for Gene 216. Approximately 

20 1200 bp upstream of the putative promoter module, GEMS Launcher identified 
binding elements that may comprise an additional regulatory element (Figure 
13). This region was highly conserved with the mouse ortholog of Gene 216 
(see below), as determined by dot matrix analysis. 

4. BLAST Analysis : BLASTP, BLASTN, and BLASTX analysis of 
25 Gene 216 against protein and nucleotide databases revealed that it was a 

novel member of the ADAM (A Disintegrin And Metalloprotease) gene family. 

The ADAM gene family is a sub-group of the zinc-dependent metalloprotease 
superfamily. There are currently 31 known members of the ADAM gene family. 

ADAM proteins have a complex domain organization that includes a signal 
30 sequence, a propeptide domain, a metalloprotease domain, a disintegrin 
domain, a cysteine-rich domain, and an epidermal growth factor-like domain, 
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as well as a transmembrane region and a cytoplasmic tail. ADAM proteins 
have been implicated in many processes, including proteolysis in the secretory 
pathway and extracellular matrix, extra- and intra-cellular signaling, processing 
of plasma membrane proteins, and procytokine conversion. The homology of 
5 Gene 216 and human ADAMs 19, 12, 15, 8 and 9 indicated that Gene 216 
belonged to a branch of the 31 -member family containing active 
metal lop rotease domains (Figure 14). 

5. Expression Analysis : To characterize the expression of Gene 
216, a series of expression experiments were performed. 

10 i. Northern Analysis : Northern analysis (Sambrook et aL 1989) of 

the Gene 216 transcript was performed. Probes were generated using one of 
the methods described below. Briefly, sequence verified IMAGE consortium 
cDNA clones were digested with appropriate restriction endonucleases to 
release the insert. The restriction digest was electrophoresed on an agarose 

1 5 gel and the bands containing the insert were excised. The gel piece containing 
the DNA insert was placed in a Spin-X (Corning Costar Corporation, 
Cambridge, MA) or Supelco spin column (Supelco Park, Bellefonte, PA) and 
spun at high speed for 15 min. DNA was ethanol precipitated and 
resuspended in TE. 

20 Alternatively, products were purified from PCR or RT-PCR. First, 

oligonucleotide primers were designed for PCR amplification of portions of 
cDNA, EST, or genomic DNA. Pools of DNA (for PCR) or RNA (for RT-PCR) 
were used as template for the reactions. The PCR primers were used to 
amplify genomic DNA to verify the size of the predicted product. The expected 

25 size was based on the genomic sequence. Inserts purified from IMAGE clones 
or PCR products were random primer labeled (Fineberg and Vogelstein, supra) 
to generate probes for hybridization. Probes were labeled by incorporation of 
a- 32 P-dCTP in second round of PCR. Commercially available Multiple Tissue 
Northern blots (CLONTECH, Palo Alto, CA) were hybridized and washed under 

30 conditions recommended by the manufacturer. A separate filter that contained 
6 tissues from the immune system was also utilized (CLONTECH). The results 
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revealed a major 5.0 kb transcript and a minor 3.5 kb transcript that were 
expressed in most tissues examined (Figures 15A-15B). The strongest signals 
were consistently identified in heart, skeletal muscle, colon, lymph, and small 
intestine. Moderate expression levels were observed in lung, liver, kidney, 
5 placenta, bone marrow, and brain. 

It was hypothesized that the 5 kb transcript was an incompletely spliced 
transcript from Gene 216. To test this hypothesis, Northern blotting was 
performed using cytoplasmic mRNA isolated from bronchial smooth muscle 
cells. The same radioactive probe was employed as described above. The 

1 0 results showed a very strong 3.5 kb signal and no signal at 5.0 kb (Figure 1 5C). 
This suggested that the predominant 5 kb transcript contained intronic material 
and was localized to the nucleus. Notably, intron ST is 1 A kb in size. The 
addition of the ST intron to the 3.5 kb full length cDNA would produce a 
transcript that is -5.0 kb in size. This suggests that regulatory elements in the 

15 region around intron ST affect splicing, retention in the nucleus, and/or 
transport to the cytoplasm. 

ii. RNA Dot Blot Analysis : RNA dot blotting was used to determine 
the expression of Gene 21 6 in a wide range of tissues. mRNA from 50 tissues 
was dotted onto a nylon filter, and probed with a radiolabeled oligo designed 

20 to hybridize to the 3' untranslated region of Gene 21 6. Figure 1 6 shows that 
Gene 216 was highly expressed in gastrointestinal tissues as well as aorta, 
uterus, prostate, ovary, lung, fetal lung, trachea, and placenta. The majority 
of these tissues are derived from the endoderm. During development, the 
endoderm forms a tube that produces the primordium of the digestive tract. 

25 Extensions from this tube also develop into the lung and trachea. 

iii. RT-PCR : Total RNA was isolated from primary cultures of seven 
cell types cultured from lung tissue. This RNA was analyzed in RT-PCR 
experiments. Genomic DNA was removed from the total RNA by DNasel 
digestion. The "'Superscript' Preamplification System for First strand cDNA 

30 synthesis" (Life Technologies) was used according to the manufacturer's 
specifications. cDNA was synthesized from DNasei treated total RNA using 
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oligo(dT) or random hexamers. Gene specific primers were used to PCR 
amplify the target cDNAs. The PCR reaction contained 0.5 u.1 of first strand 
cDNA, 1 jjJ sense primer (10 jliM), 1 pi antisense primer (10 uJvl), 3 pi dNTPs 
(2 mM), 1 .2 pi MgCI 2 (25 mM), 3 uJ 10 X PCR buffer, and 1 U Taq Polymerase 
5 (Perkin Elmer). Total volume was 30 uJ. The PCR reaction mixture was 
incubated at 94°C for 4 min; followed by 30 cycles of incubation at 94°C for 30 
sec, 58°C for 1 min; followed by incubation at 72°C for 1 min; followed by a 
final incubation at 72°C for 7 min. PCR products were analyzed on agarose 
gels. Figure 17 shows that Gene 216 was expressed in lung fibroblasts, 
1 0 pulmonary artery smooth muscle cells, bronchial smooth muscle cells and total 
lung, but was not expressed in bronchial epithelium or pulmonary artery 
endothelial cells. 

iv. cDNA Library Representation : A comprehensive approach to 
determining the tissue distribution of Gene 21 6 was performed by in silico data 
15 mining. For searches, public EST database and Genome Therpaeutics 
Corporation's internal cDNA database were used. BLASTN2 analysis 
identified ESTs from multiple cDNA libraries. A summary of ail tissues 
expressing Gene 216 is given in Table 5. 

TABLE 5 



Source 


Tissue 


UNIGENE 


Eye 




Muscle 




Placenta 




Stomach 




Uterus 




Whole embryo 




Breast 




Normal testis 


Direct selected cDNAs 


Bronchial smooth muscle (1 clone) 




Normal lung (2 clones) 




Brain (1 clone) 


Primary cell types (RT/PCR) 


Pulmonary artery smooth muscle 




Bronchial smooth muscle 




Lung fibroblast 




Total lung 


RNA Dot Blot 


Aorta 




Colon 
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Bladder 




Uterus 




Prostate 




Ovary 




Small intestine 




Heart 




Stomach 




Testis 




Appendix 




Lung 




Trachea 




Fetal kidney 




Fetal lung 


Northern Blot 


Brain 




Heart 




Skeletal muscle 




Colon 




Thymus 




Spleen 




Kidney 




Liver 




Small intestine 




Placenta 




Lung 




Lymph 




Bone marrow 



EXAMPLE 8: Gene 216 Polypeptide 

1 . ADAM Family Features: The zinc-dependent metal I o protease 
superfamily is comprised of several sub-groups. Metalloproteases that exhibit 
5 the zinc-binding consensus sequence HEXXHXXGXXH (SEQ ID NO:62) are 
referred to as zincins. In zincins, the 3 histidines in the consensus sequence 
play an essential role in binding to the zinc ion. Such binding is essential for 
catalytic activity. Zincins can be further divided into metzincins, which contain 
a methionine residue beneath the active-site zinc ion ("Met-turn" motif). Within 
1 0 this sub-group there are 4 sub-families: astacins, matraxins, adamlysins, and 
serralysins. The ADAM proteins belong to adamlysins sub-family of 
metzincins, along with snake venom metalloproteases. 

Currently, there are 31 known members of the ADAM family. The 
ADAM genes encode proteins of approximately 750 amino acids that contain 
15 8 different domains. Domain I is the pre-domain and contains the signal 
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sequence peptide that facilitates secretion through the plasma membrane. 
Domain II is the pro-domain that is cleaved before the protein is secreted 
resulting in activation of the catalytic domain. Domain III is the catalytic domain 
containing metalloprotease activity. Domain IV is the disintegrin-like domain 
5 that is believed to interact with integrins or other receptors. Domain V is the 
cysteine-rich domain and is speculated to be involved in protein-protein 
interactions or in the presentation of the disintegrin-like domain. Domain VI is 
the EGF-like domain that plays a role in stimulating membrane fusion. Domain 
VII is the transmembrane domain that anchors the ADAM protein to the 

1 0 membrane. Domain VIII is the cytoplasmic domain that contains binding sites 
for cytoskeletal-associated proteins and/or SH3 binding domains. This binding 
is thought to play a role in bi-directional signaling. Figure 8 shows the location 
of the ADAM domains identified in the Gene 216 protein sequence. 

To determine whether Gene 216 was a novel member of the ADAM 

15 family, the 812 amino acid sequence was aligned with other ADAM proteins 
using Pile-Up (Genetics Computer Group, Burlington, MA) (Figure 18). 
Sequence alignments indicated that the Gene 216 protein contained the eight 
domains characteristic of ADAM proteins (Figure 18). The consensus 
sequence HEXXHXXGXXH (SEQ ID NO:62) was located within the catalytic 

20 domain of Gene 216 protein. In addition, a methionine residue identified as a 
"Met-turn" was located in the Gene 216 protein. A conserved cysteine (amino 
acid 1 33) was identified in the prodomain of Gene 216 protein. This cysteine 
is important for activation in other ADAMs, as it forms an intramolecular 
complex with the zinc ion bound to the metalloprotease domain. The cysteine- 

25 zinc complex blocks the active site, and dissociation of the cysteine is required 
for catalytic activity. Dissociation is believed to activate the catalytic domain 
by a conformational change or the enzymatic cleavage of the prodomain. This 
process is referred to as the "cysteine switch". 

In ADAM 12, the conserved cysteine is located at a different position 

30 than conserved cysteines in other ADAM proteins (B.L. Gilpin et a!., 1998, J. 
Biol. Chem. 273:157-166). This alternative position correspond to amino acid 
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179 in Gene 216 (Figure 19). However, sequence analysis of 14 ADAMs, 
including ADAMs 8, 9, 12 and 15 (Stone et al., 1999, J. Prot Chem. 18:447- 
465) made it more likely that position 133 of Gene 216 was involved in the 
cysteine switch (see Figures 18 and 19) . In addition, Gene 216 shared a 
5 higher percentage of sequence identity with other ADAMs around position 133 
than position 179. This provided further support that the Gene 216 cysteine 
at position 133 was involved in the cysteine switch. 

Hydrophobicity analysis (PepPiot, Genetics Computer Group) of the 
Gene 216 amino acid sequence revealed the presence of two hydrophobic 

10 regions (Figure 20). One region was located at the amino terminus of the 
protein and contained the predicted the signal sequence. The other 
hydrophobic region was located near the carboxyl terminus and contained the 
predicted transmembrane domain that anchors the protein to the cell surface. 
Computational biology analysis (BLIMPS, Henikoff et al., 1994, Genomics 

1 5 19:97-1 07) of the Gene 21 6 cytoplasmic domain revealed the presence of a 
putative SH2 and SH3 binding domain as well as a putative casein kipase I 
phosphorylation site (Figure 19). Such sites may contribute to the bi-directional 
signaling of Gene 216, as observed for other ADAM proteins. 

Sequence analyses indicated that Gene 216 is a novel member of the 

20 ADAM family. Gene 216 is most closely related to ADAMs 8, 9, 12, 15, and 19, 
a branch of the family that is known to possess an active metalloprotease 
domain. Table 6 lists the 5 most similar BLASTP hits using the Gene 216 
amino acid sequence as a query. In humans, Gene 216 is most closely related 
to ADAM 19. Based on BLASTN and BLASTP analysis, Gene 216 nucleotide 

25 sequence shares the 37% identity with the ADAM 1 9 nucleotide sequence; and 
Gene 216 amino acid sequence shares 58% identity with the ADAM 19 amino 
acid sequence. 

TABLE 6: Top 5 Hits from BLAST Analysis of Gene 216 protein 

30 Hit GenBank Locus Description , Smallest Sum 

1 U66003 Xenopus laevis (ADAM 13) 5.5e-166 
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2 


AF019887 


Mus musculus metailoprotease- 
disintearin meltrin beta 


1.2e-139 


3 


AF1 34707 


Homo sapiens disintegrin and 
metalloorotease domain 19 (ADAM 19) 


1.6e-139 


4 


S60257 


Mouse mRNA for meltrin alpha 


1.8e-121 


5 


AF023476 


Homo sapiens meltrin-L precursor 
(ADAM 12) 


4.9e-119 



10 

Table 7 lists the top two hits from BLIMPS analysts of the Block protein 
motif database. 

TABLE 7: Top 2 Hits from BLIMPS Analysis of Gene 216 protein 

1 5 Description Strength Score AA# AA 

Disintegrins proteins 1950 1597 377 

Sequence: 

CCfAhnCsLRPGAQCAhGdCCvRCHKpAGalCRqAMGDCDIPEfCTGTSshCPP (SEQ ID NO:335) 



20 

Description Strength Score AA# AA 

Zinc metallopeptidases 1173 1276 276 

Sequence : 

TMAHEIGHSLG (SEQ ID NO:336) 

25 



2. Amino Acid Changes : Table 10 below lists the SNPs identified 
in Gene 216. A total of 53 SNPs are disclosed. In total, 9 SNPs were 
identified in the Gene 216 open reading frame. The remaining SNPs do not 

30 affect the resulting protein, however, they can affect the expression and 
resulting phenotype. Example 10 describes SNP identification for Gene 216, 
and Figure 1 9 shows resulting changes to the protein sequence. Seven of the 
nine SNPs caused amino acid changes in the Gene 216 protein. The other 2 
SNPs comprised silent mutations. Of the 7 amino acid changes, 4 were 

35 clustered toward the carboxyl terminus of the Gene 216 protein. One SNP was 
identified in the Gene 216 transmembrane domain, while 3 SNPs were 
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identified in the cytoplasmic domain. 

Of the cytoplasmic tail SNPs, one was located in an SH2 binding 
domain. This SNP caused a non-conservative amino acid change: methionine 
(hydrophobic) to threonine (polar). The other two cytoplasmic tail SNPs also 
5 caused non-conservative amino acid changes: proline (hydrophobic) to serine 
(polar) and giutamine (polar) to histidine (basic). Such changes can disturb the 
signaling properties of the Gene 216 protein. In addition, the transmembrane 
domain SNP caused an amino acid change from valine to isoleucine. This 
change can affect Gene 216 signaling efficiency. 

1 0 The two SNPs in the Gene 216 pro-domain generated non-conservative 

amino acid changes: tyrosine (polar) to histidine (basic) and threonine (polar) 
to alanine (hydrophobic). Since the ADAM pro-domain is cleaved during 
activation of the catalytic domain, such changes may affect the cleavage 
process. One SNP in the Gene 21 6 catalytic domain resulted in a change from 

15 alanine (hydrophobic) to valine (hydrophobic). This change can affect the 
sheddase (i.e., proteolysis) efficiency of the protein. 

Notably, amino acid changes in the identified Gene 216 catalytic 
domain, especially within the meta I lo protease domain, is important as this 
domain is critical to sheddase function. Recently, the X-ray crystal log rap hie 

20 data of the snake venom catalytic domain was determined and deposited in the 
public domain (Protein Data Bank web site, Research Collaboratory for 
Structural Bioinformatics (RCSB) Consortium, Rutgers University, Piscataway, 
NJ; Accession No. 1C9GA). This information can be utilized to predict whether 
an amino acid change will alter the folding of the catalytic domain of the Gene 

25 216 protein, in particular, the sequence of the catalytic domain of Gene 21 6 
protein can be plotted as X-ray crystal lograp hie coordinates and used to 
determine changes in the tertiary structure of this domain. 

3. Biological Role of Gene 216 : ADAM proteins belong to a part of 
a very large superfamily of zinc-dependent metalloproteases (Stone et aL, 

30 1999, J. Prot Chem. 18:447-465). Gene 216 represents a novel member of 
the ADAM family that is closely related to ADAM 19. ADAM 19 is known to 
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participate in the proteolytic processing of the membrane anchored protein 
neuregulin 1 (NRG1) (Shirakabe eta!., 2001, J. Biol. Chem. 276(1 2):9352-8). 
The expression and activation of ADAM 19 protein is localized to the trans- 
golgi apparatus. This localization has also been observed for other ADAM 
5 proteins (Lum et al., 1998, J. Biol. Chem. 273:26236-26247; Roghani et aL, 
1999, J. Biol. Chem. 274:3531-3540; Shirakabe et al., 2001, J. Biol. Chem. 
276(12):9352-8). This suggests that the ADAM genes, including Gene 216, 
encode proteins that function in the trans-golgi apparatus as intracellular 
processing enzymes. The processed substrates of these enzymes may be 
1 0 released into the cytosol as part of a signal transduction cascade that leads to 
the cell surface. 

The substrate of ADAM 19 is termed NRG1 . NRG1 belongs to a group 
of growth and differentiation factors (neuregulins) that bind to members of the 
EGF family of tyrosine kinase receptors. Data suggest that the proteolytically 

1 5 cleaved isoform of NRG1 , NRG-p1 , may induce the tyrosine phosphorylation 
of EGFR2 and EGFR3 in differentiated muscle cells (Shirakabe et al., 2001, 
J. Biol. Chem. 276(1 2):9352-8). The sequence similarity of Gene 216 protein 
and ADAM 19 protein suggests that neuregulins or their isoforms serve as 
substrates for Gene 216 protein. Gene 216-processed neuregulins or isoforms 

20 can serve as ligands for EGFR1. Although other researchers have not 
demonstrated expression of neuregulins in lung tissue, Northern blots and 
RT/PCR experiments performed in accordance with this invention showed that 
NRG2 is expressed at low levels in lung tissue (data not shown). 

Epidermal growth factor receptor (EGFR1) plays a pivotal role in the 

25 maintenance and repair of epithelial tissue. Following injury in bronchial 
epithelium, EGFR1 is upregulated in response to ligands acting on it or through 
transactivation of the EGFR1 receptor. This results in increased proliferation 
of cells and airway remodeling at the point of insult, and leads to the repair of 
the bronchial epithelium (Polosa et al., 1999, Am. J. Respir. Cell Mol. Biol. 

30 20:914-923; Holgate et al., 1999, Clin. Exp. Allergy Suppl 2:90-95). 

In asthma, the bronchial epithelium is highly abnormal. Structurally, the 
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columnar cells separate from their basal attachments. Functionally, there is 
increased expression and release of proinflammatory cytokines, growth 
factors, and mediator-generating enzymes. Beneath this damaged structure, 
subepithelial myofibroblasts are activated to proliferate. This proliferation 
5 causes excessive matrix deposition leading to abnormal thickening and 
increased density of the subepithelial basement membrane. 

Immunocytochemical studies have shown that both TGF- (3 and EGFR1 
are highly expressed at the area of injury. This suggests that parallel pathways 
operate in the repair of epithelial cells (Puddicombe et ai., 2000, FASEB J. 

10 14:1362-1374). It is postulated that EGFR1 stimulates epithelial repair , while 
TGF- (3 regulates the production of profibrogenic growth factors and 
proinflammatory cytokines that lead to extracellular matrix synthesis. Notably, 
EGFR1 is involved in regulating a number of different stages of epithelial 
repair, e.g., survival, migration, proliferation, and differentiation. Accordingly, 

15 dysregulation of EGFR1 may cause the epithelium to arrest in a "state of 
repair" (Holgate et al., 1999, Clin. Exp. Allergy Suppl 2:90-95). 

Thus, Gene 216 protein and its variants can induce the epithelium into 
a continuous state of repair by functioning improperly, e.g., failing to bind, 
process, or release its substrate. The Gene 216 substrate may be a member 

20 of the neuregulin family that serves as the ligand for EGFR1 . The decrease in 
processed neuregulin-type substrate could, in turn, cause the observed 
increase in EGFR1 expression. At the same time, the TGF- (3 pathway could 
remain active. This would be expected to produce a continuous source of 
proinflammatory factors as well as growth factors. Overproduction of these 

25 factors could act to drive airway wall remodeling causing bronchial 
hyperresponsiveness, a phenotype of asthma. 

It is also possible that the disintegrin-like domain of Gene 216 plays a 
role in respiratory diseases such as asthma. Integrins are a family of 
heterodimeric transmembrane receptors that mediate cell-cell and cell- 

30 extracellular matrix interaction (Hynes, 1992, Cell 69:11). Integrins promote 
angiogenesis (Brooks et al., 1994, Science 264:569), which plays a major role 
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in various pathological mechanisms, such as tumor growth, metastasis, 
diabetic retinopathy, and certain inflammation diseases (Folkman, 1995, N. 
Engi J. Med. 333:1757). Disintegrins act as integrin ligands that disrupt cell- 
matrix interactions (CP. Blobel and J.M. White, 1992, Curr. Opin. Cell Biol. 
5 4:760-5) and inhibit angiogenesis (C.H. Yeh et al., 1998, Blood 92:3268-3276). 

Thus, the disintegrin-like domain of the Gene 216 polypeptide can 
inhibit angiogenesis in the respiratory system. Gene 216 variants that have 
partly functional or non-functional disintegrin activity may lack anti- 
angiogenesis function. These Gene 216 variants may give rise to 
10 angiogenesis and inflammation in the respiratory system, a phenotype of 
asthma. 

EXAMPLE 9: Identification of the Mouse Homolog for Gene 216 

The mouse ortholog of Gene 216 was identified by TBLASTN analysis 
of Gene 216 against mouse dbEST (NCBI). BLAST analysis identified three 

1 5 mouse ESTs that were partially homologous to the human sequence but were 
not 100% identical to any known mouse ADAM genes. However, three mouse 
ESTs were 100% identical to a partially sequenced mouse BAC (BAC389B9; 
Accession Number AF1 55960). This BAC maps to mouse chromosome 2 in 
a region that is syntenic to human chromosome 20p13. The 47 kb BAC 

20 sequence was analyzed for potential genes using the Genscan gene prediction 
program (Burge and Karlin, J. Mol. Biol., 268:78-94). Additional putative exons 
were identified based on comparison of the human Gene 216 protein to the 
mouse BAC by TBLASTN. The results identified a mouse gene that contained 
an ORF of 2124 bp encoding a protein of 707 amino acids. The genomic 

25 nucleotide sequence of the mouse homolog is depicted in Figure 21 and the 
corresponding amino acid sequence is depicted in Figure 22. The mouse 
amino acid sequence was analyzed by BLASTP analysis and found to have 
homology to mouse and human ADAM proteins. The mouse amino acid 
sequence was aligned against the amino acid sequence of human Gene 216 

30 (BestFit, Genetics Computer Group; Figure 23). The results indicated that the 
mouse and human proteins shared -70% identity at the amino acid level. This 
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confirmed that the mouse sequence was the murine ortholog of human Gene 
216. 

EXAMPLE 10: Polymorphism Identification 

Polymorphisms were identified in the chromosome 20 region and 
5 subsequently used in association studies. Most of the data focused on the 
region of Gene 216. 

1 . Single Nucleotide Polymorphism (SNP) Discovery : An efficient 
multi-tiered approach was used for mutation analysis. First, PCR assays were 
performed to analyze exons and the consensus splice sites. Assays were 

1 0 designed for all exons that contributed to the open reading frame of the gene. 
This strategy ensured the detection of mutations that modified the protein 
sequence as well as mutations that were predicted to disrupt mRNA splicing. 
The identified promoter and putative regulatory element for Gene 216 and a 
large intronic region were assayed for polymorphisms as well. Second, a total 

15 of 77 individuals were tested for polymorphisms using fluorescent SSCP 
(single strand conformational polymorphism). This sample size provided a 
99% power to detect a polymorphism with a frequency of 3% or greater. 
Briefly, PCR was used to generate templates from asthmatic individuals that 
showed increased sharing for the 20p13-p12 chromosomal region and 

20 contributed towards linkage. Non-asthmatic individuals were used as controls. 
Enzymatic amplification of Gene 216 was accomplished using PCR with 
oligonucleotides flanking each exon as well as the putative 5* region. Primers 
were chosen to amplify each exon as well as 1 5 or more base pairs within each 
intron on either side of the splice site. The forward and the reverse primers 

25 were labeled with two different dye colors to allow analysis of each strand and 
confirm variants independently. Standard PCR assays were utilized for each 
exon primer pair following optimization. Buffer and cycling conditions were 
specific to each primer set. The products were denatured using a formamide 
dye and electrophoresed on non-denaturing acrylamide gels with varying 

30 concentrations of glycerol (at least two different glycerol concentrations). 

Primers utilized in fluorescent SSCP experiments to screen coding and 
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non-coding regions of Gene 216 for polymorphisms are provided in Table 8. 
Column 1 lists the genes targeted for mutation analysis. Column 2 lists the 
specific exons analyzed. Column 3 lists the primer names. Columns 4 and 5 
list the forward primer sequences and corresponding SEQ ID NOS, 
5 respectively. Columns 5 and 6 list the reverse primer sequences and 
corresponding SEQ ID NOS, respectively. 

Once polymorphisms were identified, multiple individuals representative 
of each SSCP pattern and two genomic controls were sequenced. Sequencing 
was used to validate polymorphisms and to identify SNPs. The variants 

10 detected in the initial set of asthmatic and normal individuals were subject to 
fluorescent sequencing (ABI) using a standard protocol described by the 
manufacturer (Perkin Elmer). In cases where SSCP did not identify 
polymorphisms in Gene 216, sequence information was obtained from 16 
individuals that were identical by descent (IBD) in the region, and from 4 

15 controls. This was done to ensure that all potential polymorphisms were 
identified. 

Primers utilized in DNA sequencing for purposes of confirming 
polymorphisms detected using fluorescent SSCP are provided in Table 9. 
Column 1 lists the specific exons sequenced. Column 2 lists the forward 

20 primer names, column 3 lists the forward primer sequences, and column 4 lists 
the corresponding SEQ ID NOS. Column 5 lists the reverse primer names, 
column 6 lists the reverse primer sequences, and column 7 lists the 
corresponding SEQ ID NOS. 

Single nucleotide polymorphisms (SNPs) that were identified in Gene 

25 216 are provided in Table 10. Column 1 lists the SNP numbers (1-53). 
Column 2 lists the exons that either contain the SNPs or are flanked by intronic 
sequences that contain the SNPs. Column 3 lists the PMP sites for the SNPs. 
A " denotes polymorphisms which are 5' of the exon that are within the 
intronic region. The corresponding number is given from the 3' to 5' direction. 

30 A "+" denotes polymorphisms which are 3' of the exon that are within the 
intronic region. The number corresponding to the "+" is given from the 5' to 3' 
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direction. Columns 2 and 3, combined, show the SNP names as described 
herein, e.g., T+1 , T+2, etc. Column 4 indicates whether the SNP was detected 
in an exon or intron sequence. Column 5 lists the SNP locations in the Gene 
216 genomic sequence of SEQ ID NO:6 (see Figure 7). Column 6 lists the 
5 SNP reference sequences which illustrate the SNP nucleotide changes with 
underlining. Column 7 lists the SEQ ID NOs of the SNP reference sequences. 
Column 8 lists the base changes of the SNP sequences. Column 9 lists the 
amino acid changes resulting from the SNP sequences. 

It is noted that the SNP nomenclature from related U.S. Application 
10 Serial Number 09/834,597, filed April 13, 2001, has been revised in this 
continuation-in-part application. The table describing the former and present 
SNP nomenclature is shown immediately following Table 10, below. 
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Using an in-house program called snp__view; the genomic structure of 
the gene was diagrammed (Figure 11). In Figure 1 1, the exons are shown to 
scale and the SNPs are identified by their location along the genomic BAC 
5 DNA. The polymorphic sites identified in the Gene 216 genomic sequence are 
also shown by the underlined nucleotides in Figure 29. The polymorphic sites 
discovered within the cDNA and the corresponding amino acid position in Gene 
21 6 are underlined in Figure 24. It will be understood by those of skill in the art 
that the SNPs identified in the Gene 216 genomic sequence can be correlated 

1 0 to the SNP positions identified in the Gene 21 6 cDNA sequence by aligning the 
genomic and cDNA sequences. 
EXAMPLE 11: Polymorphism Genotyping 

Putative variants were confirmed by sequencing. Following this, rapid 
allele specific assays were designed to type more than 400 individuals (> 200 

1 5 cases and > 200 controls). These assays were used in the association studies. 
All coding SNPs (cSNPs) that resulted in an amino acid change (ccSNPs) 
were typed. Neutral polymorphisms were typed if: 1 ) the polymorphism was 
identified in an exon which lacked a ccSNP; 2) the polymorphism was identified 
in an exon which contained a ccSNP, but the two polymorphisms showed 

20 different frequencies; and 3) the polymorphism was identified in an intronic 
region adjacent to an exon which lacked a cSNP. If results from the 
association studies appeared positive, additional neutral polymorphisms were 
typed. More than 30 allele specific assays from Gene 21 6 were typed for the 
case control population (Table 11). 

25 Two types of allele specific assays (ASAs) were used. If the SNP 

resulted in a mutation that created or abolished a restriction site, restriction 
fragment length polymorphisms (RFLPs) were obtained from PCR products 
that spanned the variants. The RFLPs were then analyzed. If the 
polymorphisms did not result in RFLPs, allele specific oligonucleotide assays 

30 were used. For these assays, PCR products that spanned the polymorphism 
were electrophoresed on agarose gels and transferred to nylon membranes by 
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Southern blotting. Oligomers 16-20 bp in length were designed such that the 
middle base was specific for each variant. The oligomers were labeled and 
successively hybridized to the membrane in order to determine genotypes. 
The specific method used to type each SNP is indicated in Table 11. 
5 Table 1 1 below contains the information relating to the specific assay 

used. Column 1 lists the SNP designation number. Column 2 lists the specific 
assay used, either RFLP or ASO. Column 3 lists the enzyme used in the RFLP 
assay (described below). Columns 4 and 6 list the sequence of the primers 
used in the ASO assay (described below). Columns 5 and 7 list the 
10 corresponding SEQ ID NOS for the primers. 

1. RFLP Assay : The amplicon containing the polymorphism was 
PCR amplified using primers that were used to generate a fragment for 
sequencing (sequencing primers) or SSCP (SSCP primers). The appropriate 
population of individuals was PCR amplified in 96 well microtiter plates. 

15 Enzymes were purchased from NEB. The restriction cocktail containing 

the appropriate enzyme for the particular polymorphism is added to the PCR 
product. The reaction was incubated at the appropriate temperature according 
to the manufacturer's recommendations (NEB) for 2-3 hr, followed by a 4°C 
incubation. After digestion, the digestion products were size fractionated using 

20 the appropriate agarose gel depending on the assay specifications (2.5%, 3%, 
or Metaphor, FMC Byproducts). Gels were electrophoresed in 1 XTBE Buffer 
at 170 Volts for approximately 2 hr. The gel was illuminated using ultraviolet 
light and the image was saved as a Kodak 1D file. Using the Kodak 1D image 
analysis software, the images were scored and the data was exported to 

25 Microsoft EXCEL (Microsoft, Redmond, WA). 

2. ASO assay : The amplicon containing the polymorphism was 
PCR amplified using primers that were used to generate a fragment for 
sequencing (sequencing primers) or SSCP (SSCP primers). The appropriate 
population from individuals was PCR amplified in 96-well microtiter plates and 

30 re-arrayed into 384-well microtiter plates using a Tecan Genesis RSP200. The 
amplified products were loaded onto 2% agarose gels and size fractionated at 
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150 V for 5 min. The DNA was transferred from the gel to Hybond N+ nylon 
membrane (Amersham-Pharmacia) using a Vacuum blotter (Bio-Rad). The 
filter containing the blotted PCR products was transferred to a dish containing 
300 ml pre-hybridization solution. This solution contained 5 X SSPE (pH 7.4), 
5 2% SDS, and 5 X Denhardt's. The filter was incubated in pre-hybridization 
solution at 40°C for over 1 hr. After pre-hybridization, 10 ml of the pre- 
hybridization solution and the filter were transferred to a washed glass bottle. 

For these assays, the allele specific oligonucleotides (ASO) were 
designed with the polymorphism in the middle. The size of the oligonucleotide 

10 was dependent upon the GC content of the sequence around the 
polymorphism. Those ASOs that had a G or C polymorphism were designed 
so that the T m was between 54-56°C and those that had an A or T variance 
were designed so that the T m was between 60-64°C. All oligonucleotides were 
phosphate free at the 5' end and purchased from GibcoBRL. For each 

15 polymorphism, 2 ASOs were designed: one for each variant. 

The two ASOs that represented the polymorphism were resuspended 
at a concentration of 1 ug/ul. Each ASO was end-labeled separately with y- 
ATP 32 (6000 Ci/mmol) (NEN) using T4 polynucleotide kinase according to 
manufacturer recommendations (NEB). The end -labeled products were 

20 removed from the unincorporated y-ATP 32 by passing the reactions through 
Sephadex G-25 columns according to manufacturers recommendation 
(Amersham-Pharmacia). The entire end-labeled product of one ASO was 
added to the bottle containing the appropriate filter and 10 ml hybridization 
solution. Hybridization solution included 5 X SSPE (pH 7.4), 2% SDS, and 5 

25 X Denhardt's solution. The hybridization reaction was placed in a rotisserie 
oven (Hybaid, Franklin, MA) and left at 40°C for a minimum of 4 hr. The other 
ASO was stored at -20° C. 

After the prerequisite hybridization time had elapsed, the filter was 
removed from the bottle and transferred to 1 L of wash solution pre-warmed to 

30 45°C. Wash solution contained 0.1 X SSPE (pH 7.4) and 0.1% SDS. After 15 
min, the filter was transferred to another L of wash solution pre-warmed to 
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50°C. After 15 min, the filter was wrapped in Saran, placed in an 
autoradiograph cassette and an X-ray film (Kodak) placed on top of the filter. 
Typically, an image would be observed on the film within 1 hr. After an image 
had been captured on film for the 50°C wash, the process was repeated for 
5 wash steps at 55°C, 60°C and 65°C. The image that captured the best result 
was used. 

The ASO was removed from the filter by adding 1 L of boiling strip 
solution. This solution contained 0.1 x SSPE (pH 7.4) and 0.1% SDS. This 
was repeated two more times. After removing the ASO the filter was pre- 

10 hybridized in 300 ml pre-hybridization solution at 40°C for over 1 hr. 
Prehybridization solution contained 5 X SSPE (pH 7.4), 2% SDS, and 5 X 
Denhardt's. The second end-labeled ASO corresponding to the other variant 
was removed from storage at -20°C and thawed at room temperature. The 
filter was placed into a glass bottle along with 10 ml hybridization solution and 

15 the entire end-labeled product of the second ASO. Hybridization solution 
included 5 X SSPE (pH 7.4), 2% SDS, and 5 X Denhardt's solution. The 
hybridization reaction was placed in a rotisserie oven (Hybaid) and left at 40°C 
for a minimum of 4 hr. After the hybridization, the filter was washed at various 
temperatures and images captured on film as described above. 

20 The two films that best captured the allele-specific assay with the two 

ASOs were converted into digital images by scanning them into Adobe 
PhotoShop (Adobe, San Jose, CA). These images were overlaid against each 
other in Graphic Converter and then scored. 
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EXAMPLE 12: Association Study Analysis 

1. Case-Control Study : In order to determine whether 
polymorphisms in candidate genes were associated with the asthma 
phenotype, association studies were performed using a case-control study 
5 design. For a well-matched design, the case-control approach is more 
powerful than the family based transmission disequilibrium test (TDT) (N.E. 
Morton and A. Collins, 1998, Proc. Natl. Acad. Sci USA 95:1 1389-93). Case- 
control studies are, however, sensitive to population heterogeneity. 

To avoid issues of population admixture, which can bias case-control 

10 studies, the unaffected controls were collected in both the US and the UK. A 
total of three hundred controls were collected, 200 in the UK and 100 in the 
US. Inclusion into the study required that the control individual was negative 
for asthma, as determined by self-report of never having asthma, had no first- 
degree relatives with asthma, and was negative for eczema and symptoms 

15 indicative of atopy within the past 12 months. Data from an abbreviated 
questionnaire similar to that administered to the affected sib pair families were 
collected. Results from skin prick tests to 4 common aeroallergens (house 
dust mite, cat, grass, and tree) were also collected. The results of the skin 
prick test were used to select a subset of controls that were most likely to be 

20 asthma and atopy negative. 

A subset of unrelated cases was selected from the affected sib pair 
families based on the evidence for linkage at chromosomal locations flanking 
a given gene. One affected sib demonstrating identity-by-descent (IBD) at the 
appropriate marker loci was selected from each family. Since the appropriate 

25 cases could have varied for each gene in the chromosome 20 region, a larger 
collection of individuals who were IBD across a larger interval were genotyped. 
A subset of these individuals was used in the analyses. . On average, 1 30 IBD 
affected individuals and 200 controls were compared for allele and genotype 
frequencies. This number provided an 80% power to detect a difference of 5% 

30 or greater between the two groups for a rare allele (< 5%) at a 0.05 level of 
significance. For a common allele (50%), the number provided an 80% power 
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to detect a difference of 1 0% or more between the two groups. 

For each polymorphism, the frequency of the alleles In the control and 
case populations was compared using a Fisher exact test. A mutation that 
increased susceptibility to the disease was predicted to be more prevalent in 
5 the cases than in the controls, while a protective mutation was predicted to be 
more prevalent in the control group. Similarly, the genotype frequencies of the 
SNPs were compared between cases and controls. P-values for both the allele 
and genotype were plotted against a coordinate system based on genomic 
sequence to visualize regions where allelic association was present. A small 

1 0 p-value (or a large value of -log (p), as plotted In the figures described below) 
was indicative of an association between the SNPs and the disease 
phenotype. The analysis was repeated for the US and UK population 
separately to adjust for the possibility of genetic heterogeneity. 

2. Association test with individual SNPs : Chromosomal regions 

15 harboring asthma susceptibility genes were identified by association studies 
using the SNP typing data. Two separate phenotypes were used in these 
analyses: asthma and bronchial hyper-responsiveness. 

a. Asthma Phenotype : The significance levels (p-values) for allelic 
association of all typed SNPs in Gene 21 6 to the asthma phenotype are plotted 

20 in Figure 25 (combined population) and Figure 26 (US and UK populations, 
separately). The most significant result in the combined population was 
observed for Gene 216 exon SNP V -1 . For this SNP, 92.4% of the cases 
were carriers of the C allele, whereas the C allele was observed in only 85.2% 
of the controls (p = 0.0055). Five additional SNPs in Gene 216 (V4, ST+7, 

25 ST+4, S1 , and Q-1 ) were significant at the 0.05 level. Frequencies of the allele 
seen more often in the cases than in the controls and p-values for the 
association with the asthma phenotype in Gene 216 are presented in Tables 
12, 13, and 14 for the combined population and for the UK and US 
populations, separately. 
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TABLE 12 



Asthma 
Yes/No 
Combined 
US and UK 


SNP 


ALLELE 


FREQUENCIES 


ALLELE 
P-VALUE 


GENOTYPE 
P-VALUE 


CNTL 


N 


CASE 


N 


A-1 


T 


2.4% 


212 


2.7% 


130 


0.8039 


0.8016 


D-2 


T 


0.7% 


214 


0.8% 


127 


1.0000 


1.0000 


D-1 


C 


62.4% 


205 


65.7% 


118 


0.4449 


0.5390 


D1 


C 


0.0% 


215 


0.4% 


131 


0.3786 


0.3786 


F1 |A 


96.8% 


217 


96.9% 


129 


1 .0000 


1.0000 


F+1 


G 


65.2% 


197 


70.4% 


120 


0.1913 


0.4109 


G-1 


T 


90.7% 


210 


91.3% 


127 


0.8900 


0.7683 


11 


G 


84.9% 


212 


85.3% 


129 


0.9124 


1 .0000 


KL+1 


G 


96.1% 


217 


97.2% 


125 


0.5223 


0.5145 


KL+2 


C 


71 .3% 


216 


77.1% 


129 


0.1085 


0.2262 


L-2 


G 


92.9% 


212 


93.1% 


131 


1.0000 


0.9379 


L-1 


A 


11.1% 


212 


1 1 .2% 


130 


1 .0000 


1 .0000 


L1 


C 


99.3% 


217 


99.6% 


131 


1.0000 


1.0000 


M+1 


G 


88.7% 


213 


88.9% 


131 


1.0000 


0.9672 


Q-1 


C 


85.0% 


217 


91 .2% 


131 


0.0184 


0.0659 


S1 


G 


89.5% 


209 


94.6% 


130 


0.0233 


0.0717 


S2 


G 


73.7% 


217 


80.0% 


130 


0.0662 


0.0834 


S+1 


A 


51 .2% 


206 


52.5% 


120 


0.8075 


0.6608 


ST+4 


A 


51.5% 


205 


60.1% 


129 


0.0313 


0.1043 


ST+5 


T 


46.4% 


210 


48.8% 


129 


0.5794 


0.4165 


ST+6 


T 


0.5% 


216 


0.8% 


129 


0.6323 


0.6317 


ST+7 


G 


78.1% 


215 


85.8% 


130 


0.0160 


0.0248 


T1 


C 


1 1 .3% 


217 


11.8% 


131 


0.9025 


0.7483 


T2 


T 


9.4% 


208 


10.8% 


125 


0.5928 


0.7656 


T+1 


C 


88.7% 


191 


88.8% 


120 


1.0000 


0.8394 


T+2 


T 


88.3% 


217 


88.9% 


131 


0.8076 


0.9005 


V-4 


G 


24.4% 


215 


26.9% 


130 


0.4713 


0.6808 


V-3 


A 


37.1% 


209 


38.8% 


129 


0.6834 


0.6613 


V-2 


T 


36.8%o 


212 


38.1% 


130 


0.7451 


0.7909 


V-1 


C 


85.2% 


216 


92.4% 


131 


0.0055 


0.0178 


V1 


A 


96.5% 


211 


98.1% 


130 


0.2515 


0.2443 


V2 


C 


96.3% 


215 


98.5% 


129 


0.1576 


0.1513 


V3 


T 


77.8% 


214 


78.4% 


125 


0.9235 


0.9791 


V4 


C 


76.7% 


217 


83.6% 


131 


0.0336 


0.0370 


V5 


A 


96.3% 


215 


98.5% 


129 


0.1576 


0.1513 


V6 


T 


8.7% 


213 


9.5% 


131 


0.7841 


0.6895 


V7 


C 


66.5% 


215 


71.5% 


128 


0.2029 


0.1482 
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TABLE 13 



Asthma 
Yes/No 
UK 

population 


SNP 


ALLELE 


FREQUENCIES 


ALLELE 
P-VALUE 


GENOTYPE 
P-VALUE 


CNTL 


N 


CASE 


N 


A-1 


T 


1.1% 


135 


3.4% 


104 


0.1110 


0.1075 


D-2 


T 


0.7% 


139 


1.0% 


101 


1.0000 


1.0000 


D-1 


C 


61.1% 


135 


65.5% 


97 


0.3807 


0.2619 


D1 


C 


0.0% 


139 


0.5% 


104 


0.4280 


0.4280 


F1 


A 


97.9% 


140 


98.0% 


102 


1.0000 


1 .0000 


F+1 


G 


64.1% 


128 


74.2% 


93 


0.0295 


0.0711 


G-1 


C 


9.8% 


137 


9.9% 


101 


1.0000 


0.4913 


11 


G 


83.7% 


138 


89.2% 


102 


0.1094 


0.1323 


KL+1 


G 


97.1% 


140 


98.0% 


99 


0.7685 


0.7655 


KL+2 


C 


71.6% 


139 


79.1% 


103 


0.0717 


0.1519 


L-2 


A 


7.3% 


137 


7.7% 


104 


0.8633 


1.0000 


L-1 


G 


87.2% 


137 


91.8% 


103 


0.1380 


0.3380 


L1 


C 


99.3% 


140 


99.5% 


104 


1.0000 


1.0000 


M+1 


G 


87.0% 


138 


91.8% 


104 


0.1059 


0.2969 


Q-1 


C 


86.1% 


140 


92.3% 


104 


0.0419 


0.0763 


S1 


G 


89.4% 


132 


95.2% 


104 


0.0260 


0.0567 


S2 


G 


72.9% 


140 


84.0% 


103 


0.0041 


0.0128 


S+1 


T 


46.9% 


129 


49.5%> 


97 


0.6346 


0.5458 


ST+4 


A 


48.1% 


^128 


59.2% 


103 


0.0191 


0.0718 


ST+5 


T 


44.4% 


133 


50.0% 


102 


0.2273 


0.2470 


ST+6 


T 


0.0% 


139 


1.0% 


103 


0.1806 


0.1801 


ST+7 


G 


79.5% 


139 


86.4% 


103 


0.0535 


0.1362 


T1 


T 


86.8% 


140 


91.4% 


104 


0.1473 


0.3472 


T2 


C 


89.6% 


134 


91 .8% 


98 


0.4279 


0.7007 


T+1 


C 


86.9% 


122 


91.1% 


95 


0.2211 


0.4281 


T+2 


T 


87.5% 


140 


87.5% 


104 


1.0000 


1 .0000 


V~4 


G 


25.2% 


139 


26.7% 


103 


0.7529 


0.6628 


V~3 


A 


38.1% 


134 


40.2%o 


102 


0.7032 


0.8627 


V~2 


T 


37.6% 


137 


39.3% 


103 


0.7055 


0.9223 


V-1 


C 


86.4% 


140 


93.8% 


104 


0.0105 


0.0243 


V1 


A 


97.8% 


137 


98.5% 


103 


0.7385 


0.7359 


V2 


C 


97.5% 


138 


99.0% 


102 


0.3129 


0.3082 


V3 


T 


78.5% 


137 


80.1% 


98 


0.7301 


0.8875 


V4 


C 


75.4% 


140 


83.7% 


104 


0.0328 


0.0288 


V5 


A 


97.1% 


140 


98.5% 


103 


0.3689 


0.3633 


V6 


T 


8.3% 


139 


9.6% 


104 


0.6308 


0.7329 


V7 


C 


65.8% 


139 


74.3% 


101 


0.0566 


0.1266 
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TABLE 14 



Asthma 
Yes/No 
US 

population 


SNP 


ALLELE 


FREQUENCIES 


ALLELE 
P-VALUE 


GENOTYPE 
P-VALUE 


CNTL 


N 


CASE 


N 


A-1 


A 


95.5% 


77 


100.0% 


26 


0.1953 


0.1872 


D-2 


C 


99.3% 


75 


100.0% 


26 


1.0000 


1 .0000 


D-1 


C 


65.0% 


70 


66.7% 


21 


1.0000 


0.7300 


D1 


T 


100.0% 


76 


100.0% 


27 


1.0000 


1 .0000 


F1 


G 


5.2% 


77 


7.4% 


27 


0.5136 


0.5043 


F+1 


A 


32.6% 


69 


42.6% 


27 


0.2401 


0.3270 


G-1 


T 


91.8% 


73 


96.2% 


26 


0.3635 


0.3440 


11 


A 


12.8% 


74 


29.6% 


27 


0.0105 


0.0074 


KL+1 


G 


94.2% 


77 


94.2% 


26 


1.0000 


1 .0000 


KL+2 


A 


29.2% 


77 


30.8% 


26 


0.8614 


0.8889 


L-2 


G 


93.3% 


75 


96.3% 


27 


0.7362 


0.5089 


L-1 


A 


8.0% 


75 


22.2% 


27 


0.0116 


0.0123 


L1 


C 


99.4% 


77 


1 00.0% 


27 


1.0000 


1.0000 


M+1 


T 


8.0% 


75 


22.2% 


27 


0.0116 


0.0123 


Q-1 


C 


83.1% 


77 


87.0% 


27 


0.6654 


0.8280 


S1 


G 


89.6% 


77 


92.3% 


26 


0.7873 


1.0000 


S2 


C 


24.7% 


77 


35.2% 


27 


0.1571 


0.1404 


S+1 


A 


48.1% 


77 


60.9% 


23 


0.1345 


0.3169 


ST+4 


A 


57.1% 


77 


63.5% 


26 


0.5150 


0.5127 


ST+5 


C 


50.0% 


77 


55.6% 


27 


0.5287 


0.6337 


ST+6 


C 


98.7% 


77 


100.0% 


26 


1 .0000 


1.0000 


ST+7 


G 


75.7% 


76 


83.3% 


27 


0.3413 


0.0732 


T1 


C 


7.8% 


77 


24.1% 


27 


0.0030 


0.0055 


T2 


T 


7.4% 


74 


20.4% 


27 


0.0188 


0.0208 


T+1 


T 


8.0% 


69 


20.0% 


25 


0.0334 


0.0361 


T+2 


T 


89.6% 


77 


94.4% 


27 


0.4127 


0.3874 


V-4 


G 


23.0% 


76 


27.8% 


27 


0.5795 


0.6743 


V-3 


G 


64.7% 


75 


66.7% 


27 


0.8684 


0.4960 


V-2 


C 


64.7% 


75 


66.7% 


27 


0.8684 


0.4960 


V-1 


C 


82.9% 


76 


87.0% 


27 


0.5262 


0.8281 


V1 


A 


93.9% 


74 


96.3% 


27 


0.7308 


0.7226 


V2 


C 


94.2% 


77 


96.3% 


27 


0.7320 


0.7241 


V3 


C 


23.4% 


77 


27.8% 


27 


0.5819 


0.6932 


V4 


C 


79.2% 


77 


83.3% 


27 


0.5583 


0.7765 


V5 


A 


94.7% 


75 


98.1% 


26 


0.4519 


0.4404 


V6 


C 


90.5% 


74 


90.7% 


27 


1.0000 


1.0000 


V7 


G 


32.2% 


76 


38.9% 


27 


0.4053 


0.1776 
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b. Bronchial Hyper-responsiveness : The analyses were repeated 
using asthmatic children with borderline to severe BHR (PC 2 o ^ 16 mg/ml) 
or PC 2 o(16), as described in the linkage section. First, sibling pairs were 
identified where both sibs were affected and satisfied this new criterion. Of 
5 these pairs, one sib was included in the case/control analyses if they 
showed evidence of linkage at the gene of interest. This phenotype was 
more restrictive than the Asthma yes/no criteria. Hence, the number of 
cases included in the analyses was reduced approximately in half. If the 
PC2o(16) subgroup represented a more genetically homogeneous sample, 

10 it was expected that an increase in the effect size compared to the one 
observed in the original set of cases would be observed. It was also 
possible that the reduction in sample size would produce estimates that 
were less accurate. Such estimates could obscure a trend in allele 
frequencies in the control group, the original set of cases, and the PC 2 o(16) 

15 subgroup. In addition, it was possible that the reduction in sample size 
would induce a reduction in power (and increase in p values) in spite of the 
larger effect size. 

The significance levels (p-values) for allelic association of all typed 
SNPs in Gene 216 to the BHR phenotype are plotted in Figure 27 (combined 

20 population) and Figure 28 (US and UK populations, separately). Frequencies 
of the alleles seen more often in the cases than in the controls and p-values 
for the association with the BHR phenotype in Gene 216 are presented in 
Tables 15, 16, and 17 for the combined population and for the UK and US 
populations, separately. As before, multiple SNPs in Gene 216 were 

25 associated with the phenotype in each separate population. In the UK 
population, the most significant SNP was in Gene 216, exon S2. For this SNP, 
87% of the cases were carriers of the G allele compared to 72.9% for the 
controls (p = 0.0038). For the US population, the most significant association 
was found with the SNP in Gene 216 exon T 1, where 28.6% of the cases 

30 carried the C allele compared to 7.8% for the controls (p = 0.0041 ). 

In summary, Gene 216 was associated with the phenotypes of both 
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asthma and bronchial hyper-responsiveness. Association was found with 
multiple SNPs in both the UK and US populations. The 3' region of the gene, 
which contains the transmembrane domain, the cytoplasmic domain, and the 
3' UTR, appeared to have the strongest association. Taken together, these 
5 data strongly suggested that Gene 216 was an asthma susceptibility gene. 

TABLE 15 



BHR 

Combined 
US and UK 


SNP 


ALLELE 


FREQUENCIES 


ALLELE 
P-VALUE 


GENOTYPE 
P-VALUE 


CNTL 


N 


CASE 


N 


A-1 


A 


97.6% 


212 


97.7% 


64 


1.0000 


1 .0000 


D-2 


T 


0.7% 


214 


0.8% 


63 


1.0000 


1 .0000 


D-1 


G 


37.6% 


205 


38.3% 


60 


0.9149 


0.7497 


D1 


C 


0.0% 


215 


0.8% 


64 


0.2294 


0.2294 


F1 


A 


96.8% 


217 


97.6% 


62 


0.7752 


0.7715 


F+1 


G 


65.2% 


197 


66.7% 


57 


0.8234 


0.3665 


G-1 


C 


9.3% 


210 


9.5% 


63 


1.0000 


0.9355 


11 


G 


84.9% 


212 


86.7% 


64 


0.6709 


0.8958 


KL+1 


G 


96.1% 


217 


97.6% 


63 


0.5874 


0.5802 


KL+2 


C 


71.3% 


216 


75.0% 


64 


0.4343 


0.7291 


L-2 


A 


7.1% 


212 


8.6% 


64 


0.5661 


0.5313 


L-1 


G 


l88.9% 


212 


89.7%o 


63 


0.8722 


1 .0000 


L1 


T 


0.7% 


217 


0.8% 


64 


1.0000 


1 .0000 


M+1 


G 


88.7% 


213 


89.8% 


64 


0.8722 


0.9410 


Q-1 


C 


85.0% 


217 


89.8% 


64 


0.1915 


0.5304 


S1 


G 


89.5% 


209 


93.7% 


63 


0.2251 


0.5211 


S2 


G 


73.7% 


217 


79.7% 


64 


0.2009 


0.0664 


S+1 


A 


51.2% 


206 


51.8% 


57 


1.0000 


0.7632 


ST+4 


A 


51.5% 


205 


58.9% 


62 


0.1521 


0.3393 


ST+5 


T 


46.4% 


210 


46.8% 


63 


1.0000 


0.5530 


ST+6 


C 


99.5% 


216 


100.0% 


63 


1.0000 


1 .0000 


ST+7 


G 


78.1% 


215 


82.5% 


63 


0.3199 


0.1216 


T1 


C 


11.3% 


217 


11.7% 


64 


0.8750 


0.7576 


T2 


C 


90.6% 


208 


91.1% 


62 


1.0000 


1 .0000 


T+1 


c 


88.7% 


191 


89.2% 


60 


1.0000 


0.7540 


T+2 


T 


88.3% 


217 


88.3% 


64 


1.0000 


0.8975 


V-4 


G 


24.4% 


215 


27.0% 


63 


0.5602 


0.2603 


V-3 


A 


37.1% 


209 


39.7% 


63 


0.6016 


0.8755 


V-2 


T 


36.8% 


212 


39.1% 


64 


0.6770 


0.8930 


V-1 


C 


85.2% 


216 


90.6% 


64 


0.1413 


0.3117 


V1 


A 


96.5% 


211 


97.6% 


63 


0.7758 


0.7721 


V2 


C 


96.3% 


215 


97.7% 


64 


0.5856 


0.5786 


V3 


T 


77.8% 


214 


78.3% 


60 


1.0000 


0.8426 


V4 


C 


76.7% 


217 


80.5% 


64 


0.4009 


0.4077 


V5 


A 


96.3% 


215 


98.4% 


62 


0.3878 


0.3797 


V6 


T 


8.7% 


213 


9.4% 


64 


0.8592 


0.6092 
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TABLE 16 



BHR j 
UK 

population 


SNP 


ALLELE 


FREQUENCIES 


ALLELE 
P-VALUE 


GENOTYPE 
P-VALUE 


CNTL 


N 


CASE 


N 


A-1 


T 


1.1% 


135 


3.0% 


50 


0.3500 


0.3461 


D-2 


T 


0.7% 


139 


1 .0% 


49 


1.0000 


1.0000 


D-1 


C 


61.1% 


135 


61.5% 


48 


1.0000 


0.5047 


D1 


C 


0.0% 


139 


1 .0% 


50 


0.2646 


0.2646 


F1 


A 


97.9% 


140 


97.9% 


48 


1.0000 


1.0000 


F+1 


G 


64.1% 


128 


73.3% 


43 


0.1466 


0.2885 


G-1 


C 


9.9% 


137 


10.2% 


49 


1.0000 


0.9269 


11 


G 


83.7% 


138 


91.0% 


50 


0.0952 


0.2406 


KL+1 


G 


97.1% 


140 


98.0% 


49 


1.0000 


1.0000 


KL+2 


C 


71.6% 


139 


79.0% 


50 


0.1860 


0.3615 


L-2 


A 


7.3% 


137 


9.0% 


50 


0.6623 


0.5686 


L-1 


G 


87.2% 


137 


93.9% 


49 


0.0899 


0.2787 


L1 


T 


0.7% 


140 


1 .0% 


50 


1.0000 


1 .0000 


M+1 


G 


87.0% 


138 


94.0% 


50 


0.0638 


0.2367 


Q-1 


C 


86.1% 


140 


93.0% 


50 


0.0752 


0.2087 


S1 


G 


89.4% 


132 


96.0% 


50 


0.0603 


0.1968 


S2 


G 


72.9% 


140 


87.0% 


50 


0.0038 


0.0128 


S+1 


T 


46.9% 


129 


51.1% 


45 


0.5407 


0.6988 


ST+4 


A 


48.1% 


128 


57.1% 


49 


0.1538 


0.2564 


ST+5 


T 


44.4% 


133 


49.0% 


49 


0.4771 


0.5020 


ST+6 


C 


100.0% 


139 


100.0% 


49 


1.0000 


1 .0000 


ST+7 


G 


79.5% 


139 


85.7% 


49 


0.2294 


0.3049 


T1 


T 


86.8% 


140 


93.0% 


50 


0.1041 


0.3226 


T2 


C 


89.6% 


134 


94.8% 


48 


0.1494 


0.4752 


T+1 


C 


86.9% 


122 


92.6% 


47 


0.1838 


0.3875 


T+2 


G 


12.5% 


140 


14.0% 


50 


0.7290 


0.6834 


V-4 


G 


25.2% 


139 


26.5% 


49 


0.7889 


0.1160 


V-3 


A 


38.1% 


134 


41.8% 


49 


0.5461 


0.6617 


V-2 


T 


37.6% 


137 


41.0% 


50 


0.5508 


0.6328 


V-1 


C 


86.4% 


140 


94.0% 


50 


0.0454 


0.1307 


V1 


A 


97.8% 


137 


98.0% 


49 


1.0000 


1 .0000 


V2 


C 


97.5% 


138 


98.0% 


50 


1.0000 


1.0000 


V3 


T 


78.5% 


137 


79.4% 


46 


1.0000 


0.9547 


V4 


C 


75.4% 


140 


82.0% 


50 


0.2122 


0.2778 


V5 


A 


97.1% 


140 


98.0% 


49 


1.0000 


1.0000 


V6 


T |8.3% 


139 


9.0% 


50 


0.8352 


0.6515 


V7 


C |65.8% 


139 


74.0% 


48 


0.1635 


0.1885 
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TABLE 17 



BHR 
US 

population 


SNP 


ALLELE 


FREQUENCIES 


ALLELE 
P-VALUE 


GENOTYPE 
P-VALUE 


CNTL 


N 


CASE 


N 


A-1 


A 


95.5% 


77 


100.0% 


14 


0.5975 


0.5899 


D-2 


C 


99.3% 


75 


100.0% 


14 


1.0000 


1 .0000 


D-1 


G 


35.0% 


70 


37.5% 


12 


0.8204 


0.8258 


D1 


T 


1 00.0% 


76 


100.0% 


14 


1.0000 


1 .0000 


F1 


A 


94.8% 


77 


96.4% 


14 


1 .0000 


1 .0000 


F+1 


A 


32.6% 


69 


53.6% 


14 


0.0510 


0.0665 


G-1 


T 


91.8% 


73 


92.9% 


14 


1.0000 


1 .0000 


11 


A 


12.8% 


74 


28.6% 


14 


0.0455 


0.0463 


KL+1 


G 


94.2% 


77 


96.4% 


14 


1.0000 


1 .0000 


KL+2 


A 


29.2% 


77 


39.3% 


14 


0.3730 


0.271 1 


L-2 


A 


6.7% 


75 


7.1% 


14 


1.0000 


1 .0000 


L-1 


A 


8.0% 


75 


25.0% 


14 


0.0149 


0.0227 


L1 


C 


99.4% 


77 


100.0% 


14 


1.0000 


1 .0000 


M+1 


T 


8.0% 


75 


25.0% 


14 


0.0149 


0.0227 


Q-1 


T 


16.9% 


77 


21.4% 


14 


0.5910 


0.6593 


S1 


A 


10.4% 


77 


15.4% 


13 


0.4980 


0.4470 


S2 


C 


24.7% 


77 


46.4% 


14 


0.0233 


0.0331 


S+1 


A 


48.1% 


77 


62.5% 


12 


0.2724 


0.4060 


ST+4 


A 


57.1% 


77 


65.4% 


13 


0.5212 


0.7976 


ST+5 


C 


50.0% 


77 


60.7% 


14 


0.3130 


0.4007 


ST+6 


C 


98.7% 


77 


100.0% 


14 


1.0000 


1.0000 


ST+7 


A 


24.3% 


76 


28.6% 


14 


0.6391 


0.2476 


T1 


C 


7.8% 


77 


28.6% 


14 


0.0041 


0.0072 


T2 


T 


7.4% 


74 


21 .4% 


14 


0.0333 


0.0469 


T+1 


T 


8.0% 


69 


23.1% 


13 


0.0321 


0.0452 


T+2 


T 


89.6% 


77 


96.4% 


14 


0.4778 


0.4545 




G 


23.0% 


76 


28.6% 


14 


0.6296 


0.7242 


V-3 


G 


64.7% 


75 


67.9% 


14 


0.8311 


1 .0000 


V-2 


C 


64.7% 


75 


67.9% 


14 


0.8311 


1.0000 


V-1 


A 


17.1% 


76 


21.4% 


14 


0.5937 


0.6635 


V1 


A 


93.9% 


74 


96.4% 


14 


1.0000 


1 .0000 


V2 


C 


94.2% 


77 


96.4% 


14 


1.0000 


1.0000 


V3 


C 


23.4% 


77 


25.0% 


14 


0.8130 


0.7738 


V4 


G 


20.8% 


77 


25.0% 


14 


0.6206 


0.6767 


V5 


A 


94.7% 


75 


100.0% 


13 


0.6065 


0.5986 


V6 


T 


9.5% 


74 


10.7% 


14 


0.7369 


1.0000 


V7 


G 


32.2% 


76 


53.6% 


14 


0.0514 


0.0409 
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EXAMPLE 13: Haplotype analyses 

In addition to the analysis of individual SNPs, haplotype frequencies 
between the case and control groups were also compared. The haplotypes 
5 were constructed using a maximum likelihood approach. Existing software for 
predicting haplotypes was unable to utilize individuals with missing data. 
Accordingly, a program was developed to make use of all individuals. This 
allowed more accurate estimates of haplotype frequency. Haplotype analysis 
based on multiple SNPs in a gene was expected to provide increased evidence 

10 for an association between a given phenotype and that gene, if all haplotyped 
SNPs were involved in the characterization of the phenotype. Otherwise, allelic 
variation involving those haplotyped SNPs would not be associated more 
significantly with different risks or susceptibilities toward the phenotype. 

1 . Asthma phenotype : The estimated frequency of each haplotype 

15 was compared between cases and controls by a permutation test. An overall 
comparison of the distribution of all haplotypes between the two groups was 
also performed. In Tables 18, 19, and 20 the haplotype analysis (2-at-a-time) 
for all SNPs in Gene 216 is presented for the combined, the UK and the US 
populations, respectively. The diagonal entries represent the single SNP p~ 

20 values. The other entries represent the p-values for a test of association 
between the asthma phenotype and the four haplotypes defined by the 2 SNPs 
listed on the horizontal and vertical axes. The frequency of the individual SNPs 
in the cases and controls are shown at the bottom of the tables. Marked cells 
indicate p-values that were statistically significant. Medium gray cells represent 

25 p-values that are less or equal to 0.05 but greater than 0.01 , dark gray, boxed 
cells represent p-values that are less or equal to 0.01 but greater than 0.001, 
light gray, boxed cells represent p-values that are less or equal to 0.001 . 

As seen in Table 1 8, haplotypes defined by SNPs V4 & V1 , V-2 & ST+4, 
V-3 & ST+4, V4 & V2, and V5 & V4, yielded highly significant p-values of 

30 0.00035, 0.000043, 0.000040, 0.00039 and 0.00026 respectively. These 
values were more significant than the analysis of these SNPs alone (SNP V5 
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p = o.16; V4 p = 0.03; V2 p = 0.16; V1 p = 0.25; V-3 p = 0.68; V-2 p = 0.75; 
ST+4 p = 0.04). These associations were also more significant than the one 
observed for the single SNP V-1 reported above. The most significant 
association in Gene 216 was found in the UK population (Table 19). Five SNP 
5 combinations showed significance at the 0.001 level (SNPs V-2 & ST+4 p = 
0.000005; ST+5 & ST+4 p = 0.00047; ST+4 & S+1 p = 0.00039; V-3 & ST+4 
p = 0.000003, and ST+4 & S2 p = 0.00029; Table 19) in the UK population. 
Forty SNP combinations were significant at the 0.01 level in Gene 216 in the 
UK population (Table 19). In the US population, numerous SNP combinations 

1 0 were significant at the 0.01 level for Genes 216 (Table 20). 

All SNP combinations in Table 18, 19, and 20 that demonstrated a 
significant difference (p < 0.05) in the distribution of frequencies of the four 
haplotypes between the cases and the control populations were further 
analyzed to identify individual haplotypes that were also significant. Table 21 

1 5 presents the haplotypes that were significantly associated, at the 0.05 level of 
significance, with the asthma phenotype. Haplotypes with higher allele 
frequency in the case population than in the control population acted as risk 
factors that increased the susceptibility to asthma. Haplotypes with lower allele 
frequencies in the case population than in the control population acted as 

20 protective factors that decreased the susceptibility to asthma. 

In the combined populations, the two most significant haplotypes were 
protective and contained the C allele at SNP ST+4 in combination with the G 
allele at SNP V-3 (p = 0.00002) or the C allele at SNP V-2 (p = 0.00003). 
Additionally, haplotypes C/C (SNPs ST+4/V-4, p = 0.0004) and A/C (SNPs 

25 ST+7A/-2, p = 0.0005) were protective and significant at the 0.001 level of 
significance. Five haplotypes involving allele A at SNP ST+4 were 
susceptibility haplotypes, associated with an increased risk of asthma at the 
0.001 level of significance. They were haplotypes C/A (SNPs Q-1/ST+4, p = 
0.0005), C/A (SNPs KL+2/ST+4, p = 0.0007), A/G (SNPs ST+4/ST+7, p = 

30 0.0006), A/C (SNPs ST+4A/-1, p = 0.0006) and A/C (SNPs ST+4A/4, p = 
0.001 ). Other susceptibility haplotypes that were significant at the 0.001 level 
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were T/G (SNPs Q-1/T+2, p < 0.0001 ), G/A (SNPs T+2/V-1, p = 0.0009) and 
C/C (SNPs V-1A/4, p = 0.0009). 

A similar pattern was observed in the UK and US populations 
separately, where haplotypes involving the C allele in SNP ST+4 were 
5 protective while haplotypes involving the A allele in SNP ST+4 increased the 
susceptibility to asthma. In the UK population, the most significant haplotypes 
were protective and, as in the combined population, were C/G (SNPs ST+4/V- 
3, p = 0.000002) and C/C (SNPs ST+4/V-2, p = 0.000003). The following 
protective haplotypes were significant at the 0.0001 level: T/C (SNPs 

10 S+1/ST+4, p = 0.0002), C/T (SNPs ST+4/ST+5, p = 0.0002), A/C (SNPs 
ST+7A/-2, p = 0.0007), C/C (SNPs ST+4A/-4, p = 0.0006), and C/C (SNPs 
S2/V6, p = 0.0008). The susceptibility haplotypes, significant at the 0.001 
level, were G/A (SNPs F+1/ST+4, p = 0.0002), G/A (SNPs S1/ST+4, p = 
0.0005), G/A (SNPs S2/ST+4, p = 0.0001), C/A (SNPs Q-1/ST+4, p = 0.0006), 

15 C/A (SNPs KL+2/ST+4, p = 0.0008), AG (SNPs ST+4/ST+7, p = 0.0008), T/G 
(SNPs Q-1/T+2, p = 0.0001), G/C (SNPs L-1AM, p = 0.0009), A/C (SNPs 
ST+4/V-1, p = 0.0008), A/C (SNPs ST+4/V7, p < 0.0001) and A/C (SNPs 
ST+4/V4, p = 0.0004). In the US population, four susceptibility haplotypes 
involving allele A at SNP 11 were significant at the 0.001 level: A/A (SNPs 

20 I1/ST+4, p = 0.0004), A/T (SNPs 11/V3, p = 0.0009), A/C (SNPs I1A/2, p « 
0.0009) and A/A (SNPs 11 A/1 , p = 0.0006). 

In addition, haplotypes consisting of SNPs present in the mature mRNA 
were analyzed. Table 22 presents the 15-SNP haplotypes (for SNPs 
D1/F1/i1/L1/S1/S2/T1/T2/V1/V2A/3/V4A/5A/6A/7) in the combined and 

25 separate UK and US populations. In the combined populations, four 
haplotypes (T/A/A/C/G/CyT/C/A/C/C/G/A/C/C, T/A/G/C/A/C/T/C/A/C/T/G/A/C/G, 
T/A/G/C/G/G/T/C/A/C/T/C/A/C/C, and T/G/A/C/G/C/T/C/TrT/C/G/G/C/G) were 
significant at the 0.05 level. In the UK population, two haplotypes 
(T/A/G/C/A/C/T/C/A/CyT/G/A7C/G and T/A/G/C/G/GAT/C/A/C/T/C/A/C/G) were 

30 significant at the 0.05 level, and one (T/A/G/C/G/G/T/C/A/C/T/C/A/C/C) was 
significant at the 0.003 level. In the US population, three haplotypes 
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(T/A/A/C/G/C/C/T/A/C/T/C/A/T/G, T/A/G/C/G/G/T/C/A/C/T/G/A/C/C, and 
T/G/A/C/G/C/T/C/T/T/C/G/G/C/G) and the overall test were significant at the 
0.05 level. 

2. Bronchial Hyper-responsiveness : A similar test for association 
5 of 2~SNP~at~a-time haplotypes with BHR (PC 2 o £ 16 mg/ml) was performed. 
In Tables 23, 24, and 25, the haplotype analysis (2-at-a-time) for all SNPs in 
Gene 216 is presented for the combined US and UK populations, the UK 
population, and the US population, respectively. Two SNP combinations in 
Gene 216 were significant at the 0.01 level in the combined sample (Table 23: 

10 SNPs V-2 & ST+4, p = 0.0038, and SNPs V-3 & ST+4, p = 0.0042). In 
contrast, in the UK population, twenty-seven SNP combinations were 
significant at the 0.01 level in Gene 216 (Table 24). In the US population, ten 
SNP combinations were significant at the 0.01 level in Gene 216 (Table 25). 
Tables 18, 19, and 20 and Tables 23, 24, and 25 showed similar patterns of 

15 significance with lower level achieved in the BHR analysis due to the reduced 
sample size in the (PC20 ^ 16 mg/ml) subgroup. 

All SNP combinations in Table 23, 24, and 25, that demonstrated a 
significant difference (p < 0.05) in the distribution of frequencies of the four 
haplotypes between the cases and the control populations, were further 

20 analyzed to identify individual haplotypes that were also significant. Table 26 
presents the haplotypes that were significantly associated, at the 0.05 level of 
significance, with the BHR phenotype. In the combined populations, the two 
most significant haplotypes were protective and contained the C allele in SNP 
ST+4 in combination with the G allele at SNP V-3 (p = 0.0025) or the C allele 

25 at SNP V-2 (p = 0.0022). In the UK population, the most significant 
haplotypes, A/A (SNPs S1/S+1, p = 0.0001) and T/G (SNPs Q-1/T+2, p = 
0.0003), increased the susceptibility toward BHR. The protective haplotype 
C/T (SNPs S2/T+2) was significant at the 0.001 level. Due to the smaller 
sample size in the US population, no haplotype was significantly associated 

30 with BHR at the 0.001 level. 

As for the asthma yes/no phenotype, we analyzed haplotypes consisting 
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of SNPs present in the mature mRNA. Table 27 presents the 15-SNP 
haplotypes (for SNPs D1/F1/I1/L1/S1/S2/T1/T2A/1A/2A/3A/4A/5/ V6A/7) in the 
combined and separate UK and US populations. In the combined populations, 
one haplotype (T/G/A/C/G/C/T/C/T/T/C/G/G/C/G) was significant at the 0.05 
5 significance level. In the UK population, one haplotype 
(T/A/G/C/G/G/T/C/A/CAT/C/A/C/G) was significant at the 0.05 level .In the US 
population , three haplotypes (T/A/A/C/G/C/C/C/A/C/T/C/A/C/G , 
T/A/A/C/G/C/C/T/A/C/T/C/A/C/G and T/A/G/C/G/G/T/C/A/C/T/G/A/T/G) were 
significant at the 0.05 level. 

10 It is noted that for Tables 21 , 22, 26, and 27, the haplotypes are written 

without slashes separating each allele. Thus, the haplotype that is 
T/G/A/C/G/C/T/C/T/T/C/G/G/C/G is written as TGACGCTCTTCGGCG in Table 
27. This represents the short-hand designations for the haplotypes and is not, 
in anyway, meant to represent contiguous nucleotide sequences. 

15 In summary, haplotype analysis of SNPs significantly strengthened the 

evidence in support of Gene 216 as an asthma susceptibility gene. In some 
SNP combinations, the association was increased by an order of magnitude. 
The most striking association again appeared in the 3' region of the gene, in 
agreement with the single SNP analysis. 

20 
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Asthma Yes/No 
Combined 
US and UK 


SNP 

POMRI NATION 


HAP I OTYPE 


FREQUENCIES 


P-VALUE 


CNTL 


CASE 


U I/O 1 » *T 


TA 


U.0l4b 


u.oyt>y 


f) 0.449 


I— / 1 / O 1 » *-t 


TP 

1 w 


U.4oo4 


u.oyy^d 




F+1/ST+4 


vjj/-\ 


U.oUoU 


u.4zyi 


0 00?8 


O '11 0 1 ~*T 


TP 


U.Uol o 


n r\A do 
U.U1 oo 


0.0013 


S1/ST+4 


AA 

r\r\ 


U.1U1 o 


U.Uooo 


0.0321 


S 1 /ST+4 




n A A ~7 A 

U.41 f 1 


U.04D0 




S2/ST+4 


GA 


ri O A A "7 


u.4zyo 


0.0019 


0-1 /ST+4 

Vj<— 1 / O I * *T 


PA 


u.4uyi 


U.OOUo 


U.UUUu 


Vat*" I / O I * *T 


TA 


0.1069 


U.UoUb 


0 09D1 


Kl +9/ST+4 


AA 
r\r\ 


U.1Uo4 


u.uoi y 




l\L « /I/O 1 ' *T 


PA 


0.4048 


0.5487 


U.vUU r 






0.3082 


0.2234 


U.UZ I *r 


QT+4/ST+^J 


O | 


0.0475 


0.0132 




O I/O I t^O 


AT 
M I 


0.1050 


0.0539 




OZ/O I 


PT 
O 1 


0.1322 


0.0638 


U.U I 


Q9/QT+R 
oZ/o I 


O 1 


0.3319 


0.4252 


n H9Rn 
u.uzou 


VJ- I/O I 


O 1 


0.3449 


0.4328 


n H99Q 


*oi- l/o I +0 


TT 
1 1 


0.1 193 


0.0554 


n nnao 
u.uuyy 


Kl +9/QT+R 


AT 
M 1 


0.1342 


0.0482 




Kl +9/QT+t^ 
iaLt// o 1 to 


PT 
O 1 


0.3301 


0.4415 


u.uuoo 


M- l/o 1 t ( 


A A 
MM 


0.2149 


0.1423 


U.U I o/ 


A i /QT+7 
M- l/o 1 ■ / 


Mo 


0.7615 


0.8307 


n n^f^7 

U.UOOf 


U~ I/O I * / 


PA 


0.1228 


0.0453 


U.UUUu 


U- I/O 1 ^ f 




0.5013 


0.61 10 


n ni 99 

U.U I 


m /st+7 

U I /O I T f 


TA 

1 M 


U./Cl ob 


n A A oo 


n H149 

U.U I 


D 1 /ST+7 


TG 
i \j 


0.7814 


0.8539 


0.0186 


F1/ST+7 


AA 


0.1861 


0.1246 


0.0356 


F1/ST+7 


AG 


0.7816 


0.8446 


0.0455 


F1/ST+7 


GG 


0.0000 


0.0132 


0.0465 


G-1/ST+7 


TA 


0.1459 


0.0748 


0.0073 


G-1/ST+7 


TG 


0.7598 


0.8370 


0.0165 


M+1/ST+7 


GA 


0.2177 


0.1424 


0.0165 


M+1/ST+7 


GG 


0.6700 


0.7469 


0.0326 


L-2/ST+7 


GA 


0.1495 


0.0731 


0.0020 


L-2/ST+7 


GG 


0.7791 


0.8582 


0.0096 


L1/ST+7 


CA 


0.2186 


0.1423 


0.0132 


L1/ST+7 


CG 


0.7745 


0.8539 


0.0100 


ST+4/ST+7 


AA 


0.1108 


0.0542 


0.0328 
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ST+4/ST+7 


AG 


0.4038 


0.5472 


0.0006 


ST+5/ST+7 


TA 


0.1266 


0.0486 


0.0033 


ST+5/ST+7 


TG 


0.3376 


0.4395 


0.0089 


ST+6/ST+7 


CA 


0.2140 


0.1392 


0.0170 


ST+6/ST+7 


CG 


0.7814 


0.8530 


0.0225 


S+1/ST+7 


TA 


0.1504 


0.0594 


0.0013 


S+1/ST+7 


TG 


0.3363 


0.4154 


0.0441 


S1/ST+7 


AA 


0.1036 


0.0564 


0.0381 


S1/ST+7 


GG 


0.7822 


0.8546 


0.0213 


S2/ST+7 


CA 


0.1481 


0.0743 


0.0089 


Q-1/ST+7 


TA 


0.1473 


0.0750 


0.0054 


Q-1/ST+7 


CG 


0.7802 


0.8424 


0.0487 


Q-1/ST+7 


TG 


0.0025 


0.0128 


0.0325 


F+1/S+1 


AT 


0.1660 


0.0750 


0.0093 


S2/S+1 


CT 


0.1628 


0.0702 


0.0034 


Q-1/S+1 


TT 


0.1405 


0.0646 


0.0041 


KL+2/S+1 


AT 


0.1379 


0.0456 


0.0026 


D-1/S1 


CA 


0.1057 


0.0538 


0.0229 


D-1/S1 


GA 


0.0000 


0.0000 


0.0013 


D-1/S1 


CG 


0.5192 


0.6025 


0.0430 


D1/S1 


TA 


0.1053 


0.0538 


0.0224 


D1/S1 


TG 


0.8947 


0.9423 


0.0349 


L-2/S1 


AA 


0.0000 


0.0040 


0.0027 


L-2/S1 


GA 


0.1056 


0.0499 


0.0150 


L-2/S1 


GG 


0.8239 


0.8814 


0.0435 


KL+1/S1 


GA 


0.1052 


0.0536 


0.0215 


KL+1/S1 


GG 


0.8557 


0.9184 


0.0162 


KL+2/S1 


AA 


0.1056 


0.0543 


0.0223 


D-1/S2 


CC 


0.1249 


0.0508 


0.0120 


D-1/S2 


CG 


0.4996 


0.5940 


0.0293 


ST+4/T+1 


CT 


0.0300 


0.0000 


0.0182 


ST+7/T+1 


AC 


0.2182 


0.1424 


0.0140 


ST+7/T+1 


GC 


0.6704 


0.7450 


0.0387 


S1/T+1 


AC 


0.1057 


0.0475 


0.0111 


S1/T+1 


AT 


0.0000 


0.0063 


0.0178 


ST+7/T+2 


AG 


0.0000 


0.0000 


0.0478 


ST+7/T+2 


AT 


0.2189 


0.1418 


0.0128 


ST+7/T+2 


GT 


0.6636 


0.7475 


0.0173 


S1/T+2 


AG 


0.0000 


0.0054 


0.0151 


S1/T+2 


AT 


0.1052 


0.0485 


0.0107 


S1/T+2 


GT 


0.7773 


0.8408 


0.0352 


Q-1/T+2 


TG 


0.0000 


0.0142 


<0.0001 


Q-1/T+2 


CT 


0.7327 


0.8157 


0.0114 


Q-1/T+2 


TT 


0.1498 


0.0736 


0.0034 


S2/T2 


CC 


0.1690 


0.0942 


0.0082 
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A-1 A/-1 


AA 


0.1418 


0.0763 


0.0123 


A-1/V-1 


AC 


0.8346 


0.8967 


0.0291 


D-1/V-1 


CA 


0.1288 


0.0478 


0.0046 


D-1A/-1 


CC 


0.4959 


0.6089 


0.0104 


D-2A/-1 


CA 


0.1482 


0.0763 


0.0067 


D-2A/-1 


CC 


0.8448 


0.9158 


0.0093 


D1A/-1 


TA 


0.1481 


0.0763 


0.0062 


D1A/-1 


TC 


0.8519 


0.9198 


0.0088 


F+1/V-1 


AA 


0.1451 


0.0763 


0.0094 


F1A/-1 


AA 


0.1 158 


0.0579 


0.0187 


F1/V-1 


AC 


0.8520 


0.9114 


0.0302 


F1/V-1 


GC 


0.0000 


0.0123 


0.0496 


G-1A/-1 


TA 


0.1424 


0.0763 


0.0106 


G-1 AM 


TC 


0.7649 


0.8372 


0.0228 


11AM 


GA 


0,1 187 


0.0526 


0.0093 


M+1AM 


GA 


0.1482 


0.0749 


0.0046 


M+1/V-1 


TA 


0.0000 


0.0014 


0.0222 


M+1A/-1 


GC 


0.7393 


0.8144 


0.0289 


L-1A/-1 


AA 


0.0000 


0.0013 


0.0077 


IMAM 


GA 


0.1482 


0.0751 


0.0053 


IMAM 


GC 


0.741 1 


0.8133 


0.0306 


L-2AM 


GA 


0.1478 


0.0745 


0.0047 


L-2AM 


GC 


0.7815 


0.8568 


0.0172 


L1AM 


CA 


0.1482 


0.0763 


0.0043 


L1A/-1 


CC 


0.8449 


0.9198 


0.0040 


ST+4AM 


AA 


0.1078 


0.0529 


0.0245 


ST+4AM 


AC 


0.4083 


0.5477 


0.0006 


ST+5AM 


TA 


0.1199 


0.0600 


0.0155 


ST+5A/-1 


TC 


0.3443 


0.4280 


, 0.0306 


ST+6AM 


CA 


0.1463 


0.0729 


0.0043 


ST+6AM 


CC 


0.8491 


0.9194 


0.0070 


ST+7/V-1 


AA 


0.1490 


0.0763 


0.0057 


ST+7AM 


GC 


0.7827 


0.8547 


0.0188 


S+1A/-1 


TA 


0.1412 


0.0763 


0.0129 


S1 AM 


AA 


0.1046 


0.0594 


0.0469 


S1A/-1 


GC 


0.8523 


0.9237 


0.0042 


S2/V-1 


CA 


0.1475 


0.0763 


0.0057 


T+1A/-1 


LA 


0.1482 


U.U/44 


(J.U(J4z 


T+1AM 


TA 


0.0000 


0.0019 


0.0089 


T+1A/-1 


CC 


0.7386 


0.8128 


0.0280 


T+2A/-1 


GA 


0.0000 


0.0064 


0.0009 


T+2A/-1 


TA 


0.1482 


0.0700 


0.0015 


T+2A/-1 


TC 


0.7342 


0.8193 


0.0076 


T1/V-1 


CA 


0.0000 


0.0003 


0.0194 


T1AM 


TA 


0.1482 


0.0760 


0.0050 
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T1/V-1 


TC 


0.7389 


0.8057 


0.0467 


T2A/-1 


CA 


0.1482 


0.0763 


0.0057 


V-2/V-1 


CA 


0.1475 


0.0763 


0.0056 


V-3A/-1 


GA 


0.1475 


0.0763 


0.0048 


V-4A/-1 


CA 


0.1475 


0.0763 


0.0074 


Q-1A/-1 


TA 


0.1475 


0.0763 


0.0057 


Q-1AM 


CC 


0.8502 


0.9122 


0.0183 


KL+1/V-1 


GA 


0.1115 


0.0567 


0.0176 


KL+1A/-1 


GC 


0.8494 


0.9153 


0.0129 


KL+2A/-1 


AA 


0.1157 


0.0573 


0.0153 


KL+2A/-1 


CC 


0.6805 


0.7530 


0.0401 


ST+4A/-2 


AC 


0.5145 


0.5990 


0.0390 


ST+4/V-2 


CC 


0.1137 


0.0232 


0.00003 


ST+7AA-2 


AC 


0.1672 


0.0700 


0.0005 


ST+7A/-2 


GC 


0.4655 


0.5494 


0.0449 


Q-1A/-2 


TC 


0.1461 


0.0713 


0.0056 


ST+4A/-3 


CG 


0.1118 


0.0233 


0.00002 


ST+7/V-3 


AG 


0.1629 


0.0675 


0.0013 


Q-1/V-3 


TG 


0.1460 


0.0701 


0.0049 


KL+2/V-3 


AG 


0.1262 


0.0513 


0.0071 


G-1A/-4 


CC 


0.0387 


0.0050 


0.0130 


ST+4A/-4 


AC 


0.5160 


0.5966 


0.0489 


ST+4/V-4 


CC 


0.2409 


0.1311 


0.0004 


ST+7A/-4 


AC 


0.1671 


0.0786 


0.0029 


V-1/V7 


AG 


0.1425 


0.0763 


0.0103 


ST+7A/6 


AC 


0.1413 


0.0752 


0.0128 


S2A/6 


CC 


0.2563 


0.1671 


0.0083 


S2/V6 


GC 


0.6570 


0.7374 


0.0295 


V-1A/6 


AC 


0.1478 


0.0763 


0.0065 


ST+7A/5 


AA 


0.1839 


0.1256 


0.0427 


ST+7A/5 


GA 


0.7788 


0.8578 


0.0103 


S1A/5 


AA 


0.1048 


0.0538 


0.0257 


S1/V5 


GA 


0.8580 


0.9306 


0.0044 


V-1/V5 


AA 


0.1132 


0.0594 


0.0185 


V-1A/5 


CA 


0.8494 


0.9237 


0.0039 


V4A/5 


CA 


0.7345 


0.8359 


0.0022 


V4A/5 


GA 


0.2283 


0.1481 


0.0112 


V4A/5 


CG 


0.0328 


0.0000 


0.0083 


D-1/V4 


CC 


0.5072 


0.6171 


0.0147 


D-1/V4 


CG 


0.1178 


0.0423 


0.0048 


D1A/4 


TC 


0.7673 


0.8321 


0.0404 


D1A/4 


TG 


0.2327 


0.1641 


0.0275 


F1/V4 


AC 


0.7379 


0.8223 


0.0099 


F1A/4 


AG 


0.2299 


0.1469 


0.0079 


F1A/4 


GG 


0.0029 


0.0172 


0.0386 
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ST+4/V4 


AC 


0.4053 


0.5468 


0.0010 


ST+4A/4 


AG 


0.1086 


0.0548 


0.0467 


ST+7/V4 


GC 


0.6602 


0.7614 


0.0046 


V-1A/4 


AC 


0.0423 


0.0042 


0.0035 


V-1A/4 


CC 


0.7250 


0.8317 


0.0009 


V2A/4 


CC 


0.7358 


0.8359 


0.0023 


V2A/4 


TC 


0.0315 


0.0000 


0.0075 


V2/V4 


CG 


0.2270 


0.1485 


0.0111 


V1/V4 


AC 


0.7369 


0.8359 


0.0024 


V1/V4 


TC 


0.0304 


0.0000 


0.0091 


V1A/4 


AG 


0.2277 


0.1450 


0.0078 


Q-1A/4 


CC 


0.7226 


0.8276 


0.0016 


Q-1/V4 


TC 


0.0446 


0.0083 


0.0063 


KL+W4 


GC 


0.7324 


0.8266 


0.0044 


KL+1A/4 


GG 


0.2285 


0.1454 


0.0075 


KL+2A/4 


AG 


0.1205 


0.0537 


0.0107 


S1A/3 


AT 


0.1050 


0.0537 


0.0224 


F1A/2 


GC 


0.0000 


0.0133 


0.0076 


ST+4A/2 


AC 


0.5158 


0.6009 


0.0372 


ST+7/V2 


GC 


0.7817 


0.8578 


0.0144 


S1A/2 


AC 


0.1048 


0.0536 


0.0272 


S1A/2 


GC 


0.8580 


0.9308 


0.0043 


V-1/V2 


AC 


0.1105 


0.0603 


0.0307 


V-1/V2 


CC 


0.8520 


0.9237 


0.0061 


Q-1/V2 


CC 


0.8502 


0.9122 


0.0170 


F1A/1 


GA 


0.0000 


0.0116 


0.0241 


ST+7A/1 


AA 


0.1810 


0.1226 


0.0420 


ST+7/V1 


GA 


0.7817 


0.8578 


0.0153 


S1A/1 


AA 


0.1042 


0.0536 


0.0281 


S1/V1 


GA 


0.8603 


0.9271 


0.0088 


V-1/V1 


AA 


0.1111 


0.0573 


0.0190 


V-1A/1 


CA 


0.8520 


0.9237 


0.0058 


Q-1/V1 


CA 


0.8502 


0.9122 


0.0162 


A-1/Q-1 


AT 


0.1435 


0.0878 


0.0350 


D-1/Q-1 


CC 


0.4945 


0.6157 


0.0066 


D-1/Q-1 


CT 


0.1302 


0.0404 


0.0018 


D1/Q-1 


TC 


0.8502 


0.9084 


0.0288 


D1/Q-1 


TT 


0.1498 


0.0878 


0.0224 


F1/Q-1 


GC 


0.0000 


0.0124 


0.0499 
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Asthma Yes/No 
UK Population 


SNP 

COMBINATION 


HAPLOTYPE 


FREQUENCIES 


P-VALUE 


CNTL 


CASE 


A-1/F+1 


AA 


0.3566 


0.2591 


0.0326 


D1/F+1 


TA 


0.3594 


0.2582 


0.0243 


D1/F+1 


TG 


0.6406 


0.7370 


0.0299 


F+1/G-1 


AT 


0.2593 


0.1505 


0.0067 


F+1/G-1 


GT 


0.6416 


0.7475 


0.0169 


KL+2/M+1 


AG 


0.2845 


0.2089 


0.0491 


KL+2/M+1 


CG 


0.5863 


0.7094 


0.0044 


KL+2/L-1 


AG 


0.2845 


0.2089 


0.0463 


KL+2/L-1 


CG 


0.5886 


0.7083 


0.0038 


F+1/L-2 


AG 


0.2825 


0.1781 


0.0113 


F+1/L-2 


GG 


0.6453 


0.7449 


0.0265 


KL+2/L1 


AC 


0.2842 


0.2039 


0.0364 


KL+2/L1 


CC 


0.7087 


0.7913 


0.0341 


KL+2/L1 


AT 


0.0000 


0.0048 


0.0013 


A-1/ST+4 


AC 


0.5160 


0.4079 


0.0279 


D-1/ST+4 


CA 


0.3518 


0.5012 


0.0036 


D-1/ST+4 


CC 


0.2594 


0.1556 


0.0097 


D1/ST+4 


TA 


0.4805 


0.5874 


0.0297 


D1/ST+4 


TC 


0.5195 


0.4078 


0.0243 


F+1/ST+4 


GA 


0.2879 


0.4771 


0.0002 


I1/ST+4 


GA 


0.3783 


0.5243 


0.0048 


M+1/ST+4 


GA 


0.3805 


0.5102 


0.0104 


L-1/ST+4 


GA 


0.3817 


0.5088 


0.0099 


L-2/ST+4 


GA 


0.4814 


0.5927 


0.0201 


L-2/ST+4 


GC 


0.4456 


0.3304 


0.0115 


L1/ST+4 


CA 


0.4730 


0.5923 


0.0164 


L1/ST+4 


CC 


0.5199 


0.4029 


0.01 54 


S+1/ST+4 


TA 


0.3623 


0.4892 


0.0096 


S+1/ST+4 


TC 


0.1076 


0.01 66 


0.0002 


S1/ST+4 


AA 


0.1004 


0.0481 


0.0469 


S1/ST+4 


GA 


U.3ob9 


0.5440 


0.0005 


S1/ST+4 


GC 


0.5062 


0.4080 


0.0478 


S2/ST+4 


GA 


0.2839 


0.4669 


0.0001 


Q-1/ST+4 


CA 


0.3894 


0.5556 


0.0006 


Q-1/ST+4 


TA 


0.0933 


0.0367 


0.0349 


Q-1/ST+4 


CC 


0.4713 


0.3675 


0.0275 


KL+2/ST+4 


CA 


0.3767 


0.5474 


0.0008 


KL+2/ST+4 


CC 


0.3392 


0.2451 


0.0321 


F+1/ST+5 


AT 


0.1239 


0.0516 


0.0325 


F+1/ST+5 


GT 


0.3209 


0.4485 


0.0084 


ST+4/ST+5 


AT 


0.3673 


0.4973 


0.0075 
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ST+4/ST+5 


CT 


0.0745 


0.0053 


0.0002 


S1/ST+5 


AT 


0.1060 


0.0481 


0.0194 


S1/ST+5 


GT 


0.3389 


0.4514 


0.0162 


S2/ST+5 


CT 


0.1158 


0.0535 


0.0441 


S2/ST+5 


GT 


0.3280 


0.4471 


0.0146 


Q-1/ST+5 


CT 


0.3321 


0.4553 


0.0091 


Q-1/ST+5 


TT 


0.1116 


0.0446 


0.0130 


KL+2/ST+5 


AT 


0.1371 


0.0486 


0.0112 


KL+2/ST+5 


CT 


0.3069 


0.4528 


0.0026 


A-1/ST+6 


AC 


0.9889 


0.9566 


0.0352 


A-1/ST+6 


AT 


0.0000 


0.0097 


0.0420 


F+1/ST+6 


AC 


0.3594 


0.2558 


0.0219 


F+1/ST+6 


GC 


0.6406 


0.7345 


0.0379 


ST+4/ST+6 


AC 


0.4805 


0.5824 


0.0315 


ST+4/ST+6 


CC 


0.5195 


0.4078 


0.0200 


S1/ST+6 


AC 


0.1061 


0.0435 


0.0141 


S1/ST+6 


GC 


0.8939 


0.9468 


0.0448 


S2/ST+6 


CC 


0.2714 


0.1563 


0.0024 


S2/ST+6 


GC 


0.7286 


0.8339 


0.0059 


Q-1/ST+6 


TC 


0.1393 


0.0725 


0.0275 


KL+2/ST+6 


AC 


0.2842 


0.2057 


0.0441 


A-1/ST+7 


TG 


0.0050 


0.0337 


0.0331 


D-1/ST+7 


CA 


0.1215 


0.0403 


0.0079 


D-1/ST+7 


CG 


0.4896 


0.6163 


0.0109 


F+1/ST+7 


GG 


0.6436 


0.7514 


0.0136 


M+1/ST+7 


GA 


0.2044 


0.1357 


0.0478 


M+1/ST+7 


TA 


0.0000 


0.0003 


0.0318 


M+1/ST+7 


GG 


0.6647 


0.7825 


0.0050 


L-1/ST+7 


AA 


0.0000 


0.0001 


0.0141 


L-1/ST+7 


GG 


0.6671 


0.7814 


0.0062 


L-2/ST+7 


GA 


0.1374 


0.0583 


0.0049 


L-2/ST+7 


GG 


0.791 1 


0.8648 


0.0346 


ST+4/ST+7 


AG 


0.3873 


0.5539 


0.0008 


ST+4/ST+7 


CG 


0.4077 


0.3100 


0.0358 


ST+5/ST+7 


TA 


0.1173 


0.0378 


0.0063 


ST+5/ST+7 


TG 


0.3264 


0.4616 


0.0044 


ST+6/ST+7 


CA 


0.2050 


0.1320 


0.0388 


S+l/oT+7 


TA 


0.1356 


0.0466 


0.0053 


S+1/ST+7 


TG 


0.3319 


0.4491 


0.0167 


S2/ST+7 


CA 


0.1338 


0.0612 


0.0187 


S2/ST+7 


GG 


0.6567 


0.7646 


0.0124 


Q-1/ST+7 


TA 


0.1355 


0.0601 


0.0078 


F+1/S+1 


AT 


0.1466 


0.0594 


0.0180 


F+1/S+1 


GT 


0.3228 


0.4347 


0.0234 


S1/S+1 


AT 


0.1061 


0.0481 


0.0228 
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S2/S+1 


CT 


0.1383 


0.0594 


0.0253 


S2/S+1 


GT 


0.3307 


0.4341 


0.0361 


Q-1/S+1 


CT 


0.3373 


0.4446 


0.0208 


Q-1/S+1 


TT 


0.1295 


0.0494 


0.0050 


KL+2/S+1 


AT 


0.1417 


0.0455 


0.0039 


KL+2/S+1 


CT 


0.3269 


0.4547 


0.0105 


A-1/S1 


AA 


0.0982 


0.0481 


0.0473 


A-1/S1 


TA 


0.0107 


0.0000 


0.0233 


A-1/S1 


TG 


0.0000 


0.0337 


0.0033 


D-1/S1 


CA 


0.1069 


0.0481 


0.0213 


D-1/S1 


GA 


0.0000 


0.0000 


0.0063 


D-1/S1 


CG 


0.5058 


0.6053 


0.0326 


D1/S1 


TA 


0.1061 


0.0481 


0.0281 


D1/S1 


TG 


0.8939 


0.9471 


0.0439 


F+1/S1 


AA 


0.1052 


0.0481 


0.0289 


F+1/S1 


GG 


0.6393 


0.7464 


0.0134 


I1/S1 


GA 


0.1059 


0.0481 


0.0272 


H/S1 


GG 


0.7308 


0.8440 


0.0035 


M+1/S1 


GA 


0.1061 


0.0477 


0.0251 


M+1/S1 


GG 


0.7632 


0.8705 


0.0041 


L-1/S1 


AA 


0.0000 


0.0003 


0.0453 


L-1/S1 


GA 


0.1061 


0.0478 


0.0246 


L-1/S1 


GG 


0.7658 


0.8697 


0.0031 


KL+2/S1 


AA 


0.1067 


0.0481 


0.0210 


A-1/S2 


AC 


0.2661 


0.1602 


0.0054 


A-1/S2 


AG 


0.7227 


0.8061 


0.0351 


A-1/S2 


TG 


0.0058 


0.0337 


0.0430 


D-1/S2 


CC 


0.1196 


0.0437 


0.0144 


D-1/S2 


CG 


0.4913 


0.6091 


0.0175 


D-2/S2 


CC 


0.2700 


0.1602 


0.0047 


D-2/S2 


CG 


0.7228 


0.8298 


0.0062 


D1/S2 


TC 


0.2714 


0.1602 


0.0042 


D1/S2 


TG 


0.7286 


0.8350 


0.0048 


F+1/S2 


AC 


0.2575 


0.1527 


0.0048 


F+1/S2 


GG 


0.6159 


0.7381 


0.0055 


F1/S2 


AC 


0.2500 


0.1408 


0.0035 


F1/S2 


AG 


0.7286 


0.8400 


0.0034 


G-1/S2 


TC 


0.2605 


0.1453 


0.0017 


G-1/S2 


TG 


0.6414 


0.7558 


0.0070 


I1/S2 


GC 


0.1127 


0.0521 


0.0196 


I1/S2 


GG 


0.7245 


0.8356 


0.0042 


M+1/S2 


GC 


0.1429 


0.0777 


0.0261 


M+1/S2 


GG 


0.7286 


0.8405 


0.0029 


L-1/S2 


GC 


0.1464 


0.0806 


0.0265 


L-1/S2 


GG 


0.7286 


0.8353 


0.0060 
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L-2/S2 


GC 


0.2714 


0.1603 


0.0031 


L-2/S2 


GG 


0.6550 


0.7628 


0.0086 


L1/S2 


CC 


0.2714 


0.1602 


0.0037 


L1/S2 


CG 


0.7214 


0.8350 


0.0032 


S1/S2 


AC 


0.1056 


0.0481 


0.0258 


S1/S2 


GG 


0.7286 


0.8402 


0.0037 


Q-1/S2 


TC 


0.1393 


0.0769 


0.0430 


Q-1/S2 


CG 


0.7286 


0.8405 


0.0048 


KL+1/S2 


GC 


0.2429 


0.1402 


0.0061 


KL+1/S2 


GG 


0.7286 


0.8400 


0.0039 


KL+2/S2 


AC 


0.1220 


0.0456 


0,0150 


KL+2/S2 


CG 


0.5661 


0.6769 


0.0145 


ST+4/T+1 


AC 


0.3961 


0.5021 


0.0315 


ST+4/T+1 


CT 


0.0449 


0.0000 


0.0160 


ST+7/T+1 


GC 


0.6645 


0.7748 


0.0088 


S1/T+1 


AC 


0.1068 


0.0476 


0.0248 


S1/T+1 


GC 


0.7600 


0.8629 


0.0068 


S2/T+1 


CC 


0.1539 


0.0781 


0.0142 


S2/T+1 


GC 


0.7233 


0.8348 


0.0037 


Q-1/T+1 


CC 


0.7277 


0.8334 


0.0064 


Q-1/T+1 


TC 


0.1393 


0.0769 


0.0300 


ST+4/T+2 


AT 


0.4805 


0.5898 


0.0213 


ST+4/T+2 


CT 


0.3945 


0.2852 


0.0158 


S1/T+2 


AG 


0.0000 


0.0090 


0.0094 


S1/T+2 


AT 


0.1061 


0.0391 


0.0094 


S2/T+2 


CG 


0.0000 


0.0014 


0.0367 


S2/T+2 


CT 


0.2714 


0.1590 


0.0033 


S2/T+2 


GT 


0.6036 


0.7160 


0.0086 


Q-1/T+2 


TG 


0.0000 


0.0219 


0.0001 


Q-1/T+2 


CT 


0.7357 


0.8200 


0.0286 


Q-1/T+2 


TT 


0.1393 


0.0550 


0.0040 


ST+4/T1 


AT 


0.3801 


0.5053 


0.0106 


ST+7/T1 


AC 


0.0000 


0.0000 


0.0411 


ST+7/T1 


GT 


0.6634 


0.7774 


0.0060 


S1/T1 


AT 


0.1061 


0.0480 


0.0250 


S1/T1 


GT 


0.7617 


0.8654 


0.0046 


S2/T1 


CT 


0.1435 


0.0781 


0.0276 


S2/T1 


GT 


0.7244 


0.8353 


0.0030 


Q-1/T1 


CT 


0.7286 


0.8365 


0.0041 


Q-1/T1 


TT 


0.1393 


0.0769 


0.0320 


KL+2/T1 


CT 


0.5834 


0.7045 


0.0058 


ST+4/T2 


AC 


0.4031 


0.5091 


0.0300 


ST+4/T2 


CT 


0.0282 


0.0000 


0.0472 


S2/T2 


CC 


0.1674 


0.0802 


0.0044 


S2/T2 


GC 


0.7243 


0.8405 


0.0020 
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A- 1AM 


AA 


0.1292 


0.0625 


0.0197 


A- 1AM 


TC 


0.0044 


0.0337 


0.0209 


D-1A/-1 


CA 


0.1227 


0.0408 


0.0032 


D-1/V-1 


CC 


0.4893 


0.6139 


0.0107 


D-2A/-1 


CA 


0.1357 


0.0625 


0.0105 


D-2AM 


CC 


0.8571 


0.9276 


0.0204 


D1A/-1 


TA 


0.1357 


0.0625 


0.0112 


D1/V-1 


TC 


0.8643 


0.9327 


0.0143 


F+1/V-1 


AA 


0.1357 


0.0625 


0.0112 


F+1A/-1 


GC 


0.6385 


0.7478 


0.0135 


F1AM 


AA 


0.1143 


0.0483 


0.0152 


F1A/-1 


AC 


0.8643 


0.9323 


0.0175 


G-1/V-1 


TA 


0.1280 


0.0625 


0.0209 


11AM 


GA 


0.1152 


0.0487 


0.0165 


I1A/-1 


GC 


0.7217 


0.8436 


0.0033 


M+1AM 


GA 


0.1357 


0.0625 


0.0066 


M+1AM 


GC 


0.7336 


0.8558 


0.0011 


L-1/V-1 


GA 


0.1357 


0.0625 


0.0091 


IMAM 


GC 


0.7361 


0.8549 


0.0009 


L-2AM 


GA 


0.1357 


0.0625 


0.0079 


L-2/V-1 


GC 


0.7910 


0.8606 


0.0476 


L1A/-1 


CA 


0.1357 


0.0625 


0.0099 


L1/V-1 


CC 


0.8571 


0.9327 


0.0077 


ST+4A/-1 


AA 


0.0951 


0.0400 


0.0412 


ST+4A/-1 


AC 


0.3881 


0.5522 


0.0008 


ST+5AM 


TA 


0.1126 


0.0513 


0.0222 


ST+5A/-1 


TC 


0,3312 


0.4483 


0.0121 


ST+6/V-1 


CA 


0.1357 


0.0580 


0.0068 


ST+6A/-1 


CC 


0.8643 


0.9323 


0.0183 


ST+7AM 


AA 


0.1357 


0.0625 


0.0076 


S+1/V-1 


TA 


0.1308 


0.0625 


0.0142 


S+1/V-1 


TC 


0.3360 


0.4299 


0.0427 


S1AM 


AA 


0.1056 


0.0481 


0.0242 


S1A/-1 


GC 


0.8643 


0.9375 


0.0080 


S2AM 


CA 


0.1357 


0.0625 


0.0111 


S2A/-1 


GC 


0.7286 


0.8404 


0.0033 


T+1A/-1 


CA 


0.1357 


0.0625 


0.0082 


T+1/V-1 


CC 


0.7313 


0.8480 


0.0031 


T+2A/-1 


GA 


0.0000 


0.0124 


0.001 1 


T+2A/-1 


TA 


0.1357 


0.0501 


0.0028 


T+2AM 


TC 


0.7393 


0.8249 


0.0252 


T1/V-1 


TA 


0.1357 


0.0625 


0.0076 


T1/V-1 


TC 


0.7321 


0.8510 


0.001 1 


T2A/-1 


CA 


0.1357 


0.0625 


0.0085 


T2A/-1 


CC 


0.7595 


0.8560 


0.0109 
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V-2/V-1 


CA 


0.1357 


0.0625 


0.0078 


V-3A/-1 


GA 


0.1357 


0.0625 


0.0076 


V-4A/-1 


CA 


0.1357 


0.0625 


0.0081 


Q-1A/-1 


TA 


0.1357 


0.0625 


0.0096 


Q-1A/-1 


CC 


0.8607 


0.9231 


0.0327 


KL+1AM 


GA 


0.1071 


0.0471 


0.0157 


KL+1A/-1 


GC 


0.8643 


0.9322 


0.0185 


F+1/V-2 


AC 


0.2594 


0.1241 


0.0016 


F+1A/-2 


GC 


0.3646 


0.4832 


0.0147 


ST+4A/-2 


AC 


0.4811 


0.5912 


0.0214 


ST+4/V-2 


CC 


0.1352 


0.0194 


0.000003 


ST+7/V-2 


AC 


0.1518 


0.0468 


0.0007 


S2/V-2 


CC 


0.2602 


0.1415 


0.0026 


S2A/-2 


GC 


0.3611 


0.4648 


0.0271 


Q-1 A/-2 


TC 


0.1333 


0.0500 


0.0047 


KL+2A/-2 


AC 


0.1372 


0.0452 


0.0032 


F+1A/-3 


AG 


0.2594 


0.1198 


0.0017 


F+1/V-3 


GG 


0.3604 


0.4779 


0.0188 


ST+4A/-3 


AG 


0.4846 


0.5861 


0.0337 


ST+4A/-3 


CG 


0.1352 


0.0195 


0.000002 


ST+7A/-3 


AG 


0.1448 


0.0422 


0.0017 


S2A/-3 


CG 


0.2656 


0.1393 


0.0018 


S2/V-3 


GG 


0.3552 


0.4609 


0.0253 


Q-1A/-3 


TG 


0.1332 


0.0464 


0.0031 


KL+2A/-3 


AG 


0.1334 


0.0402 


0.0025 


F+1A/-4 


AC 


0.2730 


0.1336 


0.0013 


F+1A/-4 


GC 


0.4752 


0.5994 


0.0134 


ST+4A/-4 


AC 


0.4830 


0.5888 


0.0283 


ST+4A/-4 


CC 


0.2670 


0.1405 


0.0006 


S2A/-4 


CC 


0.2635 


0.1608 


0.0116 


ST+4A/7 


AC 


0.2894 


0.4602 


<0.0001 


ST+5A/7 


TC 


0.3313 


0.4526 


0.0101 


ST+5/V7 


TG 


0.1135 


0.0475 


0.0319 


ST+6A/7 


CG 


0.3417 


0.2551 


0.0459 


S1/V7 


GC 


0.6588 


0.7437 


0.0491 


S1A/7 


AG 


0.1044 


0.0481 


0.0265 


S2/V7 


GC 


0.6442 


0,7297 


0.0480 


S2/V7 


CG 


0.2557 


0.1424 


0.0025 


V-1/V7 


CC 


0.6543 


0.7440 


0.0373 


V-1A/7 


AG 


0.1311 


0.0625 


0.0160 


V-2/V7 


CC 


0.3626 


0.4677 


0.0309 


V-2A/7 


CG 


0.2615 


0.1394 


0.0032 


V-3/V7 


GC 


0.3577 


0.4643 


0.0319 


V-3A/7 


GG 


0.2625 


0.1332 


0.0041 


V-4A/7 


CC 


0.4828 


0.5823 


0.0361 
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V-4/V / 


CG 


U./lbo4 




U.UU49 


Vb/V ( 


uu 


n OCQ7 

U.boo/ 


U. /449 


(~\ r\A no 


Vb/V ( 


CG 


0.2583 


r\ a coo 

0.1 oo9 


n A/177 

0.0077 


\ f A A / ~7 

V4/W 


GG 


0.1376 


f\ r\ A O A 

U.0481 


U.U100 


r+1/Vb 


AC 


r\ 0700 

0.2728 


0.1677 


0.0085 


F+1/V6 


GC 


0.6434 


0.7361 


0.0366 


on A /c 

oz/Vd 


cc 


r\ r>oo ft 

0.2630 


0.1341 


r\ An An 

0.0008 


S2A/6 


GC 


0.6543 


0.7697 


0.0067 


V-1/V6 


AC 


0.1357 


0.0625 


0.0070 


S1 A/5 


AA 


0.1052 


ft r\ a o a 

0.0481 


0.0272 


S1 A/5 


GA 


ft ft ft /■»*-» 

0.8662 


0.9374 


0.0134 


S2A/5 


CA 


0.2429 


0.1457 


0.0096 


S2/V5 


GA 


0.7286 


ft noft ft 

0.8399 


0.0030 


\ / A A /C 

V- i/Vo 


A A 

AA 


0.1071 


ft A/1 Oil 

0.0481 


0.0170 


V-1/V5 


CA 


0.8643 


0.9375 


r\ ftft ft ft 

0.0092 


V4A/5 


CA 


0.7322 


0.8365 


0.0058 


\ //I A /C 

V4/V5 


/"» A 

GA 


ft. ft o 

0.2392 


0.1490 


0.0125 


A -1 A //I 

A-1/V4 


AG 


0.2406 


0.1598 


0.0286 


U-1/V4 


CC 


0.4933 


0.6166 


ft ftH ftft 

0.0129 


U-1/V4 


CG 


0.1 175 


0.0436 


ft ft A ft A 

0.0104 


D1A/4 


TC 


0.7536 


0.8317 


0.0347 


D1/V4 


TG 


0.2464 


0.1635 


ft nAno 

0.0282 


F+1 A/4 


GC 


0.5378 


0.6500 


0.0227 


F+1 A/4 


AG 


0.1461 


0.0698 


0.0294 


C>i A 1 A 

M/V4 


AC 


0.7357 


0.8305 


0.0142 


CT H A 1 A 

F1/V4 


AG 


ft ft A ftft 

0.2429 


0.1501 


0.01 13 


11 A/4 


GC 


0.6087 


0.7379 


0.0052 


M+1/V4 


GC 


0.631 3 


0.7548 


ft ft ft O ft 

0.0033 


IVI+1/V4 


GG 


0.2378 


0.1635 


0.0415 


t H A //I 

L-1/V4 


GC 


0.6336 


0.7539 


0.0045 


1 H A 1 A 

L-1/V4 


GG 


0.2385 


0.1635 


ft ft A A\ ft 

0.0418 


0~T_i_ /I A I A 

o 1 +4/V4 


AC 


0.3687 


0.5492 


0.0004 


ST+4A/4 


CC 


0.3849 


0.2874 


0.0370 


0"T"_i_M A / /I 

b 1 +4/V4 


AG 


0.1 112 


0.0437 


ftft <-» /~i 
0.0280 


ST+5/V4 


TC 


0.3273 


0.4535 


0.01 1 1 


b 1 +b/V4 


TG 


0.1 163 


0.0473 


0.0218 


ST+6/V4 


CC 


0.7536 


0.8305 


0.0404 


o 1 ^D/v4 


CG 


U.z4o4 




U.uZo 1 


S2A/4 


GC 


0.6128 


0.7342 


0.0046 


S2A/4 


CG 


0.1307 


0.0575 


0.0224 


T+1A/4 


CC 


0.6337 


l_ 0.7460 


0.0108 


T+2A/4 


TG 


0.1299 


0.0588 


0.0148 


T1/V4 


TC 


0.6314 


0.7500 


0.0060 


T1A/4 


TG 


0.2365 


0.1635 


0.0487 


V-1/V4 


CC 


0.7287 


0.8312 


0.0058 



-202- 



WO 02/083077 



PCT/US02/12063 



V-1/V4 


AG 


0.1109 


0.0572 


0.0487 


V-3A/4 


GG 


0.1290 


0.0465 


0.0169 


V3/V4 


TC 


0.5555 


0.6792 


0.0126 


V3A/4 


TG 


0.2281 


0.1218 


0.0110 


V2A/4 


CC 


0.7359 


0.8365 


0.0074 


V2A/4 


CG 


0.2388 


0.1536 


0.0181 


V1A/4 


AC 


0.7372 


0.8365 


0.0088 


V1/V4 


AG 


0.2409 


0.1490 


0.0115 


KL+2A/4 


CC 


0.5826 


0.6791 


0.0297 


KL+2A/4 


AG 


0.1137 


0.0511 


0.0356 


ST+4/V3 


AT 


0.4809 


0.5922 


0.0217 


ST+4A/3 


CT 


0.3040 


0.2114 


0.0260 


S1A/3 


AT 


0.1056 


0.0481 


0.0271 


S2A/3 


CT 


0.2307 


0.1339 


0.0199 


S2A/3 


GT 


0.5541 


0.6669 


0.0196 


F+1A/2 


AC 


0.3349 


0.2474 


0.0471 


F+1A/2 


GC 


0.6396 


0.7428 


0.0206 


ST+4/V2 


AC 


0.4816 


0.5923 


0.0225 


ST+4A/2 


CC 


0.4930 


0.3977 


0.0482 


S1A/2 


AC 


0.1047 


0.0481 


0.0293 


S1/V2 


GC 


0.8699 


0.9421 


0.0100 


S2A/2 


CC 


0.2461 


0.1502 


0.0098 


S2/V2 


GC 


0.7286 


0.8399 


0.0028 


V-1A/2 


AC 


0.1100 


0.0521 


0.0265 


V-1/V2 


CC 


0.8643 


0.9375 


0.0089 


S2A/1 


CA 


0.2494 


0.1457 


0.0054 


S2/V1 


GA 


0.7286 


0.8399 


0.0032 


V-1/V1 


AA 


0.1131 


0.0481 


0.0127 


V-1/V1 


CA 


0.8643 


0.9375 


0.0080 


A-1/Q-1 


TC 


0.0044 


0.0337 


0.0204 


D-1/Q-1 


CC 


0.4861 


0.6205 


0.0077 


D-1/Q-1 


CT 


0.1258 


[ 0.0353 


0.0017 


D1/Q-1 


TT 


0.1393 


0.0769 


0.0437 


F+1/Q-1 


GC 


0.6387 


0.7449 


0.0142 


F+1/Q-1 


AT 


0.1393 


0.0769 


0.0410 


I1/Q-1 


GC 


0.7216 


0.8449 


0.0040 


I1/Q-1 


GT 


0.1153 


0.0477 


0.0175 


M+1/Q-1 


GC 


0.7300 


0.8413 


0.0021 


M+1/Q-1 


GT 


0.1393 


0.0769 


0.0334 


L-1/Q-1 


GC 


0.7325 


0.8405 


0.0033 


L-1/Q-1 


GT 


0.1393 


0.0769 


0.0314 


A-1/KL+2 


TC 


0.0045 


0.0337 


0.0210 


F+1/KL+2 


GC 


0.4735 


0.6026 


0.0137 


I1/KL+2 


GC 


0.5614 


0.6826 


0.0071 
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TABLE 21 (CON'T) 



Asthma Yes/No 
US Papulation 


SNP 

COMBINATION 


HAPLOTYPE 


FREQUENCIES 


P-VALUE 


CNTL 


CASE 


A-1/11 


AA 


0.1208 


0.2963 


0.0025 


A-1/11 


AG 


0.8337 


0.7037 


0.0387 


D-1/11 


GA 


0.1281 


0.2963 


0.0039 


D-2/11 


CA 


0.1275 


0.2963 


0.0049 


D-2/I1 


CG 


0.8658 


0.7037 


0.0065 


D1/H 


TA 


0.1284 


0.2963 


0.0037 


D1/I1 


TG 


0.8716 


0.7037 


0.0037 


F+1/11 


AA 


0.1228 


0.2963 


0.0048 


F1/I1 


AA 


0.0745 


0.2222 


0.0047 


F1/I1 


AG 


0.8736 


0.7037 


0.0026 


G-1/11 


TA 


0.1280 


0.2963 


0.0054 


A-1/M+1 


AT 


0.0773 


0.2222 


0.0072 


D-1/M+1 


GT 


0.0705 


0.2222 


0.0066 


D-2/M+1 


CG 


0.9138 


0.7778 


0.0134 


D-2/M+1 


CT 


0.0795 


0.2222 


0.0072 


D1/M+1 


TG 


0.9200 


0.7778 


0.0080 


D1/M+1 


TT 


0.0800 


0.2222 


0.0080 


F1/M+1 


AG 


0.8679 


0.7037 


0.0032 


F1/M+1 


AT 


0.0801 


0.2222 


0.0058 


G-1/M+1 


TT 


0.0720 


0.2222 


0.0052 


I1/M+1 


GG 


0.8654 


0.7037 


0.0065 


I1/M+1 


AT 


0.0729 


0.2222 


0.0047 


L-1/M+1 


GG 


0.9200 


0.7778 


0.0081 


L-1/M+1 


AT 


0.0800 


0.2222 


0.0064 


L-2/M+1 


GT 


0.0795 


0.2222 


0.0060 


L1/M+1 


CG 


0.9135 


0.7778 


0.0117 


L1/M+1 


CT 


0.0800 


0.2222 


0.0075 


KL+1/M+1 


GG 


0.8614 


0.7194 


0.0119 


KL+1/M+1 


GT 


0.0801 


0.2222 


0.0052 


KL+2/M+1 


CG 


0.6284 


0.4667 


0.0317 


KL+2/M+1 


CT 


0.0794 


0.2222 


0.0057 


A-1/L-1 


AA 


0.0773 


0.2222 


0.0045 


D-1/L-1 


GA 


0.0705 


0.2222 


0.0036 


D-2/L-1 


CA 


0.0795 


0.2222 


0.0051 


D-2/L-1 


CG 


0.9138 


0.7778 


0.0127 


D1/L-1 


TA 


0.0800 


0.2222 


0.0081 


D1/L-1 


TG 


0.9200 


0J77d 


0.0081 


F+1/L-1 


AA 


0.0742 


0.2222 


0.0059 
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F1/L-1 


AA 


0.0801 


0.2222 


0.0047 


F1/L-1 


AG 


0.8679 


0.7037 


0.0028 


G-1/L-1 


TA 


0.0720 


0.2222 


0.0034 


I1/L-1 


AA 


0.0729 


0.2222 


0.0053 


I1/L-1 


GG 


0.8654 


0.7037 


0.0057 


L-2/L-1 


GA 


0.0795 


0.2222 


0.0053 


KL+1/L-1 


GA 


0.0801 


0.2222 


0.0038 


KL+1/L-1 


GG 


0.8614 


0.7194 


0.0131 


KL+2/L-1 


CA 


0.0794 


0.2222 


0.0067 


KL+2/L-1 


CG 


0.6284 


0.4667 


0.0317 


I1/L-2 


AG 


0.1285 


0.2963 


0.0047 


I1/L-2 


GG 


0.8048 


0.6667 


0.0293 


I1/L1 


AC 


0.1284 


0.2963 


0.0050 


I1/L1 


GC 


0.8651 


0.7037 


0.0064 


L-1/L1 


AC 


0.0800 


0.2222 


0.0075 


L-1/L1 


GC 


0.9135 


0.7778 


0.0130 


I1/ST+4 


AA 


0.0925 


0.2963 


0.0004 


M+1/ST+4 


TA 


0.0797 


0.2222 


0.0057 


L-1/ST+4 


AA 


0.0797 


0.2222 


0.0072 


S+1/ST+4 


AA 


0.0944 


0.2795 


0.0014 


F1/ST+5 


GT 


0.0000 


0.0741 


0.0248 


M+1/ST+5 


TC 


0.0800 


0.2222 


0.0141 


L-1/ST+5 


AC 


0.0800 


0.2222 


0.0142 


ST+4/ST+5 


AC 


0.0784 


0.2483 


0.0023 


ST+4/ST+5 


CT 


0.0070 


0.0572 


0.0174 


H/ST+6 


AC 


0.1284 


0.2963 


0.0035 


I1/ST+6 


GC 


0.8586 


0.7037 


0.0067 


M+1/ST+6 


GC 


0.9070 


0.7778 


0.0132 


M+1/ST+6 


TC 


0.0800 


0.2222 


0.0067 


L-1/ST+6 


AC 


0.0800 


0.2222 


0.0059 


L-1/ST+6 


GC 


0.9070 


0.7778 


0.0121 


F+1/ST+7 


AG 


0.1008 


[_ 0.2593 


0.0066 


I1/ST+7 


AG 


0.0765 


0.2530 


0.0090 


M+1/ST+7 


TG 


0.0786 


0.2222 


0.0099 


L-1/ST+7 


AG 


0.0786 


0.2222 


0.0090 


M+1/S+1 


TA 


0.0801 


0.2222 


0.0127 


L-1/S+1 


AA 


0.0801 


0.2222 


0.0119 


S2/S+1 


CA 


0.0376 


0.2323 


0.0031 


I1/S1 


AG 


0.1271 


0.2871 


0.0044 


F1/S2 


AG 


0.7532 


0.6026 


0.0483 


I1/S2 


AC 


0.1302 


0.2561 


0.0307 


I1/S2 


GG 


0.7532 


0.6080 


0.0435 


M+1/S2 


TC 


0.0823 


0.2222 


0.0101 


Q-1/S2 


CC 


0.0779 


0.2222 


0.0072 


A-1/T+1 


AT 


0.0771 


0.2000 


0.0178 
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D-1/T+1 


GT 


0.0572 


0.2034 


0.0064 


D-2/T+1 


CC 


0.9136 


0.8000 


0.0254 


D-2/T+1 


CT 


0.0798 


0.2000 


0.0219 


D1/T+1 


TC 


0.9203 


0.8000 


0.0259 


D1/T+1 


TT 


0.0797 


0.2000 


0.0229 


I1/T+1 


GC 


0.8511 


0.6828 


0.0051 


I1/T+1 


AT 


0.0638 


0.2029 


0.0059 


M+1/T+1 


GC 


0.9078 


0.7588 


0.0080 


M+1/T+1 


TT 


0.0779 


0.2222 


0.0046 


L-1/T+1 


GC 


0.9078 


0.7588 


0.0092 


L-1/T+1 


AT 


0.0779 


0.2222 


0.0062 


L1/T+1 


CC 


0.9137 


0.8000 


0.0291 


L1/T+1 


CT 


0.0798 


0.2000 


0.0232 


T1AT+1 


TC 


0.9078 


0.7407 


0.0040 


T1/T+1 


CT 


0.0779 


0.2407 


0.0028 


T2/T+1 


CC 


0.9077 


0.7584 


0.0077 


T2/T+1 


TT 


0.0768 


0.2037 


0.0154 


I1/T+2 


AT 


0.1290 


0.2963 


0.0042 


M+1/T+2 


TT 


0.0802 


0.2198 


0.0080 


L-1/T+2 


AT 


0.0802 


0.2198 


0.0073 


T1/T+2 


CT 


0.0779 


0.2407 


0.0042 


T2/T+2 


TT 


0.0747 


0.1986 


0.0153 


A-1/T1 


AC 


0.0751 


0.2407 


0.0025 


A-1/T1 


AT 


0.8795 


0.7593 


0.0440 


D-1/T1 


GC 


0.0678 


0.2069 


0.0076 


D-2/T1 


CC 


0.0779 


0.2407 


0.0049 


D-2/T1 


CT 


0.9154 


0.7593 


0.0051 


D1/T1 


TC 


0.0779 


0.2407 


0.0052 


D1/T1 


TT 


0.9221 


0.7593 


0.0052 


F+1/T1 


AC 


0.0677 


0.2189 


0.0030 


F1/T1 


AC 


0.0779 


0.2407 


0.0040 


F1/T1 


AT 


0.8701 


0.6852 


0.0039 


G-1/T1 


TC 


0.0699 


0.2407 


0.0012 


I1/T1 


AC 


0.0710 


0.2210 


0.0054 


I1/T1 


GT 


0.8674 


0.6839 


0.0033 


M+1/T1 


TC 


0.0779 


0.2222 


0.0068 


M+1/T1 


GT 


0.9221 


0.7593 


0.0045 


L-1/T1 


AC 


0.0779 


0.2222 


0.0055 


L-1/T1 


GT 


0.9221 


0.7593 


0.0033 


L-2/T1 


GC 


0.0774 


0.2407 


0.0024 


L-2/T1 


GT 


0.8558 


0.7222 


0.0350 


L1/T1 


CC 


0.0779 


0.2407 


0.0039 


L1/T1 


CT 


0.9156 


0.7593 


0.0064 


ST+4/T1 


AC 


0.0779 


0.2407 


0.0032 


ST+5/T1 


CC 


0.0779 


0.2134 


0.0126 
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ST+6/T1 


CC 


0.0779 


0.2407 


0.0038 


ST+6/T1 


CT 


0.9091 


0.7593 


0.0059 


ST+7/T1 


GC 


0.0779 


0.2407 


0.0026 


S+1/T1 


AC 


0.0779 


0.2098 


0.0138 


S1/T1 


GC 


0.0779 


0.2222 


0.0071 


S2/T1 


CC 


0.0779 


0.2205 


0.0051 


Q-1/T1 


CC 


0.0779 


0.2407 


0.0041 


KL+1/T1 


GC 


0.0779 


0.2407 


0.0041 


KL+1/T1 


GT 


0.8636 


0.7009 


0.0072 


KL+2/T1 


CC 


0.0779 


0.2407 


0.0035 


KL+2/T1 


CT 


0.6299 


0.4478 


0.0201 


A-1/T2 


AT 


0.0713 


0.2037 


0.0099 


D-1/T2 


GT 


0.0629 


0.2037 


0.0059 


D-2/T2 


CC 


0.9190 


0.7963 


0.0225 


D-2/T2 


CT 


0.0743 


0.2037 


0.0133 


D1/T2 


TC 


0.9257 


0.7963 


0.0154 


D1/T2 


TT 


0.0743 


0.2037 


0.0154 


G-1/T2 


TT 


0.0657 


0.2037 


0.0079 


I1/T2 


GC 


0.8670 


0.7037 


0.0072 


I1/T2 


AT 


0.0680 


0.2037 


0.0093 


M+1/T2 


GC 


0.9221 


0.7778 


0.0046 


M+1/T2 


TT 


0.0779 


0.2037 


0.0112 


L-1/T2 


GC 


0.9221 


0.7778 


0.0053 


L-1/T2 


AT 


0.0779 


0.2037 


0.0127 


L1/T2 


CC 


0.9192 


0.7963 


0.0216 


L1/T2 


CT 


0.0743 


0.2037 


0.0128 


ST+6/T2 


CC 


0.9126 


0.7963 


0.0233 


ST+6/T2 


CT 


0.0744 


0.2037 


0.0109 


ST+7/T2 


GT 


0.0743 


0.2037 


0.0126 


T1/T2 


TC 


0.9221 


0.7593 


0.0027 


T1/T2 


CT 


0.0779 


0.2037 


0.0105 


KL+2/T2 


CC 


0.6343 


0.4855 


0.0424 


KL+2/T2 


CT 


0.0735 


0.2037 


0.0101 


11AM 


AC 


0.0791 


0.2412 


0.0093 


M+1A/-1 


TC 


0.0792 


0.2222 


0.0069 


L-1A/-1 


AC 


0.0792 


0.2222 


0.0095 


S2A/-1 


CC 


0.0779 


0.2222 


0.0078 


T1/V-1 


CC 


0.0779 


0.2407 


0.0034 


I1A/-2 


AC 


0.1272 


0.2963 


0.0020 


M+1A/-2 


TC 


0.0795 


0.2222 


0.0074 


L-1A/-2 


AC 


0.0795 


0.2222 


0.0084 


T1A/-2 


CC 


0.0779 


0.2407 


0.0025 


H/V-3 


AG 


0.1272 


0.2963 


0.0033 


M+1A/-3 


TG 


0.0795 


0.2222 


0.0058 


L-1A/-3 


AG 


0.0795 


0.2222 


0.0060 
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T1AA3 


CG 


0.0779 


0.2407 


0.0033 


I1A/-4 


AC 


0.1266 


0.2963 


0.0029 


I1A/-4 


GC 


0.6427 


0.4259 


0.0031 


M+1A/-4 


GC 


0.6902 


0.5000 


0.0057 


M+1A/-4 


TC 


0.0793 


0.2222 


0.0054 


L-W-4 


AC 


0.0793 


0.2222 


0.0058 


L-1/V-4 


GC 


0.6902 


0.5000 


0.0071 


T+1/V-4 


CC 


0.6995 


0.5159 


0.0132 


T+1A/-4 


TC 


0.0701 


0.2063 


0.0078 


T1AA4 


CC 


0.0779 


0.2407 


0.0028 


T1/V4 


TC 


0.6916 


0.4815 


0.0053 


T2/V-4 


CC 


0.6955 


0.5185 


0.0159 


T2/V-4 


TC 


0.0743 


0.2037 


0.0118 


MA/7 


AG 


0.1341 


0.2544 


0.0489 


M+1/V7 


TG 


0.0848 


0.2222 


0.0106 


L-1/V7 


AG 


0.0848 


0.2222 


0.0104 


T1/V7 


CG 


0.0779 


0.2198 


0.0068 


V4A/7 


GC 


0.1042 


0.0000 


0.0223 


M+1A/6 


TT 


0.0000 


0.0492 


0.0328 


L-1/V6 


AT 


0.0000 


0.0492 


0.0321 


ST+4A/6 


AT 


0.0000 


0.0668 


0.0303 


ST+7A/6 


GT 


0.0000 


0.0600 


0.0158 


T1/V6 


CC 


0.0779 


0.1883 


0.0237 


T1A/6 


CT 


0.0000 


0.0525 


0.0210 


T2/V6 


TT 


0.0000 


0.0501 


0.0274 


HA/5 


AA 


0.0897 


0.2765 


0.0010 


I1/V5 


GA 


0.8579 


0.7037 


0.0095 


M+1A/5 


TA 


0.0801 


0.2222 


0.0073 


L-1/V5 


AA 


0.0801 


0.2222 


0.0068 


T1/V5 


CA 


0.0779 


0.2407 


0.0043 


T1/V5 


TA 


0.8686 


0.7398 


0.0248 


T2/V5 


TA 


0.0745 


0.2037 


0.0112 


F+1/V4 


GG 


0.1061 


0.0000 


0.0208 


I1/V4 


AG 


0.0000 


0.0645 


0.0242 


M+1/V4 


TC 


0.0786 


0.2222 


0.0090 


L-1A/4 


AC 


0.0786 


0.2222 


0.0096 


L-2/V4 


AG 


0.0000 


0.0370 


0.0297 


ST+7/V4 


AC 


0.1396 


0.0188 


0.0120 


ST+7A/4 


GC 


0.6526 


0.8145 


0.0190 


ST+7/V4 


GG 


0.1049 


0.0188 


0.0362 


T1A/4 


CC 


0.0779 


0.2407 


0.0034 


V-1/V4 


AC 


0.0728 


0.0000 


0.0279 


V1/V4 


TC 


0.0594 


0.0000 


0.0483 


Q-1A/4 


TC 


0.0715 


0.0000 


0.0322 


11/V3 


AT 


0.0903 


0.2963 


0.0009 
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I1/V3 < 


3T 


0.6759 


0.4259I 


0.0016 


M+1/V3 


GT 


0.6863 


0.5000 


0.0068 


M+1/V3 


TT 


0.0800 


0.2222 


0.0053 


L-1/V3 


AT 


0.0800 


0.2222 


0.0048 


L-1A/3 


GT 


0.6863 


0.5000 


0.0033 


T+1A/3 


CT 


0.6939 


0.5159 


0.0067 


T+1/V3 


TT 


0.0723 


0.2063 


0.0087 


T1/V3 


CT 


0.0779 


0.2407 


0.0036 


T1A/3 


TT 


0.6883 


0.4815 


0.0023 


T2A/3 


CT 


0.6923 


0.5185 


0.0132 


T2/V3 


TT 


0.0739 


0.2037 


0.0097 


I1/V2 


AC 


0.0752 


0.2593 


0.0009 


I1/V2 


GC 


0.8664 


0.7037 


0.0065 


M+1/V2 


GC 


0.8614 


0.7407 


0.0391 


M+1A/2 


TC 


0.0801 


0.2222 


0.0078 


L-1/V2 


AC 


0.0801 


0.2222 


0.0069 


L-1/V2 


GC 1 


0.8614 


0.7407 


0.0390 


T1/V2 


CC 


0.0779 


0.2407 


0.0056 


T1/V2 


TC 


0.8636 


0.7222 


0.0252 


I1/V1 


AA 


0,0750 


0.2593 


0.0006 


I1/V1 


GA 


0.8648 


0.7037 


0.0075 


M+1/V1 


GA 


0.8591 


0.7407 


0.0457 


M+1/V1 


TA 


0.0800 


0.2222 


0.0070 


L-1/V1 


AA 


0.0800 


0.2222 


0.0090 


L-1/V1 


GA 


0.8591 


0.7407 


0.0486 


T1A/1 


CA 


0.0779 


0.2407 


0.0042 


T1/V1 


TA 


0.861 1 


0.7222 


0.0223 


I1/Q-1 


AC 


0.0788 


0.2412 


0.0094 


M+1/Q-1 


TC 


0.0792 


0.2222 


0.0063 


L-1/Q-1 


AC 


0.0792 


0.2222 


0.0072 


I1/KL+1 


AG 


0.0752 


! 0.237C 


0.0024 


I1/KL+1 


GG 


0.8664 


[ 0.7037 


0.0060 


I1/KL+2 


AC 


0.1 26S 


) 0.2962 


0.0032 


I1/KL+2 


GC 


0.580E 


) 0.3994 


0.0091 
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TABLE 22 



. ; r- 

Haplotypes for 15-SNP Combination 

D 1 /F1 /1 1 /L1 /S 1 /S2/T1 /T2/V1 A/2A/3A/4/V5/V6/V7 


rrec] oumuui 


Proq Gels© 
(Asthma) 


3 val-2sided 


uornDineu uo cx ur\ 








p a p, p p ptp a ptp ap n 


0.0000 


0.0038 


0.379 


TAAPAPTPAPTGACG 


0.0016 


0.0000 


0.759 


t a apgppp aptpapg 

1 MMO Uuu OM o i ^i~vwvi« 


0.0075 


0.0076 


0.968 




0.0023 


0.0000 


0.731 


T A APGPPTAPPPAOG 


0.0000 


0.0000 


0.542 


xa apppptaptpapg 

1 MnwW^u i MO l omovj 


0.0830 


0.0800 


0.915 


r TAAPGPPTAGTCATG 


0.0049 


0.0192 


0.130 


TAAf^rrTAfTRAnfi 

1 MMOOOO l MO 1 unuvj 


0.0082 


0.0000 


0.246 


ta Ar^fTrAnnnAnn 


0.0023 


0.0046 


0.642 


taappptpappgaoc 


0.0000 


0.0069 


0-048 


t a appptpttpggpg 

1 AnUUw lull 0^v3 0VJ 


0.0046 


0.0000 


0.518 


x a a p p ptp a ppp A nfi 

I AAOwU I OMOOOMOU 


0 0000 


0.0042 


0.055 


x a a p p ptp a ptp a pp 


0.0032 


0.0000 


0.765 


t a p p a nr a ptp a pp 

1 Mu OMO I vn^ 1 O/AO w 


0.0023 


0.0000 


0.723 


' TAPPAPTPAPTPAPG 

1 MUUnv 1 OMO 1 Unwva 


0.0026 


0.0040 


0.984 


r TAPPAPTPAPTPAPG 


0.0971 


0.0532 


0.046 


T TAPPPPPTAPTPAPG 
1 MOOOOO 1 MO I w/-\v-»vj 


0.0025 


0.0000 


0.657 


TAPPPPPTAPTPATG 

1 MOOOOO I MO i i 


0.0022 


0.0000 


0.864 


TAPPPPTPAPTCACG 

1 AuvUu 1 OMO 1 uriwu 


0.0029 


0.0000 


0.353 


u TAPPPPTPAPTPATG 
1 MUOVJlO i OMO 1 OM 1 \J 


0.0002 


0.0000 


0.842 


TAPPPPTPAPTGAPP 

1 MOOOO 1 OMO 1 «MOO 


0.0029 


0.0000 


0.501 


T A PPPPTP APTGAPG 


0.0009 


0.0000 


0.972 


TAPPPPTPAPTPATG 

I AOUUO 1 OMO 1 On 1 


0.0000 


0.0000 


0.383 


~r A PPPPTPTTPPPPG 
1 AvlvOu lull UUUUU 


0.0023 


0.0000 


1.000 


TAPPPPPP APTPAPG 

1 MOOv3v-lOOMO 1 UAW« 


0.0000 


0.0038 


0.377 


T AP PPP PPAPTP ATP 

I AubUOuUMw 1 OM 1 O 


0.0000 


0.0038 


0.401 


TAPPPPPTAPTPATG 

I AuOUUw 1 MO 1 OA 1 


0.0023 


0.0000 


0.998 


T APP^PTP APPPAPP 
1 Aw Ova O l OMOOOMOO 


0.1762 


0.1534 


0.473 


TAPPGGTPAPPPATG 


0.0000 


0.0079 


0.051 


TAPPPPTPAPPGACG 


0.0065 


0.0073 


0.898 


TAGPGGTPACTCACC 

1 MVJOVJIVa 1 OMO 1 unvv 


0.3648 


0.4558 


0.025 


TAG PGGTO ACTC ACG 

I Mv3IOV3i\JI 1 OMO I wnwtJ 


0.0024 


0.0079 


0.527 


TAGCGGTCACTCATG 


0.0707 


0.0645 


0.771 


TAGCGGTCACTCGCC 


0.0023 


0.0000 


0.462 


TAGCGGTCACTGACC 


0.0993 


0.0777 


0.393 


TAGCGGTCACTGACG 


0.0027 


O.OOOC 


0.918 


TAGTGGTCACCCACC 


O.OOOC 


0.0038 


0.133 


TAGTG GTC ACTC ACC 


0.0062 


t O.OOOC 


0.451 


TAGTGGTCACTGACC 


; o.oooe 


> O.OOOC 


0.838 


TGACGCCTACTCACG 


O.OOOC 


) 0.0036 


I 0.389 


TGACGCTCTTCCACG 


i 0.0032 


! O.OOOC 


) 0.399 


TGACGCTCTTCCGCC 


; o.oo2e 


> O.OOOC 


) 0.681 


TGACGCTCTTCCGCG 


5 0.016S 


) O.OOOC 


) 0.068 
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TGACGCTCTTCCGTG 


0.0059 


0.0000 


0.369 


TG ACG CTCTTCG ACG 


0.0014 


0.0000 


0.956 


TGACGCTCTTCGGCG 


0.0023 


0.0191 


0.026 


TGACGGTCACCCACC 


0.0000 


0.0076 


0.127 


















Overall Test 






0.052 



TABLE 22 (CONT) 



Haplotypes for 15-SNP Combination 

0 1/F1 /1 1 /L1 /S1 /S2/T1/T2/V1 /V2/V3/V4/V5A/6/V7 


= req_Control 


Freq_Case 
Asthma 


Pval-2sided 


UK 








CAGCGGTCACTCACC 


0.00000 


0.00481 


0.439 


T AAC ACT C ACTG AC G 


0.00220 


0.00000 


0.856 


TAACGCCCACTCACG 


0.01154 


0.00481 


0.442 


TAACGCCCACTGACC 


0.00357 


0.00000 


0.999 


TAACGCCTACCCACG 


0.00000 


0.01035 


0.352 


TAACGCCTACTCACG 


0.08954 


0.04715 


0.146 


TAACG CCTACTCATG 


0.00709 


0.01461 


0.519 


TAACGCCTACTGACG 


0.01321 


0.00000 


0.162 


TAACGCTCACCCACC 


0.00357 


0.00576 


0.635 


TAACGCTCACCGACC 


0.00000 


0.00867 


0.060 


TAACG CTCTTCG GCG 


0.00714 


0.00000 


0.512 


TAACGGTCACCCACC 


0.00000 


0.00000 


0.642 


TAACG GTC ACCCACG 


0.00000 


0.00522 


0.072 


TAACGGTCACTCACC 


0.00505 


0.00000 


0.603 


TAGCACTCACTCACG 


0.00412 


0.00511 


0.974 


TAGCACTCACTGACG 


0.09716 


0.04297 


0.033 


TAGCGCCTACTCATG 


0.00362 


0.00000 


0.834 


TAG CGCTCACTGACC 


0.00439 


0.00000 


0.554 


TAGCGCTCACTGACG 


0.00285 


0.00000 


0.815 


TAGCGCTCACTGATG 


0.00001 


0.00000 


0.487 


TAGCGGCCACTCACG 


0.00000 


0.00481 


0.422 


TAGCGGCTACTGATG 


0.00357 


0.00000 


1.000 


TAGCGGTCACCCACC 


0.18077 


0.12795 


0.168 


TAGCGGTCACCCATG 


0.00000 


0.00527 


0.271 


TAG CGGTCACCG ACC 


0.00000 


0.00497 


0.154 


TAGCGGTCACCGACG 


0.00617 


0.00910 


0.798 


TAGCGGTCACTCACC 


0.35686 


0.50471 


0.003 


TAG CG GT C ACTC ACG 


O.OOOOC 


0.01009 


0.050 


TAG CG GTC ACTC ATG 


0.06867 


0.07627 


0.765 


TAGCG GTC ACTG ACC 


0.0957S 


0.0833S 


0.674 


TAG CG GTC ACTG ACG 


0.00454 


[ O.OOOOC 


) 0.895 


TAGTG GTC ACCC ACC 


O.OOOOC 


) 0.00481 


0.216 


TAGTGGTCACTCACC 


0.00486 


J O.OOOOC 


) 0.637 


TAGTGGTCACTGACC 


j_ 0.00226 


> O.OOOOC 


) 0.731 


TGACGCCTACTCACG 


O.OOOOC 


) 0.00481 


0.426 
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TGACGCTCTTCCGCC 


0.00357 


0.00000 


1.000 


TGACGCTCTTCCGCG 


0.01429 


0.00000 


0.183 


TGACGCTCTTCGGCG 


0.00357 


0.01442 


0.259 


















Overall Test 






0.061 



TABLE 22 (CON'T) 



ui*ai-ki/-v+\/r\cio irsr 1 c.QMP finmhi nation 
rlapiOTypeS TOI lO'OiNr lyuiiiDiiiauuii 

D1/F1 /I1/L1 /S1/S2rt"1 /T2/V1/V2/V3A/4/V5A/6/V7 


"rea Control 


Freq_Case 
Asthma 


Pva!-2sided 


uo 










0.00000 


0.01852 


0.239 


t a a n fi n rrr Acrrc ACG 


0.07143 


0.14815 


0.113 


TAAPfiHPTACTCATG 


0.00000 


0.03704 


0-025 


t a a pr ncrr a ctg acg 


0.00000 


0.01852 


0.058 


T A A OCiCnTG A ccg acg 


0.00003 


0.00000 


0.858 


TAAn^^TnTncGACG 


0.00003 


0.00000 


0.858 


TACSHACTCACTCACC 1 


0.00653 


0.00000 


0.351 


T Art C ACTC ACTG ACG 


0.09737 


0.09259 


0.986 


TA CGC CTACTC ACG 


0.00649 


0.00000 


0.999 


TAGCGCTCACTCACG 


0.00539 


0.00000 


0.466 


TAGCGCTCACTCATG 


0.00052 


0.00000 


0.836 


TAGCGCTCACTCGCG 


0.00035 


0.00000 


0.527 


TAGCGCTCACTCGTG 


0.00024 


0.00000 


0.568 


TAGCGCTCTTCCGCG 


0.00649 


0.00000 


0.991 


TAG CGG CC ACT CATC 


0.00000 


0.01852 


0.257 


TAG CGGTCACCC ACC 


0.16880 


0.20370 


0.612 


TAGCGGTCACCGACG 


0.00643 


0.00000 


0.148 


TAGCGGTCACTCACC 


0.38579 


0.35185 


0.690 


TAGCGGTCACTCACG 


0.00658 


0.00000 


0.788 


TAG CG GTCACTC ATG 


0.07254 


0.01852 


0.197 


TAGCGGTCACTCGCC 


0.00682 


0.00000 


0.517 


TAGCGGTCACTGACC 


0.09970 


0.00000 


0.044 


TAG CGGTC ACTG ATG 


0.00000 


0.01852 


0.080 


TAGCGGTCTCCGACG 


0.00003 


0.00000 


0.858 


TAGTGGTCACTCACC 


0.00649 


0.00000 


1.000 


TGACGCTCTTCCACG 


0.00889 


0.00000 


0.285 


TGACGCTCTTCCGCG 


0.02049 


0.00000 


0.510 


TGACGCTCTTCCGTG 


0.01837 


0.00000 


0.478 


TGACGCTCTTCGACG 


0.0042C 


0,00000 


0.839 


TGACGCTCTTCGGCG 


O.OOOOC 


> 0.03704 


0.022 


TG ACG GTC ACCC ACC 


0.0000C 


) 0.03704 


0.067 


















Overall Test 






0.011 
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0.9927| 


0.5997| 


0.9456I 


0.334| 


0.9186| 


0.9644 


0.9985| 


0.8966| 


0.7074| 


0.6919| 


0.9002| 






















































88.9% I 


89.1%| 


















































































L-2 , 


0.8919 


0.8051 


0.8363 


0.215 


0.8079 


0.7771 1 


0.7582 


0.813 


0.6992 


0.5272 


0.5661 






















































7.1%| 


CD 
CO 


KL+2 


0.67121 


0.746! 


0.7187| 


0.1686| 


0.8173! 


0.8423) 


0.7226! 


0.7252! 


0.6914! 


0.4343[ 
























































71 .3% | 


75.0%| 


KL+1 


0.8188 


0.7512 


0.7977 


0.144 


0.5252 


0.7876 


0.8013 


0.8084 


0.5874 


























































96.1%| 


97.6% | 




0.9431! 


0.8645 


0.7015! 


0.2155! 


0.8418 


0.7429| 


0.8821 1 


0.6709; 




























































84.9% | 


86.7% | 




0.9928| 


0.9928| 


0.8715( 


0.3307[ 


0.9568| 


0.7017 
































































9.3% | 


9.5% | 


6 


















































































0.7964 


0.901 1 


0.3054 


0.281 [ 


0.8773 [ 


0.8234 
































































65.2%| 


CO 
CD 


F+1 
















































































Ll 


0.9539| 


0.9034 ; 


0.8483| 


0.2716! 


0.7752 ' 


































































96.8%| 


97.6%| 




0.2876! 


0.2165 


0.2925| 


0.2294: 




































































0.0%| 


0.8% | 


Q 


















































































0.9727' 


0.9567 


0.91491 






































































37.6%| 


CO 


6 
















































































CM 

6 


0.9529 j 










































































0.7% I 


0.8% | 


S 












































































97.6% I 








CM 

6 


5 


5 




+ 

LL 


G-1 I 




KL+1 I 


KL+2 


CM 
—J 


_J 


Ij 


+ 
2 


Q-1 I 


CO 


CM 
CO 


S+1 I 


ST+4 | 


ST+5 


ST+6 S 


N- 

it 

CO 




pi 


T1 | 


T + 2 | 


3 


CO 

> 


V-2 | 


V-1 | 


> 




2 


5 


S 




5 


CNTL ! 


CASE 



CMBftlmnP Purrr to u r> mm*. 



HI 
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1— 


0.9799| 


0.9723| 


0.9545| 


0.3049I 


0.91381 


0.9835I 


0.88371 


0.7182| 


0.6974 


0.8954 
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0.0067 


0.0147 


L-1/V-1 


GA 


0.1357 


0.0533 


0.0291 


L-W-1 


GC 


0.7361 


0.8856 


0.0018 


ST+4A/-1 


AA 


0.0951 


0.0211 


0.0449 


ST+4A/-1 


AC 


0.3881 


0.5506 


0.0076 


ST+5A/-1 


CA 


0.0231 


0.0600 


0.0463 


ST+5A/-1 


TA 


0.1126 


0.0000 


0.0017 


ST+5A/-1 


TC 


0.3312 


0.4904 


0.0069 


ST+6AM 


CA 


0.1357 


0.0600 


0.0497 


ST+6A/-1 


CC 


0.8643 


0.9400 


0.0497 


S2A/-1 


CA 


0.1357 


0.0600 


0.0467 


S2A/-1 


GC 


0.7286 


0.8700 


0.0035 


T+1/V-1 


CA 


0.1357 


0.0543 


0.0314 


T+1A/-1 


CC 


0.7313 


0.8713 


0.0058 


T+2/V-1 


GA 


0.0000 


0.0251 


0.0011 


T+2A/-1 


TA 


0.1357 


0.0349 


0.0068 


T1/V-1 


CA 


0.0000 


0.0061 


0.0492 


T1/V-1 


TA 


0.1357 


0.0539 


0.0285 


T1AM 


TC 


0.7321 


0.8761 


0.0039 


T2AM 


CA 


0.1357 


0.0526 


0.031 1 


T2A/-1 


TA 


0.0000 


0.0074 


0.0275 


T2A/-1 


CC 


0.7595 


0.8955 


0.0063 


ST+4/V-2 


CC 


0.1352 


0.0203 


0.0015 


ST+7A/-2 


AC 


0.1518 


0.0371 


0.0080 


S2A/-2 


CC 


0.2602 


0.1116 


0.0027 


S2A/-2 


GC 


0.3611 


0.4784 


0.0472 


KL+2A/-2 


AC 


0.1372 


0.0251 


0.0047 


ST+4/V-3 


CG 


0.1352 


0.0203 


0.0014 


ST+7/V-3 


AG 


0.1448 


0.0374 


0.0115 


S2A/-3 


CG 


0.2656 


0.1090 


0.0019 


S2A/-3 


GG 


0.3552 


0.4749 


0.0452 


KL+2A/-3 


AG 


0.1334 


0.0254 


0.0081 


S2/V-4 


CC 


0.2635 


0.1300 


0.0095 


S2/V-4 


GC 


0.4852 


0.6039 


0.0488 


ST+4A/7 


AC 


0.2894 


0.4587 


0.0043 
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S2/V7 


CG 


0.2557 


0.1186 


0.0049 


V-2A/7 


CG 


0.2615 


0.1221 


0.01 01 


V-3/V7 


GG 


0.2625 


0.1152 


0.0087 


S2A/6 


CC 


0.2630 


0.1230 


0.0042 


S2/V6 


GC 


0.6543 


0.7870 


0.0174 


S2/V5 


CA 


0.2429 


0.1100 


0.0063 


S2A/5 


GA 


0.7286 


0.8700 


0.0050 


S2/V4 


GC 


0.6128 


0.7546 


0.0106 


S2A/2 


CC 


0.2461 


0.1100 


0.0031 


S2A/2 


GC 


0.7286 


0.8700 


0.0045 


S2A/1 


CA 


0.2494 


0.1100 


0.0044 


S2A/1 


GA 


0.7286 


0.8700 


0.0062 


D-1/Q-1 


CC 


0.4861 


0.6165 


0.0427 


D-1/Q-1 


CT 


0.1258 


0.0000 


0.0011 


D-1/Q-1 


GT 


0.0135 


0.0700 


0.0218 


I1/Q-1 


AC 


0.1391 


0.0519 


0.0430 


I1/Q-1 


GC 


0.7216 


0.8781 


0.0043 


H/Q-1 


GT 


0.1153 


0.0319 


0.0223 


M+1/Q-1 


GC 


0.7300 


0.8761 


0.0023 


M+1/Q-1 


TT 


0.0000 


0.0061 


0.0455 


L-1/Q-1 


AC 


0.1282 


0.0552 


0.0496 


L-1/Q-1 


GC 


0.7325 


0.8748 


0.0025 


L-1/Q-1 


AT 


0.0000 


0.0060 


0.0162 


L-1/Q-1 


GT 


0.1393 


0.0640 


0.0491 


TABLE 26fCON'T) 


BHR 

US Population 


SNP 

COMBINATION 


HAPLOTYPE 


FREQUENCIES 


P-VALUE 


CNTL 


CASE 


A-1/M+1 


AT 


0.0773 


0.2500 


0.0082 


D-1/M+1 


GT 


0.0705 


0.2500 


0.0079 


D-2/M+1 


CG 


0.9138 


0.7500 


0.0179 


D-2/M+1 


CT 


0.0795 


0.2500 


0.0133 


D1/M+1 


TG 


0.9200 


0.7500 


0.0144 


D1/M+1 


TT 


0.080C 


0.2500 


0.0144 


L-1/M+1 


GG 


0.9200 


0.7500 


0.0129 


L-1/M+1 


AT 


0.080C 


0.2500 


0.0124 


KL+2/M+1 


CG 


0.6284 


0.3571 


0.0045 


KL+2/M+1 


CT 


0.0794 


0.2500 


0.0041 


A-1/L-1 


AA 


0.0773 


0.2500 


0.0111 


D-1/L-1 


GA 


0.0705 


0.2500 


0.0065 


D-2/L-1 


CA 


0.0795 


0.2500 


0.0113 


D-2/L-1 


CG 


0.9138 


0.7500 


0.0163 
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D1/L-1 


TA 


0.0800 


0.2500 


0.0154 


D1/L-1 


TG 


0.9200 


0.7500 


0.0154 


KL+2/L-1 


CA 


0.0794 


0.2500 


0.0030 


KL+2/L-1 


CG 


0.6284 


0.3571 


0.0047 


L-1/L1 


AC 


0.0800 


0.2500 


0.0115 


L-1/L1 


GC 


0.9135 


0.7500 


0.0122 


M+1/ST+7 


GG 


0.6794 


0.4643 


0.0205 


M+1/ST+7 


TG 


0.0786 


0.2500 


0.0054 


L-1/ST+7 


AG 


0.0786 


0.2500 


0.0051 


L-1/ST+7 


GG 


0.6794 


0.4643 


0.0200 


S2/S+1 


CA 


0.0376 


0.2754 


0.0062 


A-1/S2 


AC 


0.2405 


0.4643 


0.0250 


D1/S2 


TC 


0.2468 


0.4643 


0.0276 


D1/S2 


TG 


0.7532 


0.5357 


0.0276 


F1/S2 


AC 


0.1948 


0.4286 


0.0111 


F1/S2 


AG 


0.7532 


0.5357 


0.0246 


Q-1/S2 


CC 


0.0779 


0.2500 


0.0107 


Q-1/S2 


CG 


0.7532 


0.5357 


0.0203 


T1H+1 


TC 


0.9078 


0.7143 


0.0116 


T1/T+1 


CT 


0.0779 


0.2857 


0.0039 


KL+2/T+1 


CC 


0.6356 


0.3643 


0.0041 


KL+2/T+1 


CT 


0.0722 


0.2429 


0.0106 


T1/T+2 


CT 


0.0779 


0.2857 


0.0029 


A-1/T1 


AC 


0.0751 


0.2857 


0.0037 


A-1/T1 


AT 


0.8795 


0.7143 


0.0360 


D-1/T1 


GC 


0.0678 


0.2398 


0.0069 


D-2/T1 


CC 


0.0779 


0.2857 


0.0066 


D-2/T1 


CT 


0.9154 


0.7143 


0.0084 


D1/T1 


TC 


0.0779 


0.2857 


0.0058 


D1yT1 


TT 


0.9221 


0.7143 


0.0058 


F+1/T1 


AC 


0.0677 


0.2420 


0.0077 


F+1/T1 


GT 


0.6648 


0.4206 


0.0251 


F1/T1 


AC 


0.0779 


0.2857 


0.0059 


F1/T1 


AT 


0.8701 


0.6786 


0.0151 


G-1/T1 


TC 


0.0699 


0.2857 


0.0020 


G-1/T1 


TT 


0.8479 


0.6429 


0.0120 


I1/T1 


AC 


0.0710 


0.2489 


0.0090 


I1/T1 


GT 


0.8674 


0.6774 


0.0166 


M+1/T1 


TC 


0.0779 


0.2500 


0.0126 


M+1/T1 


GT 


0.9221 


0.7143 


0.0059 


L-1/T1 


AC 


0.0779 


0.2500 


0.0099 


L-1/T1 


GT 


0.9221 


0.7143 


0.0040 


L-2/T1 


GC 


0.0774 


0.2857 


0.0023 


L-2/T1 


GT 


0.8558 


0.6429 


0.0067 


L1/T1 


CC 


0.0779 


0.2857 


0.0061 
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L1/T1 


CT 


0.9156 


0.7143 


0.0080 


ST+4/T1 


AC 


0.0779 


0.2857 


0.0039 


ST+5/T1 


CC 


0.0779 


0.2242 


0.0125 


ST+6/T1 


CC 


0.0779 


0.2857 


0.0052 


ST+6/T1 


CT 


0.9091 


0.7143 


0.0106 


ST+7/T1 


GC 


0.0779 


[ 0.2857 


0.0031 


ST+7/T1 


GT 


0.6801 


0.4286 


0.0139 


S+1/T1 


AC 


0.0779 


0.2235 


0.0133 


S1/T1 


GC 


0.0779 


0.2857 


0.0036 


S1/T1 


GT 


0.8182 


0.5556 


0.0032 


S2/T1 


CC 


0.0779 


0.2451 


0.0116 


S2/T1 


GT 


0.7532 


0.4951 


0.0070 


Q-1/T1 


CC 


0.0779 


0.2857 


0.0041 


Q-1/T1 


CT 


0.7532 


0.5000 


0.0075 


KL+1/T1 


GC 


0.0779 


0.2857 


0.0055 


KL+1/T1 


GT 


0.8636 


0.6786 


0.0213 


KL+2/T1 


CC 


0.0779 


0.2857 


0.0039 


KL+2/T1 


CT 


0.6299 


0.3214 


0.0022 


M+1/T2 


GC 


0.9221 


0.7500 


0.0051 


M+1/T2 


TT 


0.0779 


0.2143 


0.0277 


L-1/T2 


GC 


0.9221 


0.7500 


0.0055 


L-1/T2 


AT 


0.0779 


0.2143 


0.0330 


T1/T2 


CC 


0.0000 


0.0714 


0.0213 


T1/T2 


TC 


0.9221 


0.7143 


0.0025 


T1/T2 


CT 


0.0779 


0.2143 


0.0310 


KL+2/T2 


CC 


0.6343 


0.3929 


0.0091 


KL+2/T2 


CT 


0.0735 


0.2143 


0.0135 


S2/V-1 


CC 


0.0779 


0.2500 


0.0109 


S2/V-1 


GC 


0.7532 


0.5357 


0.0171 


T1A/-1 


CC 


0.0779 


0.2857 


0.0055 


T1/V-1 


TC 


0.7508 


0.5000 


0.0126 


T1A/-2 


CC 


0.0779 


0.2857 


0.0024 


T1/V-3 


CG 


0.0779 


0.2857 


0.0036 


M+1A/-4 


GC 


0.6902 


0.4643 


0.0160 


M+1A/-4 


TC 


0.0793 


0.2500 


0.0032 


L-1A/-4 


AC 


0.0793 


0.2500 


0.0045 


L-1A/-4 


GC 


0.6902 


0.4643 


0.0148 


S2A/-4 


CC 


0.2468 


0.4643 


0.0194 


S2A/-4 


GC 


0.5240 


0.2500 


0.0039 


T1A/-4 


CC 


0.0779 


0.2857 


0.0036 


T1/V-4 j 


TC 


0.6916 


0.4286 


0.0080 


A-1/V7 


AG 


0.3226 


0.5357 


0.0459 


D1A/7 


TC 


0.6776 


0.4643 


0.0392 


D1A/7 


TG 


0.3224 


0.5357 


0.0392 


F+W7 


AG 


0.3020 


0.5357 


0.0102 
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ST+6A/7 


CC 


0.6777 


0.4643 


0.0374 


ST+6A/7 


CG 


0.3093 


0.5357 


0.0337 


T1/V7 


TC 


0.6784 


0.4206 


0.0164 


T1A/7 


CG 


0.0779 


0.2420 


0.0120 


V-4A/7 


CC 


0.4984 


0.2192 


0.0032 


V-4A/7 


CG 


0.2716 


0.4951 


0.0214 


V5A/7 


AG 


0.2839 


0.5357 


0.0146 


T1A/6 


CC 


0.0779 


0.2500 


0.0092 


T1A/6 


TC 


0.8279 


0.6429 


0.0326 


F+1/V5 


AA 


0.2817 


0.5357 


0.0319 


I1A/5 


AA 


0.0897 


0.2857 


0.0064 


M+W5 


TA 


0.0801 


0.2500 


0.0117 


L-1/V5 


AA 


0.0801 


0.2500 


0.0107 


S2A/5 


CA 


0.2021 


0.4643 


0.0046 


S2A/5 


GA 


0.7446 


0.5357 


0.0378 


T+1A/5 


TA 


0.0790 


0.2308 


0.0212 


T1A/5 


CA 


0.0779 


0.2857 


0.0058 


T1/V5 


TA 


0.8686 


0.7143 


0.0344 


T2A/5 


TA 


0.0745 


0.2143 


0.0260 


F+1/V4 


AG 


0.1017 


0.2500 


0.0463 


M+W4 


GC 


0.7136 


0.5000 


0.0164 


M+1A/4 


TC 


0.0786 


0.2500 


0.0061 


T1/V4 


CC 


0.0779 


0.2857 


0.0041 


T1A/4 


TC 


0.7143 


0.4643 


0.0125 


T1A/3 


CT 


0.0779 


0.2857 


0.0056 


T1A/3 


TT 


0.6883 


0.4643 


0.0159 


S2A/2 


CC 


0.1883 


0.4286 


0.0082 


S2/V2 


GC 


0.7532 


0.5357 


0.0259 


T1A/2 


CC 


0.0779 


0.2857 


0.0048 


T1A/2 


TC 


0.8636 


0.6786 


0.0219 


S2A/1 


CA 


0.1851 


0.4286 


0.0088 


S2A/1 


GA 


0.7532 


0.5357 


0.0245 


T1/V1 


CA 


0.0779 


0.2857 


0.0066 


T1A/1 


TA 


0.8611 


0.6786 


0.0242 


I1/KL+2 


AC 


0.1269 


0.2857 


0.0377 


I1/KL+2 


GC 


0.5809 


0.3214 


0.0058 



TABLE 27 



Haplotypes for 15-SNP Combination 

D1/F1/H/L1/S1/S2/T1/T2A/1A/2A/3A/4A/5A/6A/7 


Freq_Control 


Freq Case 
BHR 


Pval-2sided 


Combined US & UK 








CAGCGGTCACTCACC 


0.0000 


0.0078 


0.22 


TAACACTCACTGACG 


0.0016 


0.0000 


0.84 


TAACGCCCACTCACG 


0.0075 


0.0156 


0.62 
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TAAC GGCGACTG AUG 


0.0023 


o.oouo 


0.86 


T A A AAA AT A ATA A aa 

TAACGCCTACTCACG 


0.0830 


0.0859 


0.89 


ta a a i^A a ata a 

TAAC GC CTACTCATG 


0.0049 


0.0000 


0.46 


TAACGCOTACTGACG 


0.0082 


0.0000 


0.24 


"I- A A A/~vATA A AAA A AA 

TAACGCTCACCCACC 


A a a*"io 

0.0023 


0.0000 


1 .00 


TAACGCTCACCGACC 


0.0000 


0.0078 


0.14 


TAAC GCTCTTCG G C G 


A A A /I A 

0.0046 


A AAA A 

0.0000 


0.75 


TAAC G GTC ACTC ACC 


0.0032 


0.0000 


0.32 


TAGCACTCACTCACC 


0.0023 


0.0000 


0.73 


TAGCACTCACTCACG 


A A A A A 

0.0026 


a AAA A 

0.0000 


0.29 


TAG C ACTC ACTG AC G 


0.0971 


0.0703 


0.37 


TAGCGCCTACTCACG 


0.0025 


A A A A A 

0.0000 


0.32 


TAGCGCCTACTCATG 


A A A A A 

0.0022 


A AAA A 

0.0000 


0.82 


TAGCGCTCACTCACG 


0.0029 


0.0000 


0.27 


T A A,A AT A^ a at a a t/~» 

TA G C G CT C ACTC ATG 


r\ Anno 

0.0002 


0.0000 


0.92 


T A A A A ATA A ata A a/"* 

TAG C G CTC ACTG ACC 


0.0029 


0.0000 


0.67 


T A /"^ A/~* AT A A ATA A A-/"^ 

TAGCGCTCACTGACG 


0.0009 


0.0000 


0.77 


t a A A A ata a ata a t a 

TAGOGG lOAU 1 GATG 


0.0000 


0.0000 


0.41 


ta a a a at ah — r a a a a a 


0.0023 


0.0000 


1 .00 


TA /^rVO/^(^/^ A ATA A A A. 

TAGCGGCCACTCACG 


0.0000 


0.0078 


0.22 


TAGCGGCCACTCATC 


a a a a a 

0.0000 


0.0078 


0.22 


T A /"» A A*» A AT A AT/"^ A TA^ 

TAGCGGGTACTGATG 


0.0023 


0.0000 


1.00 


TAGCG GTCACCCACC 


0.1762 


0.1677 


0.83 


TAGCGGTCACCCACG 


0.0000 


0.0000 


0.15 


TAGCGGTCACCGACG 


0.0065 


0.0078 


0.91 


TAGCGGTCACTCACC 


0.3648 


0.4023 


0.45 


TAG CG GTC ACTC ACG 


A AAA A 

0.0024 


0.0159 


0.09 


TAGCGGTCACTCATG 


0.0707 


0.0859 


0.59 


TAGCGGTCACTCGCC 


0.0023 


0.0000 


0.50 


f A AAAATA A 0*T"/^ A AA 

TAGCGGTCACTGACC 


0.0993 


0.0859 


0.70 


-r* A AAA yk^ - / - "* A ATA A A j* - * 

TAGCGGTCACTGACG 


0.0027 


0.0000 


0.87 


TAGTGGTCACCCACC 


0.0000 


0.0078 


0.08 


TAGTG GTC ACTC ACC 


0.0063 


A A A A A 

0.0000 


0.70 


T A <—*T/"~*/""»T A A ATi^» A AA 

TAGTG GTC ACTG ACC 


0.0006 


0.0000 


0.48 


TO A A A AT AT — TO A A A A 


U.UUoZ 


U.UUUU 


n oo 

u.zy 


TGACGCTCTTCCGCC 


0.0026 


0.0000 


0.68 


TGACGCTCTTCCGCG 


0.0169 


0.0000 


0.28 


TGACGCTCTTCCGTG 


0.0059 


0.0000 


0.49 


TGACGCTCTTCGACG 


0.0014 


0.0000 


0.89 


TGACGCTCTTCGGCG 


0.0023 


0.0234 


0.02 


















Overal Test 






0.61 
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TABLE 27(CONT) 



Haplotypes for 15-SNP Combination 

D1/F1/1 1 /L1/S1/S2/T1 /T2A/1/V2/V3A/4/V5A/6A/7 


Freq_Control 


Freq Case 
BHR 


Pval-2sided 


UK 








CAGCGGTCACTCACC 


0.0000 


0.0100 


0.28 


TAACACTCACTGACG 


0.0022 


0.0000 


0.99 


TAACGCCCACTCACG 


0.0115 


0.0100 


0.82 


TAACGC CCACTGACC 


0.0036 


0.0000 


1.00 


TAACGC CTACTC AC G 


0.0895 


0.0500 


0.29 


TAACGC CTACTCATG 


0.0071 


0.0000 


0.77 


TAACGC CTACTGACG 


0.0132 


0.0000 


0.30 


TAACGCTCACCCACC 


0.0036 


0.0000 


0.99 


TAACGCTCACCGACC 


0.0000 


0.0100 


0.15 


TAACGCTCTTCGGCG 


0.0071 


0.0000 


0.78 


TAACGGTCACTCACC 


0.0050 


0.0000 


0.28 


TAGCACTCACTCACG 


0.0041 


0.0000 


0.41 


TAG CACTCACTGAC G 


0.0972 


0.0400 


0.08 


TAGCGCCTACTCATG 


0.0036 


0.0000 


0.26 


TAGCGCTCACTGACC 


0.0044 


0.0000 


0.64 


TAGCGCTCACTGACG 


0.0028 


0.0000 


0.63 


TAGCGCTCACTGATG 


0.0000 


0.0000 


0.41 


TAGCGGCCACTCACG 


0.0000 


0.0100 


0.25 


TAG CGG CTACTG ATG 


0.0036 


0.0000 


1.00 


TAGCGGTCACCCACC 


0.1808 


0.1472 


0.49 


TAGCGGTCACCGACC 


0.0000 


0.0055 


0.15 


TAGCGGTCACCGACG 


0.0062 


0.0098 


0.90 


TAG CG GTC ACTC ACC 


0.3569 


0.4722 


0.08 


TAGCGGTCACTCACG 


0.0000 


0.0207 


0.03 


TAGCGGTCACTCATG 


0.0687 


0.0900 


0.54 


TAGCGGTCACTGACC 


0.0958 


0.0947 


0.97 


TAGCGGTCACTGACG 


0.0045 


0.0000 


0.79 


TAGTGGTCACCCACC 


0.0000 


0.0100 


0.15 


TAGTG GTCACTC ACC 


0.0049 


0.0000 


0.88 


TAGTGGTCACTGACC 


0.0023 


0.0000 


0.52 


TGACGCTCTTCCGCC 


0.0036 


0.0000 


1.00 


TGACGCTCTTCCGCG 


0.0143 


0.0000 


0.42 


TGACGCTCTTCGGCG 


0.0036 


0.0200 


0.12 


















Overall Test 






0.27 



TABLE 27(CONT) 



Haplotypes for 15-SNP Combination 

D1 /F1 /1 1 /L1/S1/S2/T1 /T2/V1 /V2/V3A/4/V5A/6/V7 


Freq_Coritrol 


Freq Case 
BHR 


Pval-2sided 


US 








TAACGCCCACTCACG 


0.0000 


0.0357 


0.038 


TAACGCCTACTCACG 


0.0714 


0.2143 


0.030 
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TAACGGTCAOCGACG 


0.0000 


0.0000 


0.920 


TAACGGTC 1 LrCGAGG 


O.UOOO 


0.0000 


0.9zi0 


TAGCACTCAO T OAGC 


0.0065 


0.0000 


0.169 


TAGCACTCACTGACG 


0.0974 


0.1786 


0.275 


TAGCGCCTACTCACG 


0.0065 


0.0000 


0.992 


TAGCGCTCACTCACG 


0.0054 


0.0000 


0.547 


TAGCGCTCACTCATG 


0.0005 


0.0000 


0.781 


TAGCGCTCACTCGCG 


0.0003 


0.0000 


0.280 


TAGCGCTCACTCGTG 


0.0002 


0.0000 


0.296 


TAGCGCTCTTCCGCG 


0.0065 


O A«AA 

0.0000 


0.984 


TAGCGGCCACTCATC 


0.0000 


0.0357 


0.134 


TA G C G G TCACC GA CC 


0.1688 


0.2143 


0.543 


TAGCGGTCACCGACG 


0.0064 


0.0000 


0.1 18 


TAGCGGTCACTCACC 


0.3858 


0.2143 


0.071 


TAGCGGTCACTCACG 


0.0066 


0.0000 


0.721 


TAGCGGTCAOTOATG 


0.0725 


0.0357 


0.655 


TAGCGGTCACTCGCG 


O.OObo 


0.0000 


0.460 


TAGCGGTCACTGACC 


0.0997 


0.0000 


0.099 


TAGCGGTCACTGATG 


0.0000 


0.0357 


0.022 


TAGCGGTCTOCGACG 


0.0000 


0.0000 


0.920 


TAGTGGTGACTCACC 


0.0065 


0.0000 


1 .000 


TGACAC T CTTCGACG 


0.0000 


0.0089 


0.093 




U.UUUU 


u.uuoy 


U.Uao 


TGACGCTCTTCCACG 


0.0089 


0.0000 


0.189 


TGACGCTCTTCCGCG 


0.0205 


0.0000 


0.804 


TGACGCTCTTCCGTG 


0.0184 


0.0000 


0.655 


TGACGCTCTTCGACG 


0.0042 


0.0089 


0.922 


TGACGCTCTTCGGCG 


0.0000 


0.0089 


0.093 


















Overall Test 






0.105 



EXAMPLE 14: Transmission Disequilibrium Test (TDT) 

To ensure that the significant association observed in the case-control 
studies was not an artifact due to population admixture, a transmission 
5 disequilibrium test (TDT) was conducted. By selecting a single affected 
offspring in each family, the TDT performed as a family based test of 
association (due to linkage disequilibrium) in the presence of linkage. The TDT 
determined whether a particular allele was preferentially transmitted to an 
affected individual, as compared to what would be expected by chance. Only 
10 heterozygous parents were considered informative for the TDT. In addition, 
heterozygous parents transmitting different alleles to two affected offspring 
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were ignored. Accordingly, the TDT was based on the same families that 
contributed to the linkage signal. The significance levels were estimated by 
Markov Chain Monte Carlo simulation methods as implemented in TDTEXfrom 
the S.A.G.E. program (Department of Epidemiology and Biostati sties, 
5 Rammelkamp Center for Education and Research, MetroHealth Campus, Case 
Western Reserve University, Cleveland, OH (1997)). 

1. Asthma Phenotype : Eleven candidate SNPs were typed in the 
extended population in order to confirm the association seen in the 
case/control study. The eleven SNPs were in Gene 216 exons L-1, S1, S2, 

10 ST+4, ST+7, T1, V-4, V-3, V-1, V1 and V4. In addition to analyzing SNPs 
separately, SNP haplotypes (all 2-at-a-time, all 3-at-a-time and selected 4-at-a- 
time and 5-at-a-time) were constructed based on family data with the program 
GENEHUNTER (Kruglyak et al., 1996). This served to increase the 
informativeness of the single SNPs, as only heterozygote parents contributed 

15 information to the TDT. These haplotypes were used as "alleles" in 
subsequent TDT analyses. In addition, p-values obtained from the TDT 
analyses were compared to the p-values obtained from the haplotyping in the 
case/control setting. To check for consistency, the p-values were recorded to 
compare the haplotype frequencies between the cases and controls of the 

20 over-transmitted alleles/haplotypes. 

The TDT results strongly supported the association previously observed 
in the case control studies (Table 28). Five of the eleven SNPs showed alleles 
that were preferentially transmitted to affected offspring (p < 0.05 to p < 0.005) 
in either the combined or one of the separate populations. When these SNPs 

25 were haplotyped together, most combinations had a haplotype that was 
preferentially transmitted to affected offspring in either the combined or one of 
the separate populations (p < 0.05 to p = 0.0002). The most significant SNP 
in the combined population was S2 (p = 0.0049) while the most significant 2-at- 
a-time haplotype included SNPs S2A/-1 (p = 0.0011). The most significant 

30 haplotype in the combined population included SNPs S2/ST+7A/-1 (p = 
0.0006). .In the UK population, the most significant SNP was S1 (p = 0.0043), 
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while the most significant 2-at-a-time haplotype included SNPs ST+7/T1 (p = 
0.0013). The most significant haplotype in the UK population included SNPs 
S2/ST+7/T1 (p = 0.0002). In the US population, the most significant SNP was 
S2 (p = 0.0351) and the most significant haplotype included SNPs S2/V-1 (p 
5 = 0.0106). The lower significance in the US was most likely due to the smaller 
sample size in that population and to the reduced power of the TDT versus the 
case-control study design. 

Importantly, for almost all of the significant single SNP or multiple SNP 
haplotypes, the allele that was over-transmitted in either the combined 

10 population or in the UK sample was more frequent in the cases than in the 
controls. A summary of the TDT analyses and a comparison between the 
case/control and TDT results are presented in Table 28. 

2. Bronchial Hyper-responsiveness : The TDT analyses were 
repeated using only those asthmatic pairs that satisfied the additional criteria 

1 5 of having a PC 2 o ^16 mg/ml (Table 29). The vast majority of single SNP and 
multiple SNP haplotypes showed increased significance with the more 
restricted phenotype. The most significant SNP in the combined population 
was S2 (p = 0.0029). The most significant 2-at-a-time haplotypes included 
SNPs S1/T1 as well as SNPs T1/V4 (p = 0.0005), while the most significant 3- 

20 at-a-time haplotype included SNPs ST+7/TW4 (p = 0.0003). The most 
significant haplotypes included SNPs S2/ST+7/T1 A/-3/V-1 as well as SNPs 
S2/ST+7/T1 AM/V4 (p = 0.00013). In the UK population, the most significant 
SNP was S2 (p = 0.0055). The most significant 2-at-a-time haplotype included 
SNPs S2/ST+7 (p = 0.00013), while the most significant 3-at-a-time haplotype 

25 included SNPs ST+7/T1/V4 (p = 0.000019). Increased significance was 
observed in both the selected 4 and 5-at-a-time haplotypes. The most 
significant 4-at-a-time haplotype included SNPs S2/ST+7/T1 A/-1 (p = 
0.000009), while the most significant 5-at-a-time haplotype included SNPs 
S2/ST+7/T1 A/-3/V-1 (p = 0.000001 ). In the US population, the most significant 

30 SNP was L-1 (p = 0.0386) and the most significant haplotype included SNPs 
L-1/ST+7 (p = 0.0423). Similar to the yes/no phenotype, the over-transmitted 
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alleles in the TDT were more frequent in the cases for the majority of the 
alleles in both the combined and UK population. In summary, the analysis of 
single SNPs and SNP haplotypes by the TDT test provided confirmatory 
evidence for Gene 216 as an asthma susceptibility gene. 
5 It is noted that for Tables 28 and 29, the haplotypes are written without 

slashes separating each allele. Thus, the haplotype that is G/T/G/C/C is 
written as GTGCC in Table 28. This represents the short-hand designations 
for the haplotypes and is not, in any way, meant to represent contiguous 
nucleotide sequences. 
10 TABLE 28 



Asthma Yes/No 
Combined 


SNP(s) 


Over- 
Transmitted 
Allele or 
Haplo 


T 


NT 


TDT 
p-value 


Case/ 
Cntl 

p-value 


Case 
freq 


Cntl 
freq 


L-1 


G 


37 


31 


0.5446 


1.0000 


89% 


89% 


S1 


G 


37 


20 


0.0331 


0.0233 


95% 


89% 


S2 


G 


73 


42 


0.0049 


0.0662 


80% 


74% 


ST+4 


A 


93 


93 


1 .0000 


0.0313 


60% 


51% 


ST+7 


G 


59 


41 


0.0886 


0.0160 


86% 


78% 


T1 


T 


43 


27 


0.0722 


0.9025 


88% 


89% 


V-4 


G 


69 


68 


1 .0000 


0.4713 


27% 


24% 


V-3 


A 


83 


76 


0.6343 


0.6834 


39% 


37% 


V-1 


C 


43 


27] 


0.0722 


0.0055 


92% 


85% 


V1 


A 


7 


7 


1 .0000 


0.2515 


98% 


96% 


V4 


C 


73 


55 


0.1326 


0.0336 


84% 


77% 


L-1/S1 


GG 


64 


38 


0.0167 


0.0925 


84% 


78% 


L-1/S2 


GG 


78 


46 


0.0083 


0.0787 


80% 


74% 


L-1/ST+4 


GA 


96 


89 


0.5802 


0.1040 


49% 


42% 


L-1/ST+7 


GG 


85 


58 


0.0517 


0.0426 


75% 


67% 


L-1/T1 


GT 


40 


27J 


0.0841 


0.6682 


88% 


89% 


L-1 /V-4 


GC 


98 


81 


0.3053 


0.4698 


62% 


65% 


L-1/V-3 


GG 


94 


87 


0.6144 


0.6823 


50% 


52% 


L-1 /V-1 


GC 


71 


42 


0.0094 


0.0306 


81% 


74% 


L-1/V1 


GA 


43 


30 


0.2394 


0.5631 


87% 


85% 


L-1/V4 


GC 


84 


58 


0.0615 


0.0937 


72% 


66% 


S1/S2 


GG 


74 


40 


0.0020 


0.0649 


80% 


74% 


S1/ST+4 


GA 


99 


88 


0.1935 


0.0012 


55% 


42% 


S1/ST+7 


GG 


59 


40 


0.0823 


0.0213 


85% 


78% 


S1/T1 


GT 


72 


38 


0.0016 


0.1232 


83% 


78% 


SW-4 


GC 


88 


68 


0.0671 


0.4747 


68% 


65% 


SW-3 


GG 


93 


81 


0.1306 


0.4049 


56% 


53% 
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S1/V-3 


GA 


83 


77 


0.1306 


0.6842 


39% 


37% 


S1/V-1 


GC 


44 


26 


0.0540 


0.0042 


92% 


85% 


S1/V1 


GA 


43 


26 


0.0565 


0.0088 


93% 


86% 


S1/V4 


GC 


75 


57 


0.1318 


0.0302 


83% 


76% 


S2/ST+4 


GA 


112 


77 


0.0165 


0.0019 


43% 


31% 


S2/ST+7 


GG 


95 


56 


0.0041 


0.0746 


73% 


67% 


S2/T1 


GT 


78 


42 


0.0029 


0.0844 


79% 


73% 


S2/V-4 


GC 


114 


73 


0.0017 


0.4269 


53% 


50% 


S2/V-3 


GG 


105 


70 


0.0034 


0.1681 


43% 


37% 


S2/V-1 


GC 


77 


40 


0.0011 


0.0690 


80% 


74% 


S2/V1 


GA 


74 


40 


0.0019 


0.0638 


80% 


74% 


S2/V4 


GC 


90 


54 


0.0039 


0.0174 


71% 


62% 


ST+4/ST+7 


AG 


104 


86 


0.2999 


0.0006 


55% 


40% 


ST+4/T1 


AT 


101 


82 


0.1920 


0.1 333 


48% 


42% 


ST+4/V-4 


AC 


92 


89 


0.9431 


0.0489 


60% 


52% 


ST+4/V-3 


AG 


95 


85 


0.3543 


0.0559 


59% 


52% 


ST+4/V-1 


AC 


103 


88 


0.2800 


0.0006 


' 55% 


41% 


ST+4/V1 


CA 


90 


88 


0.9281 


0.0972 


38% 


45% 


ST+4/V4 


AC 


102 


88 


0.2943 


0.0010 


55% 


41% 


ST+7/T1 


GT 


86 


52 


0.0060 


0.0537 


74% 


67% 


ST+7/V-4 


GC 


99 


76 


0.0819 


0.1027 


65% 


59% 


ST+7/V-3 


GG 


103 


81 


0.1801 


0.0507 


54% 


47% 


ST+7/V-1 


GC 


63 


44 


0.0889 


0.0188 


85% 


78% 


ST+7/V1 


GA 


59 


43 


0.2206 


0.0153 


86% 


78% 


ST+7/V4 


GC 


83 


62 


0.1327 


0.0046 


76% 


66% 


T1A/-4 


TC 


95 


74 


0.1090 


0.4155 


61% 


64% 


T1/V-3 


TG 


97 


80 


0.2083 


0.6220 


50% 


52% 


T1/V-1 


TC 


80 


46 


0.0053 


0.0467 


81% 


74% 


T1/V1 


TA 


49 


31 


0.0738 


0.6598 


86% 


85% 


T1/V4 


TC 


97 


60 


0.0064 


0.1334 


72% 


66% 


V-4/V-3 


CA 


52 


43 


0.2513 


0.6973 


12% 


13% 


NM/V-3 


GG 




1 


0.2513 


0.3277 


0% 


0% 


V-4AM 


CC 


91 


70 


0.0560 


0.2280 


65% 


61% 


V-4/V1 


CA 


70 


66 


0.8701 


0.8035 


71% 


72% 


V-4/V4 


CC 


97 


78 


0.3465 


0.3691 


57% 


53% 


V-3A/-1 


GC 


98 


83 


0.1240 


0.1818 


54% 


48% 


V-3/V1 


GA 


83 


80 


0.8925 


0.9739 


59% 


59% 


V-3/V4 


GC 


98 


79 


0.1049 


0.4273 


54% 


51% 


V-1/V1 


CA 


43 


29 


0.1385 


0.0058 


92% 


85% 


V-1/V4 


CC 


76 


57 


0.1440 


0.0009 


83% 


73% 


V1/V4 


AC 


77 


55 


0.0972 


0.0024 


84% 


74% 


L-1/S1/S2 


GGG 


70 


41 


0.0124 


0.0810 


80% 


74% 


L-1/S1/ST+4 


GGA 


90 


70 


0.1572 


0.0032 


43% 


32% 


L-1/S1/ST+7 


GGG 


76 


51 


0.0826 


0.0368 


75% 


67% 


L-1/S1/T1 


GGT 


63 


37 


0.0170 


0.1622 


83% 


78% 


L-1/S1/V-4 


GGC 


99 


68 


0.0393 


0.5521 


57% 


54% 


L-1/S1/V-3 


GGG 


89 


72 


0.1596 


0.4148 


45% 


42% 


L-1/S1/V-1 


GGC 


69 


40 


0.0166 


0.0268 


82% 


74% 


L-1/S1/V1 


GGA 


67 


40 


0.0260 


0.0303 


82% 


75% 
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L-1/S1/V4 


GGC 


83 


58 


0.0956 


0.0831 


72% 


66% 


L-1/S2/ST+4 


GGA 


96 


67 


0.0428 


0.0016 


43% 


31% 


L-1/S2/ST+7 


GGG 


89 


55 


0.0072 


0.0949 


73% 


67% 


L-1/S2/T1 


GGT 


71 


39 


0.0055 


0.1188 


79% 


74% 


L-1/S2/V-4 


GGC 


105 


68 


0.0088 


0.4614 


53% 


50% 


L-1/S2/V-3 


GGG 


93 


66 


0.0257 


0.1716 


42% 


37% 


L-1/S2/V-1 


GGC 


73 


41 


0.0056 


0.0866 


80% 


74% 


L-1/S2/V1 


GGA 


72 


41 


0.0097 


0.0765 


80% 


74% 


L-1/S2/V4 


GGC 


81 


53 


0.0323 


0.0135 


71% 


62% 


L-1/ST+4/ST+7 


GAG 


97 


72 


0.2083 


0.0013 


44% 


31% 


L-1/ST+4/T1 


GAT 


87 


75 


0.2354 


0.1679 


48% 


42% 


L-1/ST+4/V-4 


GAC 


95 


81 


0.6248 


0.1018 


49% 


42% 


L-1/ST+4/V-3 


GAG 


91 


78 


0.5715 


0.1123 


48% 


42% 


L-1/ST+4/V-1 


GAC 


93 


72 


0.1470 


0.0020 


43% 


31% 


L-1/ST+4/V1 


GAA 


89 


81 


0.5655 


0.1157 


49% 


42% 


L-1/ST+4/V4 


GAC 


93 


70 


0.1319 


0.0051 


43% 


32% 


L-1/ST+7/T1 


GGT 


77 


48 


0.0131 


0.0767 


74% 


67% 


L-1/ST+7/V-4 


GGC 


107 


75 


0.0670 


0.1309 


54% 


48% 


L-1/ST+7/V-3 


GGG 


99 


70 


0.0847 


0.0549 


43% 


36% 


L-1/ST+7/V-1 


GGC 


81 


53 


0.031 1 


0.0516 


74% 


67% 


L-1/ST+7/V1 


GGA 


77 


52 


0.0929 


0.0361 


75% 


67% 


L-1 /ST+7/V4 


GGC 


89 


61 


0.0986 


0.0141 


65% 


56% 


L-1/T1/V-4 


GTC 


89 


69 


0.0344 


0.3591 


61% 


64% 


L-1/T1/V-3 


GTG 


84 


74 


0.0748 


0.5549 


49% 


52% 


L-1/T1/V-1 


GTC 


70 


39 


0.0033 


0.0654 


80% 


74% 


L-1/T1/V1 


GTA 


44 


28 


0.1126 


0.7542 


86% 


85% 


L-1/T1/V4 


GTC 


84 


57 


0.0378 


0.1764 


71% 


66% 


L-1A/-4/V-3 


GCG 


93 


86 


0.2018 


0.6863 


50% 


52% 


L-1 A/-4/V-3 


GCA 


49 


38 


0.2018 


0.6803 


12% 


13% 


L-1A/-4/V-1 


GCC 


102 


69 


0.0264 


0.2783 


54% 


50% 


L-1 A/-4/V1 


GCA 


91 


73 


0.3709 


0.7722 


60% 


61% 


L-1A/-4/V4 


GCC 


96 


64 


0.0834 


0.5577 


45% 


43% 


L-1/V-3/V-1 


GGC 


92 


70 


0.0389 


0.1787 


43% 


37% 


L-1 A/-3A/1 


GGA 


86 


77 


0.5533 


0.9622 


48% 


48% 


L-1A/-3/V4 


GGC 


93 


69 


0.1149 


0.61 76> 


43% 


41% 


L-1/V-1/V1 


GCA 


69 


42 


0.0300 


0.0234 


82% 


74% 


L-1 AM A/4 


GCC 


85 


58 


0.0632 


0.0089 


72% 


62% 


L-1 A/1 /V4 


GAC 


86 


57 


0.0646 


0.0123 


72% 


63% 


S1/S2/ST+4 


GGA 


99 


69 


0.0296 


0.0041 


43% 


31% 


S1/S2/ST+7 


GGG 


86 


48 


0.0037 


0.0958 


73% 


67% 


S1/S2/T1 


GGT 


76 


42 


0.0091 


0.0862 


79% 


73% 


S1/S2A/-4 


GGC 


104 


66 


0.0058 


0.4350 


53% 


50% 


S1/S2A/-3 


GGG 


94 


65 


0.0180 


0.1734 


43% 


37% 


S1/S2AM 


GGC 


75 


40 


0.0028 


0.0669 


80% 


74% 


S1/S2/V1 


GGA 


73 


40 


0.0032 


0.0679 


80% 


74% 


S1/S2A/4 


GGC 


89 


54 


0.0031 


0.0296 


70% 


62% 


S1/ST+4/ST+7 


GAG 


90 


81 


0.3274 


0.0007 


54% 


41% 


S1/ST+4/ST+7 


GCG 


76 


66 


0.3274 


0.1272 


31% 


37% 


S1/ST+4/T1 


GAT 


103 


73 


0.0296 


0.0046 


43%| 32%| 
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S1/ST+4/V-4 


GAC 


93 


78 


0.2035 


0.0017 


54% 


42% 


S1/ST+4/V-3 


GAG 


95 


' 76 


0.1558 


0.0020 


54% 


42% 


S1/ST+4/V-1 


GAC 


100 


87 


0.4802 


0.0019 


54% 


41% 


S1/ST+4/V1 


GAA 


98 


88 


0.3370 


0.0012 


55% 


42% 


S1/ST+4/V4 


GAC 


100 


89 


0.5596 


0.0005 


54% 


41% 


S1/ST+7/T1 


GGT 


84 


48 


0.0065 


0.0555 


74% 


67% 


S1/ST+7/V-4 


GGC 


88 


67 


0.1482 


0.1176 


65% 


59% 


S1/ST+7/V-3 


GGG 


90 


75 


0.1843 


0.0676 


54% 


47% 


S1/ST+7/V-1 


GGC 


61 


40 


0.0749 


0.0178 


85% 


78% 


S1/ST+7/V1 


GGA 


58 


40 


0.1649 


0.0192 


85% 


78% 


S1/ST+7/V4 


GGC 


82 


62 


0.2201 


0.0068 


76% 


66% 


S1H-1/V-4 


GTC 


1071 


69 


0.0041 


0.6112 


56% 


54% 


S1/T1/V-3 


GTG 


101 


72 


0.0248 


0.4429 


44% 


41% 


S1/T1/V-1 


GTC 


77 


41 


0.0030 


0.0351 


81% 


74% 


S1/T1/V1 


GTA 


75 


41 


0.0041 


0.0550 


81% 


75% 


S1/T1/V4 


GTC 


96 


60 


0.0120 


0.1214 


72% 


66% 


S1/V-4Af-3 


GCG 


90 


80 


0.2072 


0.3996 


56% 


53% 


S1/V-4/V-3 


GCA 


47 


37 


0.2072 


0.6561 


11% 


13% 


S1/V-4A/-1 


GCC 


90 


68 


0.0936 


0.2473 


65% 


61% 


S1/V-4/V1 


GCA 


89 


68 


0.1044 


0.2773 


66% 


62% 


S1/V-4/V4 


GCC 


97 


79 


0.3192 


0.4350 


56% 


53% 


S1/V-3/V-1 


GGC 


96 


80 


0.1266 


0.1910 


54% 


48% 


S1/V-3/V1 


GGA 


94 


79 


0.1781 


0.2146 


54% 


49% 


S1/V-3/V4 


GGC 


97 


80 


0.2252 


0.4951 


54% 


51% 


S1/V-1/V1 


GCA 


42 


26 


0.0900 


0.0047 


92% 


85% 


S1/V-1/V4 


GCC 


76 


57 


0.1512 


0.0012 


83% 


73% 


S1/V1/V4 


GAC 


77 


56 


0.1683 


0.0020 


83% 


73% 


S2/ST+4/ST+7 


GAG 


100 


69 


0.1005 


0.0033 


43% 


32% 


S2/ST+4/T1 


GAT 


100 


66 


0.0388 


0.0020 


43% 


31% 


S2/ST+4/V-4 


GAC 


102 


65 


0.0217 


0.0028 


43% 


31% 


S2/ST+4/V-3 


GAG 


104 


64 


0.0093 


0.0029 


42% 


31% 


S2/ST+4AM 


GAC 


103 


67 


0.0191 


0.0022 


43% 


31% 


S2/ST+4/V1 


GAA 


101 


69 


0.0126 


0.0031 


43% 


32% 


S2/ST+4/V4 


GAC 


102 


68 


0.0387 


0.0021 


43% 


31% 


S2/ST+7/T1 


GGT 


90 


48 


0.0039 


0.0984 


72% 


67% 


S2/ST+7/V-4 


GGC 


109 


72 


0.0143 


0.1733 


53% 


48% 


S2/ST+7/V-3 


GGG 


100 


64 


0.0091 


0.0624 


43% 


35% 


S2/ST+7AM 


GGC 


91 


48 


0.0006 


0.0763 


73% 


67% 


S2/ST+7/V1 


GGA 


88 


48 


0.0017 


0.0659 


73% 


67% 


S2/ST+7/V4 


GGC 


95 


54 


0.0023 


0.0288 


64% 


55% 


S2/T1/V-4 


GTC 


104 


62 


0.0019 


0.4882 


52% 


50% 


S2/T1/V-3 


GTG 


97 


62 


0.0060 


0.2075 


42% 


37% 


S2/T1/V-1 


GTC 


79 


42 


0.0042 


0.0814 


79% 


74% 


S2/T1/V1 


GTA 


77 


42 


0.0035 


0.0922 


79% 


73% 


S2/T1/V4 


GTC 


91 


54 


0.0111 


0.0175 


71% 


62% 


S2/V-4/V-3 


GCG 


98 


69 


0.0093 


0.1562 


43% 


37% 


S2/V-4/V-1 


GCC 


107 


66 


0.0019 


0.4078 


53% 


50% 


S2/V-4/V1 


GCA 


105 


66 


0.0029 


0.4168 


53% 


50% 


S27V-4/V4 


GCC 


99 


61 


^0.0093 


0.2022 


44% 


39% 
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S2/V-3/V-1 


GGC 


99 


65 


0.0065 


0.1670 


43% 


37% 


S2/V-3/V1 


GGA 


97 


65 


0.0065 


0.1578 


43% 


37% 


S2/V-3/V4 


GGC 


100 


63 


O.OOoO 


0.1658 


43% 


37% 


S2/V-1/V1 


GCA 


75 


40 


0.0027 


0.0627 


80% 


74% 


S2/V-1/V4 


GCC 


89 


54 


0.0079 


0.0197 


71% 


62% 


S2/V1/V4 


GAC 


89 


54 


0.0025 


0.0185 


71% 


62 % 


ST+4/ST+7/T1 


AGT 


96 


70 


0.0457 


0.0029 


43% 


31% 


ST+4/ST+7/V-4 


AGC 


97 


80 


0.4361 


0.0009 


54% 


40% 


ST+4/ST+7/V-3 


AGG 


99 


78 


0.2168 


0.001 1 


54% 


41% 


ST+4/ST+ 7/V-1 


AGC 


93 


82 


0.4387 


0.0013 


55% 


41% 


ST+4/ST+7/V-1 


CGC 


77 


68 


0.4387 


0.1220 


31% 


37% 


ST+4/ST+7/V1 


AGA 


91 


81 


0.5479 


0.0008 


55% 


41 % 


ST+4/ST+ 7/V1 


CGA 


75 


68 


0.5479 


0.1043 


31% 


38% 


ST+4/ST+7/V4 


AGC 


94 


82 


0.4633 


0.0013 


54% 


41% 


ST+4/T1 /V-4 


ATC 


93 


74 


0.21 70 


0.1338 


48% 


42% 


ST+4/T1 /V-3 


ATG 


96 


70 


0.0648 


0.1383 


48% 


42% 


ST+4/T1/V-1 


ATC 


108 


74 


0.0307 


0.0039 


43% 


31% 


ST+4/T1 /V1 


ATA 


100 


83 


0.2959 


0.1454 


48% 


42% 


S I +4/ I 1/V4 


ATC 


106 


73 


0.0451 


0.0079 


42% 


32% 


ST+4/V-4/V-3 


CCA 


45 


38 


0.1845 


0.4920 


11% 


13% 


ST+4/V-4/V-1 


ACC 


96 


79 


0.1591 


0.0010 


55% 


41% 


ST+4/V-4/V1 


CCA 


45 


41 


0.7601 


0.0019 


11% 


20% 


ST+4/V-4/V4 


ACC 


95 


76 


0.2691 


0.0017 


54% 


41% 


ST+4/V-3/V-1 


AGC 


98 


77 


0.1923 


0.0019 


54% 


42% 


t>*r* ■ m t\ t *yt\fA 

ST+4/V-3/V1 


AGA 


86 


76 


0.2006 


0.0593 


60% 


52% 


e>-r* ■ AM I\ I A 

ST+4/V-3/V4 


AGC 


99 


76 


0.1960 


0.0016 


54% 


40% 


ST+4/V-1/V1 


ACA 


101 


87 


0.3833 


0.0014 


54% 


42% 


ST+4A/-1/V4 


ACC 


102 


87 


0.4317 


0.0008 


54% 


41% 


ST+4/V1/V4 


AAC 


102 


87 


0.3252 


0.0013 


55% 


41% 


a *T IT A f\ i A 

ST+7/T1/V-4 


GTC 


106 


68 


0.0054 


0.1502 


53% 


48% 


ST+7/T1/V-3 


GTG 


98 


68 


0.0295 


0.0658 


43% 


36% 


ST+7/T1/V-1 


GTC 


89 


53 


0.0058 


0.0743 


74% 


67% 


ST+7/T1/V1 


GTA 


84 


52 


0.0177 


0.0518 


74% 


67% 


ST+7/T1/V4 


GTC 


97 


58 


0.0058 


0.0289 


64% 


56% 


ST+7/V-4/V-3 


GCG 


97 


80\ 


0.0783 


0.1 188 


54% 


47% 


ST+7/V-4/V-1 


GCC 


92 


68 


0.0203 


0.0934 


i_ 65% 


59% 


ST+7/V-4/V1 


GCA 


90 


68 


0.0857 


0.1003 


65% 


59% 


ST+7/V-4/V4 


GCC 


94 


74 


0.2441 


0.0392 


56% 


47% 


aTi TAl Ahf *f 

ST+7/V-3/V-1 


GGC 


95 


77 


0.0626 


0.0807 


54% 


47% 


Of if AJ AAM 

ST+7/V-3/V1 


GGA 


93 


76 


0.3285 


0.0616 


54% 


47% 






ao 


to 


U.U/ OD 




54 To 


4 f /o 


ST+7/V-1/V1 


GCA 


61 


44! 


0.2511 


0.0165 


85% 


78% 


ST+7/V-1/V4 


GCC 


83 


62 


0.1579 


0.0032 


76% 


66% 


ST+7/V1/V4 


GAC 


82 


61 


0.1877 


0.0050 


76% 


66% 


T1/V-4/V-3 


TCG 


90 


79 


0.1410 


0.6399 


50% 


52% 


T1/V-4/V-3 


TCA 


46 


36 


0.1410 


0.6812 


12% 


13% 


T1/V-4/V-1 


TCC 


111 


69 


0.0014 


0.3062 


54% 


50% 


T1/V-4/V1 


TCA 


95 


73, 


0.1714 


0.7134 


59% 


61% 


T1/V-4/V4 


TCC 


104 


66 


0.0217 


0.6140 


45% 


43% 
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T1/V-3/V-1 


TGC 


105 


71 


0.0113 


0.2080 


42% 


37% 


T1/V-3/V1 


TGA 


96 


76 


0.2641 


0.8990 


48% 


48% 


T1/V-3/V4 


TGC 


105 


69 


0.0138 


0.6748 


42% 


41% 


T1/V-1/V1 


TCA 


76 


45 


0.0131 


0.0390 


81% 


74% 


TW-W4 


TCC 


97 


59 


0.0063 


0.0146 


71% 


62% 


T1/V1/V4 


TAC 


98 


58 


0.0035 


0.0207 


72% 


63% 


V-4/V-3AM 


CGC 


92 


80 


0.1065 


0.1999 


54% 


48% 


V-4/V-3AM 


CAC 


48 


37 


0.1065 


0.7257 


12% 


12% 


V-4/V-3/V1 


CAA 


45 


39 


0.8169 


0.7142 


12% 


13% 


V-4/V-3/V4 


CGC 


92 


76 


0.2275 


0.5461 


54% 


52% 


V-4/V-1/V1 


CCA 


90 


69 


0.0889 


0.2305 


65% 


61% 


V-4/V-1/V4 


CCC 


96 


77 


0.2412 


0.0903 


56% 


49% 


V-4/V1/V4 


CAC 


97 


75 


0.3730 


0.1078 


57% 


50% 


V-3/V-1/V1 


GCA 


97 


81 


0.2076 


0.1888 


54% 


48% 


V-3/V-1/V4 


GCC 


99 


78 


0.1115 


0.1616 


54% 


48% 


V-3/V1/V4 


GAC 


99 


76 


0.1093 


0.1643 


54% 


48% 


V-1/V1/V4 


CAC 


75 


56 


0.0973 


0.0013 


83% 


73% 


S1/T1/V-1A/1 


GTCA 


74 


41 


0.0053 


0.0390 


L 81% 


74% 


S1/T1/V-1/V4 


GTCC 


96 


58 


0.0051 


0.0122 


72% 


62% 


S1/T1/V1/V4 


GTAC 


97 


58 


0.0059 


0.0182 


72% 


63% 


S1/V-1/V1/V4 


GCAC 


75 


56 


0.1539 


0.0011 


83% 


73% 


S2/ST+7/T1/V-4 


GGTC 


102 


62 


0.0059 


0.1211 


53% 


47% 


S2/ST+7/T1/V-3 


GGTG 


93 


59 


0.0049 


0.0960 


42% 


36% 


S2/ST+7/T1/V-1 


GGTC 


92 


48 


0.0016 


0.1038 


72% 


67% 


S2/ST+7/T1/V4 


GGTC 


95 


52 


0.0031 


0.0189 


64% 


55% 


S2/ST+7/V-4/V-1 


GGCC 


106 


66 


0.0015 


0.1972 


53% 


48% 


S2/ST+7/V-4/V4 


GGCC 


95 


59 


0.0265 


0.0340 


45% 


36% 


S2/ST+7/V-3/V-1 


GGGC 


95 


61 


0.0021 


0.0731 


43% 


36% 


S2/ST+7/V-3/V4 


GGGC 


96 


61 


0.0075 


0.0684 


42% 


36% 


S2/ST+7/V-1/V4 


GGCC 


95 


54 


0.0017 


0.0224 


64% 


55% 


S2/T1/V-4/V-1 


GTCC 


105 


62 


0.0028 


0.2782 


54% 


49% 


S2/T1/V-4/V4 


GTCC 


98 


57 


0.0141 


0.2564 


44% 


39% 


S2/T1/V-3/V-1 


GTGC 


99 


62 


0.0087 


0.2362 


42% 


37% 


S2/T1/V-3/V4 


GTGC 


99 


59 


0.0061 


0.2263 


42% 


37% 


S2/T1/V-1/V4 


GTCC 


90 


54 


0.0133 


0.0189 


71% 


62% 


S2/V-4/V-1/V4 


GCCC 


98 


61 


0.0166 


0.1085 


45% 


39% 


S2/V-3/V-1/V4 


GGCC 


100 


63 


0.0111 


0.1818 


42% 


37% 


ST+7/T1/V-4/V-1 


GTCC 


108 


68 


0.0062 


0.1381 


53% 


48% 


ST+7/T1/V-4/V4 


GTCC 


99 


62 


0.0318 


0.0851 


44% 


37% 


ST+7/T1 /V-3/V-1 


GTGC 


100 


68 


0.0222 


0.0906 


42% 


36% 


ST+7/T1 A/-3/V4 


GTGC 


100 


65 


0.0138 


0.1179 


42% 


36% 


ST+7/T1/V-1/V4 


GTCC 


97 


58 


0.0071 


0.0144 


64% 


55% 


ST+7/V-4/V-1/V4 


GCCC 


94 


74 


0.1296 


0.0304 


56% 


47% 


ST+7/V-3/V-1/V4 


GGCC 


96 


75 


0.0825 


0.0736 


54% 


47% 


TW-4/V-1/V4 


TCCC 


103 


64 


0.0102 


0.1670 


44% 


39% 


T1/V-3/V-1/V4 


TGCC 


105 


66 


0.0081 


0.1992 


42% 


37% 


T1/V-1/V1/V4 


TCAC 


96 


59 


0.0029 


0.0140 


72% 


62% 


S1/T1/V-1/V1/V4 


GTCAC 


95 


58 


0.0111 


0.0122 


72% 


62% 


S2/ST+7/T1/V-3/V-1 


GGTGC 


95 


59| 


0.0028 


0.1002 


42% 


36% 
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S2/ST+7/T1/V-3/V4 


GGTGC 


95 


58 


0.0062 


0.1055 


42% 


36% 


S2/ST+7/T1/V-1/V4 


GGTCC 


95 


52 


0.0031 


0.0208 


64% 


55% 


S2/ST+7/V-3/V-1/V4 


GGGCC 


96 


61 


0.0114 


0.0819 


42% 


36% 


S2/T1/V-3/V-1/V4 


GTGCC 


99 


59 


0.0126 


0.2204 


42% 


37% 


ST+7/T1/V-3/V-1/V4 


GTGCC 


100 


65 


0.0176 


0.0919 


42% 


36% 


TABLE 28 (CON'T) 


Asthma Yes/No 
UK 


SNP(s) 


Over- 
Transmitted 
Allele or 
Hapio 


T 


NT 


TDT 
p-value 


Case/ 
Cntl 

p-value 


Case 
freq 


Cntl 
freq 


L-1 


A 


23 


22 


1 .0000 


0.1380 


8% 


13% 


S1 


G 


30 


11 


0.0043 


0.0260 


95% 


89% 


S2 


G 


50 


32 


0.0598 


0.0041 


84% 


73% 


ST+4 


A 


75 


73 


0.9345 


0.0191 


59% 


48% 


ST+7 


G 


52 


28 


0.0097 


0.0535 


86% 


80% 


T1 


T 


26 


19 


0.3713 


0.1473 


91% 


87% 


V-4 


G 


59 


54 


0.7069 


0.7529 


27% 


25% 


V-3 


A 


65 


60 


0.7207 


0.7032 


40% 


38% 


V-1 


C 


36 


17 


0.0127 


0.0105 


94% 


86% 


V1 


T 


6 


5 


1 .0000 


0.7385 


1% 


2% 


V4 


C 


61 


40 


0.0460 


0.0328 


84% 


75% 


L-1/S1 


GG 


47 


26 


0.0110 


0.0031 


87% 


77% 


L-1/S2 


GG 


55 


32 


0.0073 


0.0060 


84% 


73% 


L-1 /ST+4 


GC 


71 


68 


0.9772 


0.0942 


41% 


49% 


L-1/ST+7 


GG 


67 


40 


0.0073 


0.0062 


78% 


67% 


L-1/T1 


GT 


24 


19 


0.1855 


0.2007 


91% 


87% 


L-1/V-4 


GC 


68 


65 


0.9154 


0.5705 


65% 


62% 


L-1 /V-3 


GA 


66 


64 


0.9916 


0.6691 


40% 


38% 


L-1/V-1 


GC 


53 


28 


0.0038 


0.0009 


85% 


74% 


L-1/V1 


GA 


26 


22 


0.8253 


0.0902 


90% 


85% 


L-1 A/4 


GC 


64 


43 


0.0866 


0.0045 


75% 


63% 


S1/S2 


GG 


53 


29 


0.0044 


0.0037 


84% 


73% 


S1/ST+4 


GA 


79 


68 


0.0599 


0.0005 


54% 


39% 


S1/ST+4 


GC 


73 


66 


0.0599 


0.0478 


41% 


51% 


S1/ST+7 


GG 


50 


25 


0.0037 


0.0666 


86% 


80% 


S1/T1 


GT 


52 


25 


0.0020 


0.0046 


87% 


76% 


S1/V-4 


GC 


69 


52 


0.0198 


0.3178 


68% 


64% 


S1A/-3 


GG 


74 


61 


0.0341 


0.4179 


55% 


51% 


S1/V-1 


GC 


36 


16 


0.0052 


0.0080 


94% 


86% 


S1/V1 


GA 


34 


16 


0.0104 


0.0242 


94% 


87% 


S1/V4 


GC 


60 


42 


0.0200 


0.0286 


83% 


75% 


S2/ST+4 


GA 


80 


60 


0.0774 


0.0001 


47% 


28% 


S2/ST+7 


GG 


73 


40 


0.0016 


0.0124 


76% 


66% 


S2/T1 


GT 


54 


28 


0.0070 


0.0030 


84% 


72% 
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S2/V-4 


GC 


86 


58 


0.0230 


0.0782 


57% 


49% 


S2/V-3 


GG 


81 


54 


0.0231 


0.0253 


46% 


36% 


S2/V-1 


GC 


56 


29 


0.0038 


0.0033 


84% 


73% 


S2/V1 


GA 


52 


29 


0.0130 


0.0032 


84% 


73% 


S2/V4 


GC 


66 


42 


0.0189 


0.0046 


73% 


61% 


ST+4/ST+7 


AG 


88 


65 


0.0307 


0.0008 


55% 


39% 


ST+4/T1 


AT 


71 


63 


0.7750 


0.0106 


51% 


38% 


ST+4/V-4 


AC 


74 


70 


0.9448 


0.0283 


59% 


48% 
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CCA 


73 


54 


0.01 67 


0.2373 


67% 


62% 


V-4/V-1/V4 


CCC 


79 


59 


0.0398 


0.0744 


56% 


48% 


l / >■ ft tA I\ t A 

V-4/V1 /V4 


CAC 


79 


57 


fx fXfxsxtx 

0.0998 


0.1 000 


57% 


49% 


V-3/V-1/V1 


GCA 


79 


62 


0.0658 


0.2831 


54% 


49% 


V-3/V-1/V4 


GCC 


81 


59 


0.0191 


0.2312 


54% 


48% 


V-3/V1/V4 


GAC 


81 


L 57 


0.0097 


0.1462 


55% 


47% 


V-1/V1/V4 


CAC 


62 


41 


0.0066 


0.0088 


83% 


73% 


A *i t^*A ft t A l\ # *4 

S1/T1/V-1/V1 


GTCA 


53 


27 


0.0040 


0.0008 


85% 


73% 


j f^TA ft t A T\ t A 

S1/T1/V-1/V4 


GTCC 


73 


43 


0.0031 


0.0027 


74% 


61% 


S1/T1/V1/V4 


GTAC 


73 


43 


fx fx rx^xrx 

0.0029 


0.0027 


74% 


62% 


f^rff ft § A ft /*4 ft / i« 

S1/V-1/V1/V4 


GCAC 


61 


41 


rx fx A ex fx 

0.0180 


0.0076| 


83% 


73% 


S2/ST+7/T1 /V-4 


GGTC 


79 


48 


0.0018 


fx rxA A*x 

0.0142 


58% 


46% 


/o-t* i "r it A n t 

S2/ST+7/T1 /V-3 


GGTG 


74 


45 


0.0022 


0.0193 


45% 


35% 


S2/ST+7/T1 /V-1 


GGTC 


68 


30 


O.UUUo 


0.U1 18 


76% 


66% 


S2/ST+7/T1 /V4 


GGTC 


74 


36 


0.0004 


0.0095 


66% 


54% 


S2/ST+7/V-4/V-1 


GGCC 


83 


52 


0.0030 


U.U2D0 


58% 


47% 


S2/S T +7/V-4/ V4 


GGCC 


76 


46 


0.0096 


0.0079 


49% 


35% 


S2/ST+7/V-3AM 


GGGC 


76 


47 


f\ fXfXAfX 

0.0040 


fx fx A cx rx 

0.0180 


46% 


35% 


S2/ST+-7/V-3/V4 


GGGC 


77 


47 


0.0063 


0.01 24 


46% 


34% 


ff» O /tf^T" ■ "T ft / A FX M A 

S2/ST+7/V-1 /V4 


GGCC 


75 


39 


0.0009 


0.01 15 


66% 


54% 


S2/T1 /V-4/V-1 


GTCC 


79 


48 


0.0027 


0.0265 


58% 


48% 


S2/T1/V-4/V4 


GTCC 


75 


44 


0.0192 


0.0412 


47% 


37% 


n*A ft / Oft/ «4 

S2/T1 /V-3/V-1 


GTGC 


75 


48 


0.0052 


0.0347 


45% 


35% 


S2/T1 A/-3A/4 


GTGC 


75 


45 


0.0179 


0.0371 


45% 


36% 


S2/T1/V-1/V4 


GTCC 


65 


40 


0.0328 


0.0029 


74% 


61% 


S2/V-4/V-1/V4 


GCCC 


77 


48 


0.0325 


0.0145 


49% 


37% 


*\ r\ t f\ t\ * A Ft M A 

S2/V-3/V-1/V4 


GGCC 


77 


49 


0.0280 


0.0463 


45% 


36% 


^■i- « -y rt- ^4 ft r ji ft / h 

ST+7/T 1 A/ -4/V-1 


GTCC 


82 


51 


0.0014 


0.0152 


58% 


47% 


ST+7/T1 /V-4/V4 


GTCC 


77 


46 


0.0042 


0.01 66 


47% 


36% 


-■- _ — r r-*- * ft # oft/ 

ST+7/T 1 /V-3/V-1 


GTGC 


78 


51 


0.0117 


0.0140 


45% 


34% 


ST+7/T 1 /V-3/V4 


GTGC 


78 


48 


0.0044 


0.0202 


45% 


35% 


ST+7/T1/V-1/V4 


GTCC 


78 


40 


/-\ fXf\r*% A 

0.0024 


0.0070 


67% 


54% 


OT t "7 ft / A f\ t A f\ I A 

ST+7/V-4/V-1 /V4 


GCCC 


78 


56 


0.0207 


0.0446 


57% 


47% 


ST+7/V-3/V-1 /V4 


GGCC 


81 


56 


fX f\ A A A 

0.01 1 1 


0.1 588 


54% 


47% 


T*«« ft* A T\ § A S\ tA 

T1/V-4/V-1/V4 


TCCC 


79 


48 


0.0072 


0.0214 


48% 


36% 


I 1/V-0/V-J/V4 


I bOL 


/» 


a a 
4» 


U.UUo/ 


U.Uoo/ 


4o /o 


OO70 


T1/V-1/V1/V4 


TCAC 


73 


43 


0.0004 


0.0038 


74% 


62% 


S1/T1/V-1/V1/V4 


GTCAC 


72 


43 


0.0008 


0.0021 


74% 


62% 


S2/ST+7/T1/V-3/V-1 


GGTGC 


75 


45 


0.0003 


0.0205 


45% 


35% 


S2/ST+7/T1/V-3/V4 


GGTGC 


7£ 


44 


0.0014 


0.0201 


45% 


35% 


S2/ST+7/T1AM/V4 


GGTCC 


74 


36 


0.0005 


0.0087 


66% 


54% 


S2/ST+7/V-3/V-1/V4 


GGGCC 


77 


47 


0.0135 


0.0134 


46% 


35% 


S2/T1/V-3/V-1/V4 


GTGCC 


75 


45 


0.0161 


0.0399 


45% 


36% 


ST+7/T1/V-3/V-1/V4 


GTGCC 


78, 


48 


0.0063 


0.0176 


45% 


34% 



-246- 



WO 02/083077 



PCT/US02/12063 



TABLE 28 (CON'T) 



Asthma Yes/No 
US 


SNP(s) 


Transmitted 
Allele or 
Haplo 


T 


NT 


TDT 
p-value 


Case/ 
Cntl 

p-value 


Case 
freq 


Cntl 
freq 


L-1 


G 


15 


8 


0.2100 


0.0116 


78% 


92% 


S1 


A 


9 


7 


0.8036 


0.7873 


8% 


10% 


S2 


G 


23 


10 


0.0351 


0.1571 


65% 


75% 


ST+4 


C 


20 


18 


0.8714 


0.5150 


37% 


43% 


ST+7 


A 


13 


7 


0.2632 


0.3413 


17% 


24% 


n 


T 


17 


8 


0.1078 


0.0030 


76% 


92% 


V-4 


C 


14 


10 


0.5413 


0.5795 


72% 


77% 


V-3 


A 


18 


16 


0.8642 


0.8684 


33% 


35% 


V-1 


A 


10 


7 


0.6291 


0.5262 


13% 


17% 


V1 


A 


2, 


1 


1 .0000 


0.7308 


96% 


94% 


V4 


G 


15 


12 


0.7011 


0.5583 


17% 


21% 


L-1/S1 


GG 


17 


12 


0.2469 


0.1892 


72% 


82% 


L-1/S2 


GG 


23 


14 


0.1994 


0.1562 


65% 


75% 


L-1 /ST+4 


GA 


28 


20 


0.2176 


0.3061 


41% 


49% 


L-1/ST+7 


GA 


15 


7 


0.1592 


0.2677 


17% 


24% 


L-1/T1 


GT 


16 


8 


0.1677 


0.0033 


76% 


92% 


L-1/V-4 


GC 


30 


16 


0.0763 


0.0071 


50% 


69% 


L-1 /V-3 


GG 


27 


19 


0.2133 


0.1469 


44% 


57% 


L-1 /V-1 


GC 


18 


14 


0.2369 


0.1660 


65% 


75% 


L-1/V-1 


GA 


10 


6 


0.2369 


0.5475 


13% 


17% 


L-1 A/1 


GA 


17 


8 


0.1204 


0.0486 


74% 


86% 


L-1/V4 


GC 


20 


15 


0.3011 


0.1752 


61% 


71% 


S1/S2 


GG 


21 


11 


0.0422 


0.1431 


65% 


75% 


S1/ST+4 


AA 


9 


7 


0.8773 


0.5743 


8% 


10% 


S1/ST+7 


AA 


9 


6 


0.2857 


0.9946 


10% 


10% 


S1/ST+7 


GA 


7 


3 


0.2857 


0.1514 


7% 


14% 


S1/T1 


GT 


20 


13 


0.2095 


0.1095 


70% 


82% 


S1/V-4 


GC 


19 


16 


0.6136 


0.7811 


64% 


67% 


S1/V-3 


AG 


8 


6 


0.9435 


0.6472 


8% 


10% 


S1/V-1 


AA 


9 


5 


0.5677 


1.0000 


10% 


10% 


S1/V1 


AA 


9 


7 


0.8527 


0.5640 


8% 


10% 


S1/V4 


AG 


9 


6 


0.7652 


0.9978 


10% 


10% 


S2/ST+4 


GA 


26 


17 


0.3120 


0.4193 


30% 


36% 


S2/ST+7 


GG 


22 


16 


0.0248 


0.3353 


61% 


68% 


S2/ST+7 


CA 


11 


7 


0.0248 


0.4560 


13% 


17% 


S2/ST+7 


GA 


7 


3 


0.0248 


0.4883 


4% 


7% 


S2/T1 


GT 


24 


14 


0.2315 


0.1083 


63% 


75% 


S2/V-4 


GC 


28 


15 


0.0684 


0.0304 


37% 


52% 


S2/V-3 


GG 


24 


16 


0.1690 


0.2245 


31% 


40% 


S2/V-1 


GC 


21 


11 


0.0106 


0.1648 


65% 


75% 


S2/V1 


GA 


22 


11 


0.0624 


0.1645 


65% 


75% 
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S2/V4 


GC 


24 


12 


0.0165 


0.8376 


63% 


64% 


O ■ ■ *W O 1 • J 


AA 


10 


g 


0.3518 


0.5418 


10% 


14% 


O 1 ~*r/0 IT/ 


CA 


g 


3 


0 3518 


\J**T<J\J'J 


7% 


10% 


ST+4/T1 


AT 


30 


19 


0.1 548 


0.1835 


39% 


49% 


O 1 • t/ V "f 


CC 


13 


g 


0.2718 


0.0788 


10% 


20% 


ST+4/V-3 


CA 


20 


19 


1 0000 


0.841 5 


33% 


35% 


O 1 T*f/ V 1 


A A 
MM 


Q 

y 




0 7672 


0 61 1 4 


10% 


J O /0 


ST+4/V1 


AA 


20 


1 8 


0 9025 


0 4532 


64% 


57% 


O 1 V^f 


AP 


Q 

y 


D 


0 8877 


0 9129 


*tft% 

IW/O 


I U /O 




AT 
M I 


-1 4 

1 H 


7 
f 


U. I ODD 




1 1 /o 


/o 




MO 


I ^ 


Q 


0 2223 




14% 


21 % 


O 1 T//VO 




I Z 


R 

Q 


u.o^tuy 


0 3257 


1 O /0 


i y /o 


QT+7/V-1 

O 1 T// V- 1 


AA 


1 0 


g 


U.OOJO 




13% 


1 7% 


ST+7/V-1 


AC 




3 




0.3852 


4% 


7% 


ST+7/V1 


AA 


14 


7 


0.2577 


0.4406 


13% 


18% 


ST+7/V4 


AP 
MVj 


10 


7 


0 4163 




1 5% 


10% 


ST+7A/4 


AC 


7 


3 


0.4163 


0.0120 


2% 


14% 


T1/V-4 




30 


16 


0.0612 




48% 


69% 


T1A/-3 


TG 


29 


18 


U. IUJU 


0 D8Q3 


43% 


57% 


T1/V-1 


TC 


21 


15 


0.21 53 


0.1267 


63% 


75% 


T1/V1 


TA 


19 


g 




0.0223 


72% 


86% 


T1A/4 


TC 


24 


1 6 


0 1748 


0 1403 


59% 


71 % 


V-4/V-3 


CA 


10 


4 


0.3196 


0 1Q^fi 


6% 


12% 


V-4A/-1 


CC 


17 


15 


0.61 56 


KJ.O <?\J\J 


59% 


60% 


V-4A/-1 




10 


7 


0 6156 


0.4901 


13% 


17% 


V-4/V1 


CA 


16 


10 


0 4921 


0.7697 


69% 


71 % 


\/ Al\fA 




\ o 


I u 


0 6226 


0 7499 


I / /o 


i y /o 


V-o/V" 1 


PA 
oM 


Q 

y 


7 


0 9452 


0 4930 


i o /o 


I / /o 


V-3/V1 


GA 


18 


16 


0.8972 




63% 


59% 


V-3/V4 


GG 


•1 n 


7 


0 9232 


0 5752 


14% 


1 0% 


V-1/V1 


AA 


g 


5 


U. JUOi? 


0 7650 


9% 


11% 


V-1A/4 


AP 

MVJ 


1 u 


7 


0 8532 


0.5024 


13% 


1 0% 


V1A/4 


AG 


14 


12 


0.8827 


0.2048 


13% 


21% 


l— I / O 1 / O ^ 


GGG 


21 


12 


\j . i jjj 


0.161 5 


65% 


75% 


L» I/O I/O 1 • ■* 


GGA 


23 


17 


0.3377 


0.3922 


33% 


39% 


1 — " I/O I/O 1 * I 


GAA 


g 


5 


0 2884 


0.7508 


9% 


1 0% 


1— 1 / O I/O 1 « / 


GGA 


7 


3 


0.2884 


0.1 709 


7% 


14% 


L-1/S1/T1 


GGT 


1fi 


I o 


0 374C 


0.1225 


70% 


82% 


l- I/O I / v 


GGC 


26 


17 


0.2542 


0.0264 


42% 


59% 


[_"* l/o 1/ V~o 


GGG 


22 


17 


0 3697 


0.2224 


36% 


47% 


L-1/S1/V-1 


GGC 


18 


13 


0.1949 


0.1828 


65% 


75% 


L-1/S1/V-1 


GAA 


9 


4 


0.1949 


0.9559 


10% 


10% 


L-1/S1/V1 


GGA 


19 


13 


0.2846 


0.3498 


69% 


76% 


L-1/S1/V4 


GGC 


20 


14 


0.2726 


0.2516 


62% 


71% 


L-1/S2/ST+4 


GGA 


23 


17 


0.4714 


0.4109 


30% 


36% 


L-1/S2/ST+7 


GGG 


22 


18 


0.0512 


0.3738 


61% 


68% 


L-1/S2/ST+7 


GCA 


11 


6 


0.0512 


0.5492 


13% 


17% 


L-1/S2/ST+7 


GGA 


7 


3 


0.0512 


0.4350 


4% 


7% 


L-1/S2/T1 


GGT 


22 


14 


0.3930 


0.1339 


63% 


75% 
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I A J«><0 t\ M A 

L-1/S2/V-4 


GGC 


26 


15 


0.1452 


i~\ Anno 

0.0298 


37% 


52% 


L-1/S2/V-3 


GGG 


21 


1 6 


0.2228 


0.2277 


31% 


40% 


L-1/S2/V-1 


GGC 


21 


13 


0.0206 


n A A Cl A 

0.1484 


65% 


75% 


L-1/S2A/1 


GGA 


22 


13 


0.1990 


0.1 581 


65% 


75% 


L-1 /S2 A/4 


GGC 


22 


13 


0.0358 


0.7429 


62% 


64% 


L-1 /ST+4/ST+7 


GAG 


22 


18 


0.2357 


0.5054 


30% 


35% 


L-1/ST+4/ST+7 


GAA 


10 


5 


0.2357 


0.5672 


10% 


14% 


L-1/ST+4/ST+7 


GCA 


8 


4 


0.2357 


0.4259 


6% 


10% 


L-1/ST+4/T1 


GAT 


27 


18 


0.3306 


0.1 871 


39% 


49% 


L-1/ST+4/V-4 


GAC 


26 


18 


0.2263 


0.2453 


40% 


49% 


L-1/ST+4A/-4 


GCC 


12 


6 


0.2263 


0.0924 


10% 


20% 


L-1/ST+4/V-3 


GAG 


26 


18 


0.3009 


0.2456 


41% 


49% 


L-1/ST+4A/-1 


GAC 


23 


18 


0.3345 


0.4090 


30% 


36% 


L-1/ST+4A/-1 


GAA 


9 


4 


0.3345 


0.6342 


10% 


13% 


L-1/ST+4A/1 


GAA 


27 


1§ 


0.291 1 


0.31 94i 


41% 


49% 


L-1/ST+4A/4 


GAC 


23 


18 


0.4359 


0.3199 


31% 


39% 


L-1/ST+4A/4 


GAG 


9 


5 


0.4359 


0.8722 


9% 


10% 


L-1/ST+7/T1 


GAT 


14 


6 


0.2291 


0.2644 


17% 


24% 


L-1/ST+7/V-4 


GGC 


23 


17 


0.1444 


0.0474 


36% 


49% 


L-1/ST+7/V-4 


GAC 


12 


5 


0.1444 


f\ OO A A 

0.3844 


15% 


20% 


L-1/ST+7/V-3 


GGG 


20 


17 


0.2350 


0.3482 


31% 


38% 


L-1/ST+7A/-3 


GAG 


12 


5 


0.2350 


0.3831 


13% 


19% 


L-1/ST+7/V-1 


GAA 


10 


5 


0.2786 


0.5280 


13% 


17% 


L-1/ST+7A/1 


GAA 


14 


6 


0.2436 


0.4738 


13% 


18% 


L-1/ST+7/V4 


GAG 


10 


6 


0.4151 


0.4588 


15% 


10% 


L-1/ST+7A/4 


GAC 


7 


3 


0.4151 


0.01 12 


2% 


14% 


L-1/T1A/-4 


GTC 


28 


15 


0.0966 


0.0056 


48% 


69% 


L-1/T1 A/-3 


GTG 


26 


17 


0.2203 


0.0905 


43% 


57% 


L-1/T1/V-1 


GTC 


19 


15 


0.3430 


0.1255 


63% 


75% 


L-1/T1/V-1 


GTA 


10 


6 


0.3430 


0.5480 


13% 


17% 


L-1 /T1 A/1 


GTA 


18 


9 


0.1789 


0.021 1 


72% 


86% 


L-1/T1/V4 


GTC 


21 


16 


0.4009 


0.1375 


59% 


71% 


L-1 A/-4A/-3 


GCG 


27 


18 


0.1983 


0.1259 


44% 


57% 


1 A i\ § A f\ r A 

L-1 / V -4/ V-1 


GCC 


25 


17 


0.2524 


0.0320 


37% 


52% 


L-1/V-4/V1 


GCA 


27 


14 


0.1 32£ 


0.0215 


46% 


63% 


■ a T\ f A T\ 1 A 

L-1A/-4/V4 


GCC 


23 


16 


0.2898 


0.0373 


33% 


50% 


L-1 A/-4/V4 


GCG 


14 


9 


0.2898 


0.7018 


17% 


19% 


L-1/V-3/V-1 


GGC 


22 


17 


0.3747 


0.2308 


31% 


40% 


L-1/V-3A/1 


GGA 


25 


16 


0.2682 


0.1738 


41% 


51% 


L-1 A/-3/V4 


GGC 


22 


17 


0.4343 


0.0785 


32% 


46% 


l_- 1/ V-O/ V* 




1 U 


c 
O 


U.4o4o 


n 7APC7 


1 00/ 


a no/ 


L-1AMA/1 


GCA 


18 


14 


0.2423 


0.2204 


66% 


75% 


L-1A/-1A/1 


GAA 


9 


4 


0.2423 


0.6250 


8% 


11% 


L-1A/-1 A/4 


GCC 


19 


15 


0.3549 


0.7760 


62% 


64% 


L-1 AM A/4 


GAG 


10 


6 


0.3549 


0.5564 


13% 


10% 


L-1A/1/V4 


GAC 


20 


15 


0.3308 


0.7936 


63% 


65% 


L-1 A/1 A/4 


GAG 


13 


9 


0.3308 


0.1248 


11% 


21% 


S1/S2/ST+4 


GGA 


24 


15 


0.1432 


0.3823 


30% 


36% 


S1/S2/ST+7 


GGG 


21 


16 


0.0717 


0.2756 


60% 


68% 
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GGA 


7 


2 


0.071 7 


0.6466 


OYo 


70/ 
I/O 


O H IOO ITd 

51/SZ/T1 
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6% 


12% 


V-4/V-3/V1 


CGA 


17 


15 


0.6246 


0.6591 


63% 


59% 


V-4/V-3/V1 


CAA 


8 


4 


0.6246 


0.1913 


6% 


12% 


V-4/V-3/V4 


CGG 


10 


7 


0.7396 


0.5137 


13% 


10% 


V-4/V-3/V4 


CAG 


7 


4 


0.7396 


0.1026 


- 3% 


10% 


V-4AMA/1 


CCA 


17 


15 


0.6155 


0.8907 


59% 


60% 


V-4/V-1/V1 


CAA 


9 


5 


0.6155 


0.7730 


9% 


11% 


V-4/V-1/V4 


CAG 


10 


7 


0.7073 


0.5108 


13% 


10% 


V-4A/-1A/4 


CCG 


7 


4 


0.7073 


0.3859 


4% 


8% 


V-4A/1 A/4 


CAG 


14 


8 


0.6178 


0.3492 


13% 


19% 


V-3A/-1/V1 


GAA 


8 


5 


0.8524 


0.7741 


9% 


11% 


V-3A/-1/V4 


GAG 


10 


7 


0.9237 


0.5208 


13% 


10% 


V-3A/1/V4 


GAG 


9 


5 


0.8664 


0.9426 


10% 


10% 


V-1A/1/V4 


AAG 


9 


5 


0.7290 


0.8938 


9% 


10% 


S1/T1/V-1A/1 


GTCA 


21 


14 


0.1996 


0.1265 


63% 


75% 




ATAA 


9 


5 


0.1 99G 


0.7387 


9% 


10% 


S1/T1A/-1A/4 


GTCC 


23 


15 


0.2712 


0.6250 


60% 


64% 


S1/T1AM/V4 


ATAG 


9 


5 


0.2712 


0.9606 


10% 


10% 


S1/T1A/1/V4 


GTAC 


24 


15 


0.2788 


0.5740 


60% 


65% 


S1/T1A/1/V4 


ATAG 


9 


6 


0.2788 


0.7658 


8% 


10% 


S1/V-1/V1A/4 


AAAG 


9 


5 


0.7360 


0.9308 


9% 


10% 


S2/-ST+7/T1/V-4 


GGTC 


23 


14 


0.0638 


0.0391 


35% 


49% 


S2/ST+7/T1/V-4 


CATC 


10 


7 


0.0638 


0.5190 


13% 


17% 


S2/ST+7/T1A/-4 


GATG 


5 


1 


0.0638 


0.9031 


4% 


4% 
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S2/ST+7/T1 A/-3 


GGTG 


19 


14 


0.0770 


0.2353 


30% 


38% 


S2/ST+7/T1/V-3 


CATG 


10 


6 


0.0770 


0.5080 


13% 


17% 


S2/ST+7/T1/V-3 


GATA 


5 


1 


0.0770 


0.8427 


4% 


5% 


S2/ST+7/T1 AM 


GGTC 


24 


18 


0.0650 


0.2358 


59% 


68% 


S2/ST+7/T1AM 


CATA 


10 


6 


0.0650 


0.4659 


13% 


17% 


S2/ST+7/T1A/-1 


GATC 


6 


2 


0.0650 


0.4282 


4% 


7% 


S2/ST+7/T1/V4 


GGTC 


21 


16 


0.0855 


0.7703 


59% 


57% 


S2/ST+7/T1/V4 


CATG 


10 


7 


0.0855 


0.5058 


13% 


10% 


S2/ST+7/T1/V4 


GATC 


7 


2 


0.0855 


0.1434 


2% 


7% 


S2/ST+7A/-4A/-1 


GGCC 


23 


14 


0.0329 


0.0722 


37% 


49% 


S2/ST+7A/-4A/-1 


CACA 


10 


6 


0.0329 


0.4878 


13% 


17% 


S2/ST+7A/-4A/-1 


GAGC 


5 


1 


0.0329 


0.9092 


4% 


4% 


S2/ST+7A/-4A/4 


GGCC 


19 


13 


0.0852 


0.9322 


37% 


38% 


S2/ST+7/V-4/V4 


CACG 


10 


7 


0.0852 


0.5055 


13% 


9% 


S2/ST+7/V-4/V4 


GAGC 


5 


1 


0.0852 


0.8599 


2% 


3% 


S2/ST+ 7/V-3A/-1 


GGGC 


19 


14 


0.0622 


0.3675 


31% 


38% 


S2/ST+7/V-3/V-1 


CAGA 


10 


6 


0.0622 


0.4791 


13% 


17% 


S2/ST+7A/-3A/-1 


GAAC 


5 


1 


0.0622 


0.8643 


4% 


5% 


S2/ST+ 7/V-3A/4 


GGGC 


19 


14 


0.0897 


0.5395 


33% 


38% 


S2/ST+ 7 A/ -3 A/4 


CAGG 


10 


6 


0.0897 


0.5165 


13% 


10% 


S2/ST+7A/-3A/4 


GAAC 


5 


1 


0.0897 


0.5305 


2% 


5% 


S2/ST+7A/-1 A/4 


GGCC 


20 


15 


0.0659 


0.5887 


61% 


57% 


S2/ST+7A/-1 A/4 


CAAG 


10 


6 


0.0659 


0.5003 


13% 


10% 


S2/ST+7A/-1 A/4 


GACC 


6 


2 


0.0659 


0.1538 


2% 


7% 


S2/T1A/-4A/-1 


GTCC 


26 


14 


0.0468 


0.0178 


35% 


52% 


S2/T1A/-4A/4 


GTCC 


23 


13 


0.1053 


0.1259 


31% 


44% 


S2/T1/V-3A/-1 


GTGC 


24 


14 


0.0533 


0.1595 


30% 


40% 


S2/T1/V-3A/4 


GTGC 


24 


14 


0.0798 


0.1451 


30% 


40% 


S2/T1A/-1A/4 


GTCC 


25 


14 


0.0363 


0.5912 


60% 


64% 


S2A/-4A/-1/V4 


GCCC 


21 


13 


0.0637 


0.1917 


33% 


44% 


S2A/-4A/-1A/4 


CCAG 


10 


6 


0.0637 


0.5145 


13% 


10% 


S2A/-4A/-1A/4 


GCCG 


7 


4 


0.0637 


0.3650 


4% 


9% 


S2A/-3A/-1/V4 


GGCC 


23 


14 


0.0728 


0.2225 


31% 


40% 


S2A/-3A/-1A/4 


CGAG 


10 


6 


0.0728 


0.5234 


13% 


10% 


ST+7/T1 A/-4AM 


GTCC 


26 


17 


0.1396 


0.0388 


35% 


49% 


ST+7/T1A/-4A/-1 


ATCA 


10 


6 


0.1396 


0.4907 


13% 


17% 


ST+7/T1A/-4A/-1 


ATGC 


5 


2 


0.1396 


0.9054 


4% 


4% 


ST+7/T1 A/-4A/4 


GTCC 


22 


16 


0.3279 


0.2892 


33% 


42% 


ST+7/T1/V-4A/4 


ATCG 


10 


7 


0.3279 


0.4104 


15% 


10% 


ST+7/T 1 A/-4/V4 


ATGC 


5 


2 


0.3279 


0.5611 


2% 


5% 


b i +7/ 1 j/V-o/v-1 


GTGC 


22 


17 


0.2605 


0.2445 


30% 


38% 


ST+7/T1A/-3/V-1 


ATGA 


10 


6 


0.2605 


0.4780 


13% 


17% 


ST+7/T1A/-3A/-1 


ATAC 


5 


2 


0.2605 


0.8436 


4% 


5% 


ST+7/T1A/-3A/4 


GTGC 


22 


17 


0.2859 


0.3226 


31% 


39% 


ST+7/T1A/-3A/4 


ATGG 


10 


6 


0.2859 


0.5352 


13% 


10% 


ST+7/T1A/-3A/4 


ATAC 


5 


2 


0.2859 


0.4386 


2% 


6% 


ST+7/T1/V-1A/4 


ATAG 


10 


6 


0.4356 


0.5284 


13% 


10% 


ST+7/T1A/-1A/4 


ATCC 


6 


3 


0.4356 


0.1787 


2% 


7% 


ST+7A/-4AMA/4 


ACAG 


10 


6 


0.4707 


0.5224 


13% 


10% 
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ST+7/V-4/V-1 /V4 


GCCG 


7 


4 


0.4707 


0.316$ 


2% 


7% 


ST+7/V-4/V-1 /V 4 


AGCC 


5 


2 


0.4707 


0.5401 


2% 


4% 


ST+7/V-3/V-1 /V4 


AGAG 


10 


6 


0.5730 


0.5053 


13% 


10% 


ST+7/V-3/V-1 A/4 


AACC 


5 


2 


0.5730 


0.4917 


2% 


5% 


T1/V-4/V-1/V4 


TCCC 


24 


16 


0.2413 


0.1299 


31% 


44% 


T1/V-4/V-1/V4 


TCAG 


10 


7 


0.2413 


0.5179 


13% 


10% 


T1/V-4/V-1/V4 


TCCG 


7 


4 


0.2413 


0.3666 


4% 


9% 


T1/V-3/V-1/V4 


TGCC 


26 


17 


0.2414 


0.1658 


30% 


40% 


T1/V-3/V-1/V4 


TGAG 


10 


7 


0.2414 


0.5361 


13% 


10% 


T1/V-1/V1/V4 


TCAC 


23 


16 


0.2936 


0.6233 


60% 


64% 


T1/V-1/V1/V4 


TAAG 


9 


5 


0.2936 


0.8825 


9% 


10% 


S1/T1/V-1/V1/V4 


GTCAC 


23 


15 


0.2753 


0.6149 


60% 


64% 


S1/T1/V-1/V1/V4 


ATAAG 


9 


5 


0.2753 


0.8705 


9% 


10% 


S2/ST+7/T1/V-3/V-1 


GGTGC 


20 


14 


0.0675 


0.2340 


30% 


38% 


S2/ST+7/T1/V-3/V4 


GGTGC 


20 


14 


0.1049 


0.3360 


31% 


38% 


S2/ST+7nT1/V-1/V4 


GGTCC 


21 


16 


0.1071 


0.7706 


59% 


57% 


S2/ST+7/V-3/V-1/V4 


GGGCC 


19 


14 


0.0934 


0.4946 


33% 


38% 


S2/T1/V-3/V-1/V4 


GTGCC 


24 


14 


0.0832 


0.1609 


30% 


40% 


ST+7/T1/V-3/V-1/V4 


GTGCC 


22 


17 


0.2899 


0.3417 


31% 


38% 



TABLE 29 



BHR 

Combined 


SNP(s) 


Over- 
Transmitted 
Allele or 
Hapio 


T 


NT 


TDT 
p -value 


Case/ 
CntI 

p-value 


Case 
freq 


CntI 
freq 


L-1 


G 


23 


13 


0.1325 


0.8722 


90% 


89% 


S1 


G 


22 


11 


0.0801 


0.2251 


94% 


89% 


S2 


G 


49 


23 


0.0029 


0.2009 


80% 


74% 


ST+4 


A 


52 


50 


0.9212 


0.1521 


59% 


51% 


ST+7 


G 


32 


21 


0.1690 


0.3199 


83% 


78% 


T1 


T 


27 


13 


0.0385 


0.8750 


88% 


89% 


V-4 


G 


43 


40 


0.8264 


0.5602 


27% 


24% 


V-3 


A 


48 


46 


0.9179 


0.6016 


40% 


37% 


V-1 


C 


27 


16 


0.1263 


0.1413 


91% 


85% 


V1 


A 


4 


4 


1 .0000 


0.7758 


98% 


96% 


V4 


C 


43 


28 


0.0959 


0.4009 


80% 


77% 


L-1/S1 


GG 


38 


18 


0.0143 


0.1048 


85% 


78% 


L-1/S2 


GG 


48 


21 


0.0014 


0.2486 


79% 


74% 


L-1/ST+4 


GA 


53 


41 


0.1541 


0.2543 


48% 


42% 


L-1/ST+7 


GG 


50 


26 


0.0134 


0.2022 


73% 


67% 


L-1/T1 


GT 


25 


13 


0.0319 


0.6689 


87% 


89% 


L-1/V-4 


GC 


56 


43 


0.1621 


0.6756 


63% 


65% 


L-1 /V-3 


GG 


52 


40 


0.1632 


0.7599 


50% 


52% 


L-1 /V-1 


GC 


44 


21 


0.0077 


0.0816 


82% 


74% 


L-1/V1 


GA 


26 


15 


l_ 0.1266 


0.5742 


87% 


85% 


L-1/V4 


GC 


47 


23 


0.0086 


0.4292 


70% 


66% 
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26 
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ZO 
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0.1 638 


U.I 1 ol 


91 % 


86% 


o1/V4 


/— 'O 
oO 


/IO 

4Z 


OQ 
ZO 


ft A A C7 

U.I 1oY 


r» OQon 
U.ZoZU 


ono/ 


/b% 


o2/b 1 +4 


O A 

bA 


64 


O/l 

o4 


0.001 3 


0.031 5 


4Zto 


31 % 


COJCTj.7 
OZ/O I T / 


oo 
oo 


59 


27 


U . UUUo 


O oh go 
U.O IOO 


"7-10/ 


67% 
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73% 
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69 


37 
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bO 
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44 
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ST+7/T1 


GT 


56 


26 
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U.oOoo 
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67% 
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GC 


59 


A*i 
4o 
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63% 


59% 
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oo 


bZ 


44 
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coo/ 
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GC 


37 


26 
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82% 


78% 
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GA 


34 


OC 

zo 
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83% 
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oo 


44 


31 
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CC 


56 


43 
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oo 


oy 


44 
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o I /o 


48% 


V-3/V1 


GA 


49 


42 


0.6640 


0.7855 


58% 


59% 


V-3A/4 


GC 


58 


42 


0.1799 


0.9090 


50% 


51% 


V-1/V1 


CA 


26 


18 


0.3190 


0.1313 


91% 


85% 


V-1 A/4 


CC 


43 


29 


0.1529 


0.0596 


80% 


73% 


V1/V4 


AC 


43 


28 


0.1850 


0.1074 


80% 


74% 


L-1/S1/S2 


GGG 


42 


21 


0.0177 


0.2379, 


79% 


74% 


L-1/S1/ST+4 


GGA 


51 


30 


0.0488 


0.0296 


43% 


32% 


L-1/S1/ST+7 


GGG 


45 


26 


0.0555 


0.19761 


73% 


67% 
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1 A /OH IT A 

L-1/S1/T1 


GGT 


40 


1 7 


0.0U23 


0.2745 


83% 


78% 


1 Hi (CH ft/ A 

L-1/51/V-4 


GGC 


57 


36 


0.0563 


0.4793 


58% 


54% 


■ A /o A n. t n 

L-1 /SI /V-3 


GGG 


51 


31 


0.0504 


0.4982 


45% 


42% 


1 A /C A l\t A 

L-1/51/V-1 


GGC 


42 


21 
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0.0788 


82% 


74% 


I A /OH /\ /H 
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GGA 


40 


21 


0.0376 
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83% 


75% 
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46 


23 
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72% 


66% 
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54 


28 


0.0067 


0.0362 


41% 


31% 
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53 


2§ 
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0.4921 


70% 


67% 
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46 


19 
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0.41 81 


77% 


74% 
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61 


34 
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68 


33 


0.0005 


0.6143 


52% 


49% 


OOiTil ft/ 4 ft/4 

o2/ 1 1/V-4/V4 


GTCC 


60 


27 


0.0024 


0.6594 


42% 


39% 


oo/t»i ft/ on/ 4 


GTGC 


61 


27 


0.0005 


0.6850 


39% 


37% 


O0/T4ft/ Oft/4 

o2/ 1 1/V-3/V4 


GTGC 


6G 


2G 


u.uuiy 


0.6076 


40% 


37% 


OOiT4 ft/ 4 ft/4 

o2/T1/V-1/V4 


GTCC 


54 


24 


u.uuiy 


U.lobo 


69% 


62% 


52/ V-4/ V-1 / V4 


GCCC 


58 


30 


0.01 32 


0.3435 


44% 


39% 


CIA/ Oft/ 4 ft/4 

o2/V-o/ V-1 / V4 




CO 

OC 


OQ 


f\ c\"t Art 
U.U1 4U 


0.4526 


4-10/ 
4 1 7b 


070/ 

Of To 


CTx7/T«l ft/ 4ft/ 4 

o r+7/Tl/V-4/V-1 


GTCC 


69 


35 


u.uun u 


0.4073 


52% 


48% 


CTi7/T<l A/ 4n/4 

ST+7/T1 / V-4/ V4 


GTCC 


59 


28 


u.uuuy 


0.4429 


41 % 


37% 


CTj.7/T'I ft/ Oft/ 4 

5T+7/T1 / V-3/ V-1 


GTGC 


62 


29 


0.0013 


0.4143 


40% 


36% 


Ol + i; I i/v "o/ v*f 




O I 


Zo 


n nnna 




*+U /o 


OO /o 


ST+7/T1/V-1/V4 


GTCC 


55 


23 


0.0005 


0.2653 


61% 


55% 


ST+7/V-4A/-1/V4 


GCCC 


54 


42 


0.3202 


0.2691 


53% 


47% 


ST+7A/-3/V-1/V4 


GGCC 


57 


41 


0.2098 


0.4099 


51% 


47% 


T1/V-4/V-1/V4 


TCCC 


62 


29 


0.0005 


0.5872 


42% 


39% 


T1/V-3/V-1/V4 


TGCC 


63 


28 


0.0010 


0.6404 


40% 


37% 


T1/V-1/V1/V4 


TCAC 


57 


25 


0.0005 


0.1026 


70% 


62% 


S1/T1/V-1/V1/V4 


GTCAC 


56 


25 


0.0020 


0.1001 


70% 


62% 


S2/ST+7/T1/V-3/V-1 


GGTGC 


59 


26 


0.00013 


0.4598 


40% 


36% 
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S2/ST+7/T1/V-3/V4 


GGTGC 


58 


26 


0.0030 


0.4615 


40% 


36% 


S2/ST+7/T1/V-1/V4 


GGTCC 


55 


21 


0.00013 


0.3216 


60% 


55% 


S2/ST+7/V-3/V-1/V4 


GGGCC 


56 


28 


0.0103 


0.2865 


41% 


36% 


S2/T1/V-3/V-1/V4 


GTGCC 


60 


26 


0.0030 


0.6814 


40% 


37% 


ST+7/T1/V-3/V-1/V4 


GTGCC 


61 


28 


0.00024 


0.4308 


40% 


36% 



TABLE 29 (CONT) 



BHR 
UK 




Over- 
i r«iismiirea 
Allele or 
Haplo 


T 


NT 


p-value 


Case/ 
p-value 


Case? 

freq 

— 3 


freq 


L-1 


G 


13 


11 


0.8388 


0.0899 


94% 


87% 


S1 


G 


19 


5 


0.0066 


0.0603 


96% 


89% 


S2 


G 


37 


16 


0.0055 


0.0038 


87% 


73% 


ST+4 


A 


43 


39 


0.7407 


0.1538 


57% 


48% 


ST+7 


G 


29 


13 


0.0195 


0.2294 


86% 


80% 


T1 


T 


16 


10 


0.3269 


0.1041 


93% 


87% 


V-4 


C 


34 


34 


1 .0000 


0.7889 


73% 


75% 


V-3 


G 


38 


36 


0.9076 


0.5461 


58% 


62% 


V-1 


C 


23 


9 


0.0201 


0.0454 


94% 


86% 


V1 


A 


3 


3 


1 .0000 


1 .0000 


98% 


98% 


V4 


c 


38 


21 


0.0363 


0.2122 


82% 


75% 


L-1/S1 


GG 


28 


12 


0.01 16 


0.0036 


91% 


77% 


L-1/S2 


GG 


37 


14 


0.0007 


0.0071 


86% 


73% 


L-1 /ST+4 


GA 


40 


34 


0.6901 


0.0475 


51% 


38% 


L-1 /ST+7 


GG 


40 


17 


0.0023 


0.01 1 1 


81% 


67% 


L-1/T1 


GT 


15 


10 


0.1504 


0.2233 


92% 


87% 


L-W-4 


GC 


42 


35 


0.6696 


0.4106 


67% 


62% 


L-1/V-3 


GG 


39 


32 


0.6574 


0.6144 


52% 


49% 


L-1 AM 


GC 


33 


14 


0.0054 


0.0018 


89% 


74% 


L-1/V1 


GA 


16 


12 


0.6763 


0.1050 


92% 


85% 


L-1/V4 


GC 


36 


18 


0.0327 


0.0266 


76% 


63% 


S1/S2 


GG 


37 


15 


0.0020 


0.0042 


87% 


73% 


S1/ST+4 


GA 


49 


36 


0.0778 


0.0171 


53% 


39% 


S1/ST+7 


GG 


30 


13 


0.0076 


0.3019 


85% 


80% 


S1/T1 


GT 


35 


11 


0.00015 


0.0057 


90% 


76% 


S1/V-4 


GC 


43 


27 


0.0188 


0.3569 


69% 


64% 


S1/V-3 


GG 


49 


31 


0.0251 


0.6440 


54% 


51% 


S1/V-1 


GC 


22 


8 


0.0091 


0.0359 


94% 


86% 


S1/V1 


GA 


21 


8 


0.0133 


0.0746 


94% 


87% 


S1/V4 


GC 


37 


21 


0.0168 


0.1158 


82% 


75% 


S2/ST+4 


GA 


54 


28 


0.0035 


0.0017 


48% 


28% 


S2/ST+7 


GG 


47 


17 


0.00013 


0.0250 


78% 


66% 


S2/T1 


GT 


40 


13 


0.00017 


0.0097 


86% 


72% 


S2/V-4 


GC 


56 


28 


0.0023 


0.0488 


60% 


49% 


S2/V-3 


GG 


53 


25 


0.0015 


0.0452 


47% 


36% 
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S2/V-1 


GC 


39 


1 5 


0.0014 


0.0035 


87% 


73% 


S2/V1 


GA 


37 


1 5 


0.0020 


0.0062 


87% 


73% 


S2/V4 


GC 


42 


20 


0.0025 


0.0106 


75% 


61% 


ST+4/ST+7 


AG 


51 


34 


0.0533 


0.01 1 8 


55% 


39% 


ST+4/T1 


AT 


45 


31 


0.1929 


0.0545 


50% 


38% 


ST+4/V-4 


AC 


44 


35 


0.6758 


0.141 9 


58% 


48% 


ST+4/V-3 


AG 


47 


33 


0.1039 


0.1704 


57% 


48% 


ST+4/V-1 


AC 


52 


35 


0.0916 


0.0076 


55% 


39% 


ST+4/V1 


AA 


42 


38 


0.8061 


0.1664 


57% 


48% 


ST+4/V4 


AC 


49 


36 


0.1756 


0.0199 


53% 


37% 


ST+7/T1 


GT 


45 


17 


0.0003 


0.0146 


80% 


66% 


ST+7/V-4 


GC 


51 


29 


0.0054 


0.2473 


67% 


61% 


ST+7/V-3 


GG 


55 


31 


0.0170 


0.2386 


55% 


48% 


ST+7AM 


GC 


33 


16 


0.0223 


0.2882 


85% 


80% 


ST+7/V1 


GA 


31 


16 


0.0321 


0.1785 


86% 


80% 


ST+7/V4 


GC 


41 


23 


0.0086 


0.3028 


72% 


67% 


T1AA-4 


TC 


43 


33 


0.3496 


0.4354 


66% 


62% 


T1 /V-3 


TG 


44 


29 


0.1472 


0.6521 


52% 


49% 


T1/V-1 


TC 


41 


1 7 


0.0020 


0.0039 


88% 


73% 


T1/V1 


TA 


20 


12 


0.2895 


0.1 189 


91 % 


85% 


T1/V4 


TC 


46 


1 9 


0.0018 


0.0369 


75% 


63% 


V-4/V-3 


CG 


42 


37 


0.5926 


0.5775 


59% 


62% 


V-4/V-1 


CC 


47 


28 


0.0095 


0.2929 


67% 


61 % 


V-4/V1 


CA 


35 


32 


0.8785 


0.7463 


71% 


73% 


V-4/V4 


CC 


49 


31 


0.0742 


0.401 1 


55% 


50% 


V-3/V-1 


GC 


52 


32 


0.0350 


0.5466 


52% 


49% 


V-3A/1 


GA 


42 


32 


0.4122 


0.5624 


56% 


60% 


V-3/V4 


GC 


51 


30 


0.0143 


0.6351 


52% 


49% 


V-1/V1 


CA 


22 


1 1 


0.0559 


0.0445 


94% 


86% 


V-1/V4 


CC 


38 


22 


0.0218 


0.0545 


82% 


73% 


V1/V4 


AC 


38 


21 


0.0535 


0.0919 


82% 


74% 


L-1/S1/S2 


GGG 


32 


14 


0.01 17 


0.0091 


86% 


73% 


L-1/S1/ST+4 


GGA 


41 


23 


0,0429 


0,0006 


51% 


29% 


L-1/S1/ST+7 


GGG 


34 


16 


0.0189 


0.0138 


80% 


67% 


1 H IC* A It* A 

L-1/S1/T1 


GGT 


29 


10 


0.0007 


0.0104 


89% 


76% 


L-1/S1/V-4 


GGC 


45 


25 


0.0203 


0.0434 


64% 


52% 


L-1/S1/V-3 


GGG 


41 


23 


0.0367 


0.0258 


52% 


39% 


L-1/S1/V-1 


GGC 


31 


14 


0.0178 


0.0027 


89% 


74% 


L-1/S1/V1 


GGA 


29 


14 


0.0343 


0.0044 


89% 


75% 


L-1/S1/V4 


GGC 


35 


18 


0.0238 


0.0198 


77% 


64% 










U.UU I 0 


U.UU I o 




Zo /a 


L1/S2/ST+7 


GGG 


42 


16 


0.0013 


0.0382 


77% 


66% 


L-1/S2/T1 


GGT 


35 


11 


0.00003 


0.0146 


85% 


73% 


L-1/S2/V-4 


GGC 


50 


25 


0.0063 


0.0794 


59% 


48% 


L-1/S2/V-3 


GGG 


45 


23 


0.0080 


0.0514 


47% 


35% 


L-1/S2/V-1 


GGC 


34 


14 


0.0048 


0.0076 


86% 


73% 


L-1/S2/V1 


GGA 


33 


14 


0.0084 


0.0096 


86% 


73% 


L-1/S2/V4 


GGC 


35 


18 


0.0102 


0.0128 


75% 


61% 


L-1/ST+4/ST+7 


GAG 


45 


24 


0.0174 


0.0009 


50% 


29% 
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L-1/ST+4/T1 


GAT 


37 


27 


0.1818 


0.0740 


49% 


38% 


1 A IOT i A T\ I A 

L-1/ST+4/V-4 


GAC 


41 


30 


0.4742 


0.0370 


51% 


38% 


L-1/ST+4/V-3 


GAG 


4Q 


28 


0.2165 


0.0376 


51% 


38% 


L-1/ST+4/V-1 


GAC 


43 


23 


0.0305 


0.0002 


51% 


28% 


L-1/ST+4/V1 


GAA 


38 


29 


0.6152 


0.0521 


51% 


38% 


L-1/ST+4/V4 


GAC 


42 


23 


0.0519 


0.0021 


47% 


28% 


L-1/ST+7/T1 


GGT 


39 


14 


0.00007 


0.0232 


79% 


67% 


L-1/ST+7/V-4 


GGC 


51 


26 


0.0055 


0.0222 


62% 


48% 


L-1/ST+7/V-3 


GGG 


48 


22. 


0.0044 


0.0125 


50% 


35% 


L-1/ST+7/V-1 


GGC 


37 


16 


0.0060 


0.0160 


80% 


67% 


L-1/ST+7/V1 


GGA 


35 


16 


0.0133 


0.0063 


81% 


67% 


L-1/ST+7/V4 


GGC 


38 


18 


0.0102 


0.0538 


67% 


55% 


L-1/T1/V-4 


GTC 


40 


31 


0.1656 


0.5227 


65% 


62% 


L-1/T1/V-3 


GTG 


36 


26 


0.1445 


0.7708 


51% 


49% 


L-1/T1/V-1 


GTC 


34 


11 


0.00012 


0.0069 


87% 


73% 


L-1/T1/V1 


GTA 


16 


10 


0.2576 


0.2318 


90% 


85% 


L-1/T1/V4 


GTC 


37 


16 


0.0044 


0.0644 


74% 


64% 


1 #111/ j r\ # 

L-1/V-4/V-3 


i GCG 


39 


31 


0.7040 


0.6053 


53% 


49% 


L-1/V-4/V-1 


GCC 


49 


26 


0.0083 


0.0358 


62% 


49% 


1 A T\ T A I\ t A 

L-1 A/-4A/1 


GCA 


42 


33 


0.6803 


0.4450 


65% 


60% 


L-1/V-4/V4 


i_ GCC 


44 


21 


0.0304 


0.1033 


49% 


39% 


L-1 A/-3/V-1 


GGC 


44 


23 


0.0160 


0.0725 


47% 


36% 


L-1 A/- 3 A/1 


GGA 


38 


28 


0.5633 


0.6053 


50% 


47% 


L-1/V-3/V4 


GGC 


44 


22 


0.0189 


0.1902 


46% 


38% 


L-1 A/ -1 A/1 


GCA 


31 


14 


0.0130 


0.0018 


89% 


74% 


L-1A/-1/V4 


GCC 


3/ 


19 


0.0126 


0.0065 


76% 


62% 


L-1 A/1 A/4 


GAC 


37 


19 


0.0577 


0.0105 


76% 


62% 


S1/S2/ST+4 


GGA 


48 


26 


0.0224 


0.0038 


47% 


29% 


S1/S2/ST+7 


GGG 


41 


15 


0.0007 


0.0274 


78% 


66% 


S1/S2/T1 


GGT 


38 


13 


0.0004 


0.0085 


86% 


72% 


S1/S2/V-4 


GGC 


52 


27 


0.0057 


0.0647 


60% 


49% 


S1/S2/V-3 


GGG 


48 


24 


0.0092 


0.0618 


47% 


36% 


S1/S2/V-1 


GGC 


37 


15 


0.0034 


0.0046 


87% 


73% 


S1/S2/V1 


GGA 


35 


15 


0.0054 


0.0043 


87% 


73% 


S1/S2/V4 


GGC 


41 


20 


0.0084 


0.0185 


75% 


61% 


S1/ST+4/ST+7 


GAG 


45 


33 


0.0935 


0.0181 


53% 


39% 


S1/ST+4/T1 


GAT 


50 


26 


0.0028 


0.0003 


50% 


28% 


S1/ST+4/V-4 


GAC 


48 


30 


0.0638 


0.0063 


55% 


39% 


S1/ST+4A/-3 


GAG 


48 


28 


0.0600 


0.0203 


53% 


39% 


S1/ST+4A/-1 


GAC 


49 


35 


0.0896 


0.0160 


53% 


38% 


ol/o 1 +**/ V 1 


GAA 


48 


36 


0.1 558 




53% 


39% 


S1/ST+4A/4 


GAC 


47 


36 


0.0625 


0.0110 


53% 


38% 


S1/ST+7/T1 


GGT 


42 


14 


0.00012 


0.0253 


79% 


66% 


S1/ST+7A/-4 


GGC 


45 


28 


0.0361 


0.3136 


67% 


61% 


S1/ST+7A/-3 


GGG 


48 


29 


0.0295 


0.4047 


53% 


48% 


S1/ST+7A/-1 


GGC 


30 


13 


0.0117 


0.2931 


85% 


80% 


S1/ST+7A/1 


GGA 


28 


13 


0.0260 


0.2911 


85% 


80% 


S1/ST+7/V4 


GGC 


39 


23 


0.0325 


0.434§ 


71% 


67% 


S1/T1/V-4 


GTC 


52 


26 


0.0013 


0.0448 


63% 


51% 
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S1/T1/V-3 


GTG 


50 


23 


0.0008 


0.0282 


52% 


38% 


S1/T1/V-1 


GTC 


38 


13 


0.00018 


0.0029 


88% 


73% 


S1/T1/V1 


GTA 


36 


13 


0.0008 


0.0059 


88% 


74% 


S1/T1/V4 


GTC 


45 


19 


0.0006 


0.0286 


76% 


63% 


S1/V-4/V-3 


GCG 


47 


30 


0.0277 


0.4330 


56% 


51% 


S1/V-4/V-1 


GCC 


45 


28 


0.0212 


0.4023 


67% 


62% 


S1/V-4/V1 


GCA 


44 


28 


0.0361 


0.4281 


67% 


63% 


S1/V-4/V4 


GCC 


48 


31 


0.0423 


0.3538 


55% 


50% 


S1/V-3/V-1 


GGC 


50 


31 


0.0259 


0.5545 


52% 


49% 


S1/V-3/V1 


GGA 


49 


30 


0.0326 


0.6543 


52% 


49% 


S1/V-3/V4 


GGC 


50 


30 


0.0138 


0.6664 


53% 


50% 


S1/V-1/V1 


GCA 


20 


8 


0.0191 


0.0341 


94% 


86% 


S1/V-1/V4 


GCC 


37 


22 


0.0454 


0.0589 


82% 


73% 


S1/V1/V4 


GAC 


37 


21 


0.0427 


0.0773 


82% 


73% 


S2/ST+4/ST+7 


GAG 


49 


25 


0.0102 


0.0024 


48% 


29% 


S2/ST+47T1 


GAT 


51 


24 


0.0027 


0.001 1 


48% 


28% 


S2/ST+4/V-4 


GAC 


52 


23 


0.0069 


0.0013 


49% 


28% 


S2/ST+4/V-3 


GAG 


51 


21 


0.0038 


O.OOOo 


48% 


28% 


S2/ST+4/V-1 


GAC 


50 


24 


0.0096 


0.0012 


49% 


28% 


S2/ST+4/V1 


GAA 


49 


26 


0.0126 


0.0009 


48% 


29% 


S2/ST+4/V4 


GAC 


4Q 


25 


0.0135 


0.0019 


47% 


28% 


S2/ST+7/T1 


GGT 


46 


13 


0.000022 


0.0419 


77% 


66% 


S2/ST+7/V-4 


GGC 


55 


28 


0.0036 


0.0358 


60% 


47% 


S2/ST+7/V-3 


GGG 


51 


23 


0.0029 


0.0377 


47% 


34% 


S2/ST+7/V-1 


GGC 


44 


15 


0.000024 


0.0267 


78% 


66% 


S2/ST+7/V1 


GGA 


42 


15 


0.0007 


0.0198 


79% 


66% 


S2/ST+7/V4 


GGC 


44 


17 


0.00018 


0.0475 


66% 


54% 


S2/T1/V-4 


GTC 


55 


24 


0.0005 


0.0654 


59% 


48% 


S2/T1/V-3 


GTG 


51 


21 


0.0006 


0.0465 


47% 


35% 


S2/T1/V-1 


GTC 


40 


13 


0.00009 


0.0081 


86% 


73% 


S2/T1/V1 


GTA 


38 


13 


0.0006 


0.0083 


86% 


72% 


S2/T1/V4 


GTC 


43 


18 


0.0008 


0.0135 


75% 


61% 


S2/V-4/V-3 


GCG 


52 


24 


0.0016 


0.031 1 


48% 


35% 


S2/V-4/V-1 


GCC 


54 


27 


0.0025 


0.0541 


60% 


48% 


S2/V-4/V1 


GCA 


52 


27 


0.0061 


0.0516 


60% 


49% 


S2/V-4/V4 
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21% 


10% 


V-4/V-1 /VI 


GCA 


9 


7 


0.6385 


0.541 8 


29% 


23% 


l / A i\ t A l\ 1A 

V-4/V-1/V1 


CAA 


6 


3 


0.6385 


0.2754 


18% 


11% 


V-4/V-1/V4 


GCC 


9 


7 


0.6256 


0.61 65 


25% 


20% 


V-4/V-1 / V4 


CAG 


7 


4 


0.6256 


0.1272 


21% 


10% 


V-4/V1/V4 


GAC 


9 


7 


O.ooUo 




29% 


21 % 


V-4/V1/V4 


CAG 


6 


3 


U.DOUO 


0.81 71 


21 % 


1 9% 


1/ *>l\t A l\/A 

V-o/V-l/VI 


ACA 


1 1 


8 


0.7592 


t\ onno 

u.ouyo 


32% 


35% 


1/ tt\f A I\IA 

V-0/V-1/V4 


ACC 


9 


7 


U.t>2U2 


0.6072 


29% 


24% 


V-3/V-1/V4 


GAG 


7 


4 


0.6202 


0.1310 


21% 


10% 


V-3/V1/V4 


AAC 


9 


7 


0.7614 


0.8032 


27% 


25% 


V-3/V1/V4 


GAG 


6 


3 


0.7614 


0.4394 


17% 


10% 


i/ ■• i\ia ntA 
V-1/V1/V4 


AAG 


6 


3 


0.8564 


0.3224 


18% 


10% 


S1/T1/V-1/V1 


GTCA 


13 


8 


0.1790 


0.0127 


50% 


75% 


S1/T1 /V-1/V1 


ATAA 


6 


3 


0.1790 


0.3704 


18% 


10% 


51/T1/V-1/V4 


GTCC 


1 2 


6 


0.1823 


0.0812 


46% 


64% 


OH fT^ ft/ A TX f A 

S1/T1/V-1/V4 


ATAG 


6 


3 


0.1823 


0.1256 


21% 


10% 


S1/T1/V1/V4 


GTAC 


12 


6 


0.1758 


0.0704 


46% 


65% 


O-H IT ill n#>< f\ / A 

S1/T1/V1/V4 


ATAG 


6 


3 


0.1758 


0.3261 


17% 


10% 


t>AI\I A TA tA FX g A 

S1/V-1/V1/V4 


AAAG 


6 


3 


0.8570 


0.3023 


18% 


10% 


S2/5T+7/T1 /V-4 


GGTC 


13 


9 


0.1205 


L 0.0014 


21% 


49% 


r^o/OT" > ~7t~rA [\ I A 

S2/ST+7/T1 /V-4 


CATC 


7 


4 


0.1205 


0.4692 


21% 


17% 


52/ST+7/T 1 / V-4 


GATG 


3 


0 


r\ a one 

0.1205 


0.5660 


7% 


4% 


S2/ST+7/T1 /V-3 


GGTG 


10 


6 


0.1048 


0.0332 


18% 


38% 


SZ/o Y-rfJ I1/V-3 


CATG 


7 




n A r\ A o 


U.4o42 


21 % 


17% 


52/ST+7/T1/V-3 


GATA 


3 


0 


0.1048 


0.4934 


7% 


5% 


S2/ST+7/T 1 / V-1 


GGTC 


12 


10 


0.2017 


0.01 1 6 


43% 


68% 


S2/ST+7/T1 /V-1 


CATA 


7 


4 


0.2017 


0.5031 


21% 


17% 


OA/Ol^f/I IfV-l 






4 
I 


n 9fH 7 




70/ 

1 /o 


7°/ 
f /o 


S2/ST+7/T1/V4 


GGTC 


9 


6 


0.2571 


0.1649 


43% 


57% 


S2/ST+7/T1/V4 


CATG 


7 


4 


0.2571 


0.1102 


21% 


10% 


S2/ST+7/T1/V4 


GATC 


4 


1 


0.2571 


0.4225 


4% 


7% 


S2/ST+7/V-4AM 


GGCC 


12 


9 


0.0903 


0.0064 


25% 


49% 


S2/ST+7/V-4/V-1 


CACA 


7 


4 


0.0903 


0.51971 


21% 


17% 


S2/ST+7/V-4A/-1 


GAGC 


3 


0 


0.0903 


0.6054 


7% 


4% 


S2/ST+7/V-4/V4 


GGCC 


9 


6 


0.1321 


0.1216 


21% 


38% 


S2/ST+7/V-4/V4 


CACG 


7 


4 


0.1321 


0.2038 


21% 


9% 
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0.0854 
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0.0269 


18% 


4fl% 

/o 


CO/T1 A/-0iA/A 




Q 

y 


7 
f 


U.ZOOJ 


u.oyu 1 


4y /o 


9 AO/ 


QO/T1 A/-"? A/4 

O^/ 1 l/V~0/V*r 




7 


A 


U xOOO 


0 1433 


91 % 


1 A% 
I U /o 


oo/TI A/-1 A/4 


GTCC 


1 1 




0 1 840 


u.uoyu 


46% 


Dt /o 


co/TI A/-1 A/4 
Ot/ I l/v-l/v*r 


O 1 


7 


O 


VJ. 1 0*fU 


U. I U'f-O 


91 OA 
4 1 /o 


1 C\ a L 
I U /o 


Q9 A/-AA/-1 A/4 


GGCC 


g 


7 


0 2385 


0 4472 


29% 


21 % 


QO/l/ Af\l 1 A/4 
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Ol^lfl l/V "*!■/ V *r 


ATGC 


O 
0 


u 


0 0947 


u.ooy i 


4°/, 
*t /O 


^0/ 
O /o 


e-r+7/T1 A/-** A/-1 

O 1 ~li 1 l/V O/ V " 1 
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0.0125 


18% 
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57% 
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38% 
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21% 
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10 


G 


0.2753 


0.0285 


18% 


40% 


S2/T1/V-3A/-1/V4 


CTGAG 
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4 


0.2753 


0.1351 


21% 


10% 
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11 
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EXAMPLE 15: Attributable Risk Assessment 

The frequency of a functional polymorphism and the relative risk of the 
heterozygote and homozygote (at-risk) genotypes can be used to evaluate the 
~5 attributable fraction (MJ. Khoury et al., 1993, Fundamentals of Genetic 
Epidemiology, J.L. Kelsy et al., (eds), Monographs in Epidemiology and 
Biostatistics, Oxford University Press, New York, NY, Section 3, p. 74-77) or 
attributable risk in the population. An attributable fraction of 25% would mean 
that if the population were monomorphic for the protective allele, the 
10 prevalence of the trait would be 25% lower. 

The formula for the attributable fraction is: 

Attributable fraction = ^^^^O"" 1 ' 

where f is the allele frequency, y Is the relative risk of the heterozygote 
genotype over the wild type homozygote, and r\ is the risk of the homozygote 
1 5 mutant over the wild type homozygote. This approach requires the estimation 
of f, y and r|. Ideally these quantities should be estimated in an epidemiological 
sample. 

For this study, a genome scan with affected sibling pairs was employed, 



I 
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followed by association study using IBD = 2 individuals as cases in the 
case/control comparison. This study design offered maximum power to detect 
linkage and association. However, it did not provide estimates of the required 
parameters, namely 1 ) the relative risk (or odds ratio) of the genotype/allele for 
5 most SNPs or haplotypes; and 2) the frequency of the SNP in the general 
population. In a recent paper, researchers used the data from TDT analysis 
to estimate allele and genotype relative risks assuming a multiplicative model 
or ti = y 2 (D. Altshuler et al., 2000, Nature Genetics 26:76-80). In accordance 
with this method, the mutant homozygote was predicted to carry a relative risk 

1 0 equal to the square of the risk for the heterozygote. 

To overcome some of the difficulties associated with a case/control 
design (see above), the data obtained from typing eleven SNPs in Gene 216 
on the entire population were used to estimate the relative risk of these eleven 
SNPs. The analysis was not limited to the subset of IBD = 2 individuals. The 

1 5 data from the TDT obtained by using the first asthmatic sibling per family were 
used. Because of the limited number of informative matings in the TDT 
analysis, a multiplicative model for the genotype relative risk was used as in 
Altshuler et al., i.e. r\ = y 2 . By using the control population to estimate allele 
frequencies, the attributable risk may have been underestimated. Based on 

20 this, the attributable risk was computed for the single SNPs and SNP 
haplotypes that were significant in the TDT analysis (p < 0.05) using the 
asthma phenotype in the combined population. These values, as well as y, the 
relative risk (RR), and its 95% confidence interval are shown below in Table 30. 
It is noted that for Table 30, the haplotypes are written without slashes 

25 separating each allele. Thus, the haplotype that is G/T/G/C/C is written as 
GTGCC in Table 30. This represents the short-hand designations for the 
haplotypes and is not, in any way, meant to represent contiguous nucleotide 
sequences. 

30 



-278- 



WO 02/083077 



PCT/US02/12063 



TABLE 30 



Asthma Yes/No 
Combined 


SNP(s) 


Over-Transmitted 
Allele or Haplo 


RR 


95% Confidence 
Interval 


Attributable 
Fraction 


Lower 


Upper 


S1 


G 


1.41 


0.98 


2.10 


47% 


S2 


G 


1.19 


0.94 


1.53 


24% 


L-1/S1 


GG 


1.29 


0.98 


1.7Q 


41% 


L-1/S2 


GG 


1.27 


0.99 


1.63 


39% 


L-1/V-1 


GC 


1.26 


0.98 


1.64 


39% 


S1/S2 


GG 


1.28 


1.00 


1.65 


40% 


S1/T1 


GT 


1.32 


1.02 


1.72 


42% 


S2/ST+4 


GA 


1.08 


0.88, 


1.33 


25% 


S2/ST+7 


GG 


1.26 


1.00 


1.58 


37% 


S2/T1 


GT 


1.30 


1.02 


1.68 


41% 


S2/V-4 


GC 


1.15 


0.94 


1.41 


29% 


S2/V-3 


GG 


1.08 


0.88 


1.33 


25% 


S2/V-1 


GC 


1.32 


1.03 


1.70 


41% 


S2/V1 


GA 


1.30 


1.02 


1.68 


40% 


S2/V4 


GC 


1.23 


0.99 


1.55 


35% 


ST+7/T1 


GT 


1.24 


0.98 


1.57 


36% 


T1/V-1 


TC 


1.25 


0.98 


1.61 


38% 


T1/V4 


TC 


1.21 


0.97 


1.51 


34% 


L-1/S1/S2 


GGG 


1.28 


0.99 


1.67 


40% 


L-1/S1/T1 


GGT 


1.28 


0.97 


1.69 


41% 


L-1/S1/V-4 


GGC 


1.13 


0.91 


1.42 


29% 


L-1/S1/V-1 


GGC 


1.30 


1.00 


1.70 


41% 


L-1/S1/V1 


GGA 


1.27 


0.98 


1.66 


40% 


L-1/S2/ST+4 


GGA 


1.08 


0.87 


1.34 


25% 


L-1/S2/ST+7 


GGG 


1.27 


1.01 


1.61 


38% 


L-1/S2/T1 


GGT 


1.32 


1.02 


1.72 


42% 


L-1/S2/V-4 


GGC 


1.14 


0.93 


1.42 


29% 


L-1/S2/V-3 


GGG 


1.07 


0.86 


1.32 


24% 


L-1/S2/V-1 


GGC 


1.32 


1.03 


1.72 


42% 


L-1/S2/V1 


GGA 


1.32 


1.03 


1.72 


42% 


L-1/S2/V4 


GGC 


1.23 


0.98 


1,55 


35% 


L-1/ST+7/T1 


GGT 


1.24 


[ 0.97 


1.59 


37% 


L-1/ST+7AM 


GGC 


1.21 


0.96 


1.55 


35% 


L-1/T1/V-4 


GTC 


1.09 


0.87 


1.37 


27% 


L-1/T1/V-1 


GTC 


1.30 


1.00 


1.70 


41% 


L-1/T1/V4 


GTC 


1.20 


0.96 


1.51 


34% 


L-1/V-4/V-1 


GCC 


1.15 


0.93 


1.43 


30% 


L-1/V-3AM 


GGC 


1.05 


0.85 


1.30 


23% 


L-1/V-1/V1 


GCA 


1.25 


0.97 


1.64 


39% 


S1/S2/ST+4 


GGA 


1.06 


0.86 


1.32 


24% 


S1/S2/ST+7 


GGG 


1.28 


1.02 


1.63 


39% 


S1/S2/T1 


GGT 


1.28 


1.00 


1.65 


39% 


S1/S2/V-4 


GGC 


1.17 


0.94 


1.45 


31% 
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C1 /Oft/ 1 

oi/o2/v-o 


GGG 


I .UD 


0.86 


1 .02 


24% 


51/52/V-1 


GGC 


1.29 


1 .01 


1 .67 


40% 


o 1/52/ VI 


GGA 


1 .20 


1 .OC 


1 .00 


40% 


CI /COA/vI 

51/S2/V4 


GGC 


1 .22 


0.98 


1 .53 


35% 


bl/o 1 +4/1 1 


GAT 


1 .08 


0.88 


1 .34 


25% 


51/5 1 +7 / 1 1 


GGT 


1 .30 


1 .02 


1 .66 


40% 


51/ 1 1/V-4 


GTC 


1.21 


0.98 


1 .50 


33% 


51/ 1 1/V-3 


GTG 


1.10 


0.89 


1 .37 


27% 


S1/T1/V-1 


GTC 


1.30 


1 .01 


1 .69 


41% 


d- A tTA t\IA 

SI /II /VI 


GTA 


1 .28 


1 .00 


1 .66 


40% 


oa tya i\ia 
S1/T1/V4 


GTC 


1.21 


0.97 


1 .51 


34% 


DUCTj-jl /Til 

52/5 T +4/T1 


GAT 


1.11 


0.90 


1 .38 


28% 


S2/5T+4/V-4 


GAC 


1 .1 1 


0.90 


1 .38 


27% 


52/5 1 +4/V-0 


GAG 


1.11 


0.89 


1 .39 


28% 


SZ/o 1 +4/V-1 


GAC 


1.1C 


0.89 


1 .36 


27% 


b//o 1 +4/ VI 


GAA 


1 .08 


0.87 


1 .34 


oco/ 
2b% 


04/0 1 +4/ V4 




1 .Uo 


U.of 


1 .34 


20% 


oz/o 1 + /7 1 1 




1 .33 


1 .UO 


1 .69 


41 % 


52/5 1 +//V-4 


0I3O 


1 .08 


0.88 


1 .33 


25% 


CO/CTx7ft/ Q 
o//o 1 + I / V-O 


GGG 


1.06 


0.86 


1 .31 


23% 


52/5 I +7/V-I 


GGC 


1 .32 


1 .05 


1 .67 


40% 


52/5 I + /7V1 




■i on 


1 .04 


1 .00 


40% 


CO/OTX7AM 




1 .25 


i .in 


H Eft 


36% 


SZ/T1/V-4 


GTC 


1 .23 


0.99 


1 .53 


35% 


5Z/T1/V-3 


GTG 


1.12 


0.90 


1 .40 


28% 


52/T1/V-1 


GTC 


1.31 


1.03 


1 .70 


41 % 


bZ/ 1 1/V1 


GTA 


1 .31 


1 .02 


1 .69 


41 % 


52/T1/V4 


GTC 


1.24 


1 .00 


1 .56 


36% 


5Z/V-4/V-3 


GCG 


1 .04 


0.84 


1 .29 


23% 


52/V-4/V-1 


GCC 


1.20 


0.97 


1 .49 


33% 


OA/ Af\fA 

5Z/V-4/V1 


GCA 


1.20 


0.97 


1 .49 


33% 


OA/ >1 A/ /I 

5Z/V-4/V4 


GCC 


1 .1 7 


0.94 


1 .46 


31% 


OA# *>t\l 4 


GGC 


1 .1C 


0.89 


1 .36 


27% 


SOAf OA/4 

527V -o/ VI 


GGA 


1 .08 


0.87 


1 .34 


25% 


OA/ QA//I 

5Z/V-J/V4 


GGC 


1 .12 


0.90 


1 .40 


29% 


OA# <i l\f<\ 

52/V-1/V1 


A 

OLrA 


1 .31 


1 .03 


1 .69 


41 % 


OA/ «l f\lA 
52/V- 1/ V4 




1 .21 


c\ 07 

u.y / 


1 .ol 


o4% 


52/V1/V4 


AAA 

bAO 


1 .22 


0.98 


1 .53 


35% 


5 1 +4/OF+7/ 1 1 


AGT 


1 .04 


0.84 


1 .29 


22% 


5 1 +4/ 1 1/V-1 


ATC 


1.12 


0.91 


1 .38 


28% 


C.T4.4/T1 A74 
O I T*f/ | I/V4 


M I O 


I . I u 


u.oy 


i .00 




ST+7/T1A/-4 


GTC 


1.13 


0.92 


1.40 


29% 


ST+7/T1A/-3 


GTG 


1.05 


0.85 


1.31 


24% 


ST+7/T1A/-1 


GTC 


1.25 


0.99 


1.59 


37% 


ST+7/T1/V1 


GTA 


1.23 


0.97 


1.57 


36% 


ST+7/T1/V4 


GTC 


1.22 


0.98 


1.52 


34% 


ST+7A/-4/V-1 


GCC 


1.05 


0.84 1 


1.31 


24% 


T1/V-4/V-1 


TCC 


1.23 


1.00 


1.53 


34% 


T1/V-4A/4 


TCC 


1.19 


0.96 


1.48 


32% 
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T1/V-3/V-1 


TGC 


1 .13 


0.92 


1 .40 


29% 


T1 /V-3A/4 


TGC 


1 .13 


0.92 


1 .40 


29% 


T1 A/-1A/1 

1 1/ V" If V 1 


TCA 


1 25 


0 97^ 


1 61 


OO /o 


T1 A/-1 A/4 

1 If V - If V H- 


TCC 


1 .21 


0.97 


1 .51 


34°A 
Ot /o 


T1 A/1 A/4 

1 l/V l/Vf 


TAC 


1 93 
I .^o 


1 no 


1 54 


3*W 


S1/T1 A/-1 A/1 

will If V" l/V 1 


GTCA 


1 .2S 


1 .00 


1 67 


4fW„ 


S1/T1 A/-1 A/4 


GTCC 


1 .23 


0.99 


1 .54 


3^% 

OO /O 


c-1 /T"i fi/1 A/4 

OKI I / V I / V1> 


I MO 


1 93 


u.yy 


1 ^4 

i .o«+ 


OO /o 


c ?2/ , 5T+7/T1 A/-4 


GGTC 


1 16 


0 93 


1 45 


31 

O I /o 


^9/^T+7/T1 A/ -3 

0£/OI~lll l/v-o 


GGTG 


1 08 


0 87 


1 35 


900/ 
£0 /a 


^2/^T+7/T1 A/-1 


GGTC 


1 34 


1 06 


1 70 


410/ 

H- I /o 


^?/^T+7/T1 A/4 

OA/O I ~f / 1 l/VJ 


GGTC 


1 27 


1 02 


1 52 


Of /o 


Q9/QT+7A/-4A/-1 

Oa/O J " / / V -*K V 1 


GGCC 


1 14 


n Q9 


1 AO 






GGCC 


1.12 


0 89 


1 40 


/o 


S 2/ST+7 A/-3 A/-1 


GGGC 


1 .06 


0.85 


1.31 


24% 


S2/ST+7A/-3A/4 


GGGC 


1.07 


0.85 


1 .33 


25% 


^2/ € 5T+7A/-1 A/4 


GGCC 


1 24 


1 00 


1 55 


OO /O 


qorri A/-4A/-1 


GTCC 


1 .24 


1 .OC 


1 .54 


oo /o 


^2/T1 A/-4A/4 

w£l 1 1 f V < -r/ V *t 


GTCC 


1 21 


0 97 




34% 

O'f /o 


S2/T1 A/-3 A/-1 

1 l/V Of V I 


GTGC 


1.14 


0.92 


1 .42 


30% 


S2/T1 A/-3/V4 


GTGC 


1.17 


0.94 


1 .47 


32% 


R2/T1 /V-1 A/4 

u£f | l/V" 1/ Vt 


GTCC 


1 22 


0 97 


1 53 


OO /o 


S2A/-4A/-1A/4 


GCCC 


1.15 


0.92 


1 43 


3n% 

OVJ /O 


Q2A/-3 A/-1 A/4 

w^f V Of V 1 / V *» 


GGCC 


1.12 


0.90 


1 39 




CT+7/T1 A/-4 A/-1 

J 1 ' / / 1 If V ™"t| V I 


GTCC 


1 16 


0 94 


1 44 


3n% 

OLF /0 


CT+7/T1 A/-4 A/4 

O 1 T /l I l/V ""*tf V M- 


GTCC 


1 1^ 

I . I o 


0 92 


1 44 


OU /o 


QT+7/T1 A/-3A/-1 

D I 'I / I l/V "Of V ~ 1 




1 .uo 


U.Oi 




^O /o 


CT+7/T1 A/-3 A/4 

wl T /f I l/V Of V *T 


GTGC 


1 .09 


0 88 


1 36 


9 7 OA 


CT+7/T1 A/-1 A/4 

9l~f/l l/V*l/V"T 


GTCC 


1 91 


0 98 


1 ^1 


O^f /o 


n A/-4A/-1 A/4 

1 1/ V-tf V- 1/ VI 


I OV_>V_/ 


1 1Q 


U.bQ 


1 AQ 


Oo /o 


T1/V-3/V-1/V4 


TGCC 


1 .16 


0.94 


1.45 


31 % 


T1/V-1/V1/V4 


TCAC 


1.20 


0.96 


1.50 


33% 


S1/T1/V-1/V1/V4 


GTCAC 


1.22 


0.98 


1.52 


34% 


S2/ST+7/T1A/-3AM 


GTGCC 


1.09 


0.88 


1.36 


27% 


S2/ST+7/T1/V-3/V4 


GGTGC 


1.10 


0.88 


1.38 


28% 


S2/ST+7/T1A/-1/V4 


GGTCC 


1.26 


1.01 


1.58 


37% 


S2/ST+7/V-3/V-1/V4 


GGTGC 


1.07 


0.85 


1.33 


25% 


S2/T1/V-3/V-1/V4 


GTGCC 


1.17 


0.93 


1.46 


31% 


ST+7/T1/V-3AM/V4 


GTGCC 


1.09 


0.88 


1.36 


27% 



The alleles that conferred increased risk of developing asthma appeared 
common. Haplotype frequencies ranged from 31% to 89%. This effect 
translated into a substantial population attributable risk, with estimates ranging 
5 from 2 to 47% for different SNPs or SNP haplotypes. These computations 
depended heavily on allele frequency and risk estimates. 
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Conclusion : Gene 216 has been demonstrated to be an asthma gene 
in accordance with the data disclosed herein, including: 1) localization to a 
region on chromosome 20 identified through linkage; 2) polymorphism analysis 
performed to identify sequence variants localized in the candidate gene; 3) 
5 genotype analyses of the identified polymorphisms; 4) association between 
identified alleles and the asthma phenotype in a case-control analysis; 5) 
association between identified alleles and the asthma phenotype in 
transmission disequilibrium tests (TDT), haplotype analyses, and analyses 
using additional phenotypes; 6) identification of transcripts in tissues relevant 

10 to pulmonary disease and/or inflammation; and 7) characterization of Gene 
216 as an ADAM family member. It is noted that Gene 216 is also likely to be 
involved in obesity and inflammatory bowel disease, as obesity (Wilson et al., 
1999, Arch. Intern. Med. 159: 2513-14) and inflammatory bowel disease (B. 
Wallaert et al., 1995, J. Exp. Med. 182:1897-1904) have been linked to 

15 asthma. 

The disclosure of each of the patents, patent applications, and 
publications cited in the specification is hereby incorporated by reference 
herein in its entirety. 

20 

Although the invention has been set forth in detail, one skilled in the art 
will recognize that numerous changes and modifications can be made, and that 
such changes and modifications may be made without departing from the spirit 
and scope of the invention. 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid which comprises SEQ ID NO:6, and contains 
at least one allele selected from the group consisting of: 

a. allele G at single nucleotide polymorphism F+1; 

b. allele A at single nucleotide polymorphism L-1 ; 

c. allele G at single nucleotide polymorphism L-1 ; 

d. allele T at single nucleotide polymorphism M+1 ; 

e. allele C at single nucleotide polymorphism Q-1 ; 

f. allele G at single nucleotide polymorphism ST+7; 

g. allele A at single nucleotide polymorphism ST+4; 

h. allele T at single nucleotide polymorphism T+1 ; and 

i. allele C at single nucleotide polymorphism V-1. 



2. An isolated nucleic acid which comprises SEQ ID NO:6, and contains 
at least one allele selected from the group consisting of: 

a. allele A at single nucleotide polymorphism 11 ; 

b. allele G at single nucleotide polymorphism S1 ; 

c. allele G at single nucleotide polymorphism S2; 

d. allele C at single nucleotide polymorphism S2; 

e. allele C at single nucleotide polymorphism T1 ; 

f. allele T at single nucleotide polymorphism T2; 

g. allele C at single nucleotide polymorphism V4; and 

h. allele G for single nucleotide polymorphism V7. 



3. An isolated nucleic acid which comprises at least 50 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one allele selected from 
the group consisting of: 

a. allele G at single nucleotide polymorphism F+1; 

b. allele A at single nucleotide polymorphism L-1 ; 

c. allele T at single nucleotide polymorphism M+1 ; 

d. allele C at single nucleotide polymorphism Q-1 ; 
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e. allele G at single nucleotide polymorphism ST+7; 

f. allele A at single nucleotide polymorphism ST+4; 

g. allele T at single nucleotide polymorphism T+1 ; and 

h. allele C at single nucleotide polymorphism V-1 . 



4. An isolated nucleic acid which comprises at least 50 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one allele selected from 
the group consisting of: 

a. allele A at single nucleotide polymorphism 11 ; 

b. allele G at single nucleotide polymorphism S1 ; 

c. allele G at single nucleotide polymorphism S2; 

d. allele C at single nucleotide polymorphism S2; 

e. allele C at single nucleotide polymorphism T1 ; 

f. allele T at single nucleotide polymorphism T2; 

g. allele C at single nucleotide polymorphism V4; and 

h. allele G for single nucleotide polymorphism V7. 



An isolated nucleic acid which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one allele selected from 
the group consisting of: 

a. allele G at single nucleotide polymorphism F+1; 

b. allele A at single nucleotide polymorphism L-1 ; 

c. allele T at single nucleotide polymorphism M+1; 

d. allele C at single nucleotide polymorphism Q-1; 

e. allele G at single nucleotide polymorphism ST+7; 

f. allele A at single nucleotide polymorphism ST+4; 

g. allele T at single nucleotide polymorphism T+1 ; and 

h. allele C at single nucleotide polymorphism V-1 . 

An isolated nucleic acid which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one allele selected from 
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the group consisting of: 

a. allele A at single nucleotide polymorphism 11 ; 

b. allele G at single nucleotide polymorphism S1 ; 

c. allele G at single nucleotide polymorphism S2; 

d. allele C at single nucleotide polymorphism S2; 

e. allele C at single nucleotide polymorphism T1 ; 

f. allele T at single nucleotide polymorphism T2; 

g. allele C at single nucleotide polymorphism V4; and 

h. allele G for single nucleotide polymorphism V7. 

7. An isolated nucleic acid which comprises at least 1520 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one haplotype selected 
from the group consisting of: 

a. haplotype C/G at single nucleotide polymorphisms ST+4A/-3; 

b. haplotype C/C at single nucleotide polymorphisms ST+4/V-2; 

c. haplotype C/C at single nucleotide polymorphisms ST+4A/-4; 

d. haplotype A/C at single nucleotide polymorphisms ST+7/V-2; 

e. haplotype T/C at single nucleotide polymorphisms S+1/ST+4; and 

f. haplotype C/T at single nucleotide polymorphism ST+4/ST+5. 

8. An isolated nucleic acid which comprises at least 2070 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one haplotype selected 
from the group consisting of: 

a. haplotype C/T at single nucleotide polymorphisms S2/T+2; and 

b. haplotype G/C at single nucleotide polymorphisms S2/V-1. 

9. An isolated nucleic acid which comprises at least 3915 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one haplotype selected 
from the group consisting of: 

a. haplotype G/A at single nucleotide polymorphisms F+1/ST+4; 

b. haplotype C/A at single nucleotide polymorphisms KL+2/ST+4; 
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c. haplotype G/A at single nucleotide polymorphisms L-1/ST+7; 

d. haplotype G/C at single nucleotide polymorphisms L-1 A/-1 ; 

e. haplotype T/G at single nucleotide polymorphisms Q-1/T+2; 

f. haploytype C/A at single nucleotide polymorphisms Q-1/ST+4; 

g. haplotype A/G at single nucleotide polymorphisms ST+4/ST+7; 

h. haplotype A/C at single nucleotide polymorphisms ST+4AA-1 ; and 

i. haplotype G/A at single nucleotide polymorphisms T+2/V-1 . 

10. An isolated nucleic acid which comprises at least 5009 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one haplotype selected 



1; 



from the group consisting of: 


a. 


haplotype A/A at single nucleotide polymorphisms I1/ST+4; 


b. 


haplotype A/A at single nucleotide polymorphism 11 A/1 ; 


c. 


haplotype A/C at single nucleotide polymorphisms I1/V2; 


d. 


haplotype A/T at single nucleotide polymorphisms I1/V3; 


e. 


haplotype A/A at single nucleotide polymorphisms S1/S+1; 


f. 


haplotype G/A at single nucleotide polymorphisms S1/ST+4; 


g. 


haplotype G/T at single nucleotide polymorphisms S1/T1 


h. 


haplotype G/A at single nucleotide polymorphisms S2/ST+4; 


i. 


haplotype G/C at single nucleotide polymorphisms S2/V-1; 


j- 


haplotype A/C at single nucleotide polymorphisms ST+4/V4; 


k. 


haplotype C/C at single nucleotide polymorphisms S2/V6; 


I. 


haplotype A/C at single nucleotide polymorphisms ST+4/V7; 


m. 


haplotype G/T at single nucleotide polymorphisms ST+7/T 1 ; 


n. 


haplotype T/C at single nucleotide polymorphisms T1/V4; 


o. 


haplotype C/C at single nucleotide polymorphisms V-1/V4; 


P- 


haplotype G/G/T at single nucleotide polymorphisms S2/ST+7/T1 


q- 


haplotype G/G/C at single nucleotide polymorphisms S2/ST+7/V- 


r. 


haplotype G/T/C at single nucleotide polymorphisms ST+7/T1/V4; 



s. haplotype G/G/T/C at single nucleotide polymorphisms 
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S2/ST+7/T1/V-1; 

t. haplotype G/G/T/G/C at single nucleotide polymorphisms 
S2/ST+7/T1/V-3/V-1; and 

u. haplotype G/G/T/C/C at single nucleotide polymorphisms 
S2/ST+7/T1/V-1/V4. 

11. An isolated nucleic acid which comprises at least 6875 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one haplotype selected 
from the group consisting of: 

a. haplotype T/A/A/C/G/C/T/C/A/C/C/G/A/C/C at single nucleotide 
polymorphisms D1/F1/11/L1/S1 /S2yT1 /T2/V1 A/2A/3/V4A/5A/6/V7; 

b. haplotype T/A/A/C/G/C/C/C/A/C/T/C/A/C/G at single nucleotide 
polymorphisms D1/F1/I1/L1/S1 /S2/T 1 /T2/V 1 /V2/V3/V4/V 5/V6/V7 ; 

c. haplotype T/A/A/C/G/C/C/T/A/C/T/C/A/C/G at single nucleotide 
polymorphisms D1 /F1/1 1 /L1 IS 1 /S2/T 1 /T2 A/ 1 A/2 A/3 A/4A/5 A/6 A/7 ; 

d. haplotype T/A/A/C/G/C/C/T/A/C/T/C/A/T/G at single nucleotide 
polymorphisms D1/F1/I1/L1/S1 /S2/T1 /T2A/1 A/2A/3A/4/V5A/6A/7; 

e. haplotype T/A/G/C/A/C/T/C/A/C/T/G/A/C/G at single nucleotide 
polymorphisms D1/F1/11 /L1 /S1 /S2/T1 /T2/V1 A/2A/3/V4/V5A/6A/7; 

f. T/A/G/C/A/C/T/C/A/C/T/G/A/C/G alleles at single nucleotide 
polymorphisms D1/F1/I1 /L1 /S1 /S2/T1 /T2/V1 A/2/V3/V4/V5/V6A/7; 

g. haplotype T/A/G/C/G/G/T/C/A/C/T/G/A/T/G at single nucleotide 
polymorphisms D1/F1/I1 /L1 /S 1 /S2/T1 /T2A/1 A/2/V3A/4/V5A/6A/7; 

h . haplotype T/A/G/C/G/G/T/C/A/C/T/G/A/C/C at single nucleotide 
polymorphisms D1 /F1/1 1/L1/S 1/S2/T1 /T2A/1 /V2/V3/V4A/5/V6A/7; 

i. haplotype T/A/G/C/G/G/T/C/A/Crr/C/A/C/C at single nucleotide 
polymorphisms D1/F1/I1/L1/S1/S2/T1/T2/V1A/2/V3A/4/V5/V6A/7; 

j. haplotype T/A/G/C/G/G/T/C/A/C/T/C/A/C/C at single nucleotide 
polymorphisms D 1 /F 1 /I1 /L1 /S 1 /S2/T1 /T2/V1 A/2A/3A/4/V5A/6/V7; 

k. haplotype T/A/G/C/G/G/T/C/A/C/T/C/A/C/G at single nucleotide 
polymorphisms D 1 /F1 /1 1 /L1 /S 1 /S2/T1 /T2/V1 A/2/V3A/4/V5/V6/V7; 
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I. haplotype T/A/G/C/G/G/T/C/A/C/T/C/A/C/G at single nucleotide 
polymorphisms D1/F1/I1/L1/S1/S2/T1/T2A/1A/2A/3A/4A/5A/6A/7; and 

m. haplotype T/G/A/C/G/C/T/C/T/T/C/G/G/C/G at single nucleotide 
polymorphisms D1/F1/I1/L1/S1/S2/T1/T2A/1A/2A/3A/4A/5A/6A/7. 

12. A set of isolated nucleic acids comprising: 

a. a first isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains allele C at single 
nucleotide polymorphism ST+4; and 

b. a second isolated nucleic acid which comprises at least 1 5 
consecutive nucleotides of SEQ ID NO:6 and contains an allele selected from 
the group consisting of: 

1 . allele G at single nucleotide polymorphism V-3; 

2. allele C at single nucleotide polymorphism V-2; 

3. allele C at single nucleotide polymorphism V-4; 

4. allele T at single nucleotide polymorphism S+1 ; and 

5. allele T at single nucleotide polymorphism ST+5. 

13. A set of isolated nucleic acids comprising: 

a. a first isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains allele C at single 
nucleotide polymorphism S2; and 

b. a second isolated nucleic acid which comprises at least 1 5 
consecutive nucleotides of SEQ ID NO:6 and contains an allele selected from 
the group consisting of: 

1 . allele C at single nucleotide polymorphism V6; and 

2. allele T at single nucleotide polymorphism T+2. 

14. A set of isolated nucleic acids comprising: 

a. a first isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains allele A at single 
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nucleotide polymorphism ST+7; and 

b. a second isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains allele C at single 
nucleotide polymorphism V-2. 

15. A set of isolated nucleic acids comprising: 

a. a first isolated nucleic acid which comprises at (east 15 
consecutive nucleotides of SEQ ID NO:6 and contains allele A at single 
nucleotide polymorphism ST+4; and 

b. a second isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains an allele selected from 
the group consisting of: 

1 . allele C at single nucleotide polymorphism Q+1 ; 

2. allele C at single nucleotide polymorphism KL+2; 

3. allele G at single nucleotide polymorphism ST+7; 

4. allele C at single nucleotide polymorphism V-1 ; 

5. allele C at single nucleotide polymorphism V4; 

6. allele G at single nucleotide polymorphism F+1 

7. allele G at single nucleotide polymorphism S1 ; 

8. allele G at single nucleotide polymorphism S2; 

9. allele C at single nucleotide polymorphism V7; and 

1 0. allele A at single nucleotide polymorphism 11 . 

16. A set of isolated nucleic acids comprising: 

a. a first isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains allele A at single 
nucleotide polymorphism 11; and 

b. a second isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains an allele selected from 
the group consisting of: 

1 . allele A at single nucleotide polymorphism ST+4; 
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2. allele T at single nucleotide polymorphism V3; 

3. allele C at single nucleotide polymorphism V2; and 

4. allele A at single nucleotide polymorphism V1 . 

17. A set of isolated nucleic acids comprising: 

a. a first isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains allele G at single 
nucleotide polymorphism T+2; and 

b. a second isolated nucleic acid which comprises at least 1 5 
consecutive nucleotides of SEQ ID NO:6 and contains an allele selected from 
the group consisting of: 

1 . allele T at single nucleotide polymorphism Q-1 ; and 

2. allele A at single nucleotide polymorphism V-1 . 

18. A set of isolated nucleic acids comprising: 

a. a first isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains allele C at single 
nucleotide polymorphism V-1 ; and 

b. a second isolated nucleic acid which comprises at least 1 5 
consecutive nucleotides of SEQ ID NO:6 and contains an allele selected from 
the group consisting of: 

1 . allele C at single nucleotide polymorphism V4; 

2. allele G at single nucleotide polymorphism L-1 ; and 

3. allele G at single nucleotide polymorphism T+2. 

19. A set of isolated nucleic acids comprising: 

a. a first isolated nucleic acid which comprises at least 1 5 
consecutive nucleotides of SEQ ID NO:6 and contains allele A at single 
nucleotide polymorphism S1; and 

b. a second isolated nucleic acid which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6 and contains allele C at single 
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nucleotide polymorphism S+1. 

20. A set of isolated nucleic acids comprising: 

a. a first isolated nucleic acid which is complementary to the first 
isolated nucleic acid of any one of claims 12-19; and 

b. a second isolated nucleic acid which is complementary to the 
second isolated nucleic acid of any one of claims 12-19. 

21. A isolated nucleic acid which is complementary to the nucleic acid of 
any one of claims 5-6. 

22. An isolated nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NO:242-284 and SEQ ID NO:373-420. 

23. An isolated nucleic acid which is complementary to the nucleic acid of 
claim 22. 

24. An isolated nucleic acid comprising at least 15 contiguous nucleotides 
of a sequence selected from the group consisting of SEQ ID NO:242-284 
and SEQ ID NO:373-420, wherein the sequence contains at least one allele 
shown in Table 10. 

25. An isolated nucleic acid which is complementary to the nucleic acid of 
claim 24. 

26. A probe comprising the isolated nucleic acid of claim 24. 

27. A probe comprising the isolated nucleic acid of claim 25. 

28. A primer comprising the isolated nucleic acid of claim 24. 

29. A primer comprising the isolated nucleic acid of claim 25. 

30. An isolated amino acid sequence encoded by the isolated nucleic acid 
of any one of claims 2, 4, 6, 10, and 1 1 . 

31 . An isolated amino acid sequence encoded by the isolated nucleic acid 
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of claim 8. 

32. An antibody which binds to the isolated amino acid sequence of claim 

30, wherein antibody is polyclonal or monoclonal. 

33. An antibody which binds to the isolated amino acid sequence of claim 

31, wherein antibody is polyclonal or monoclonal. 

34. An antibody fragment of the antibody of claim 32, wherein the antibody 
fragment binds to the isolated amino acid sequence. 

35. An antibody fragment of the antibody of claim 33, wherein the antibody 
fragment binds to the isolated amino acid sequence. 

36. A vector comprising the isolated nucleic acid of any one of claims 2, 4, 
6, 8, 10, and 11. 

37. A vector comprising the isolated nucleic acid of any one of claims 1 , 3, 
5, 7, and 9. 

38. A vector comprising the isolated nucleic acid of claim 21 . 

39. A vector comprising the isolated nucleic acid of claim 24. 

40. A kit for detecting a Gene 21 6 nucleic acid molecule comprising: 

a. the isolated nucleic acid of any one of claims 5-6; and 

b. at least one component to detect hybridization of the isolated 
nucleic acid to the Gene 216 nucleic acid molecule. 

41 . A kit for detecting a Gene 21 6 nucleic acid molecule comprising: 

a. the isolated nucleic acid of claim 21 and 

b. at least one component to detect hybridization of the isolated 
nucleic acid to the Gene 216 nucleic acid molecule. 

42. A kit for detecting a Gene 21 6 nucleic acid molecule comprising: 

a. the probe of any one of claims 24-25; and 

b. at least one component to detect hybridization of the probe to the 
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Gene 21 6 nucleic acid molecule. 

43. A kit for detecting a Gene 216 nucleic acid molecule comprising: 

a. the set of isolated nucleic acids of any one of claims 12-20; and 

b. at least one component to detect hybridization of one or more of 
the nucleic acids of the set to a Gene 216 nucleic acid molecule. 

44. A kit for detecting a Gene 216 amino acid sequence comprising: 

a. the antibody of claim 32; and 

b. at least one component to detect binding of the antibody to a 
Gene 216 amino acid sequence. 

45. A kit for detecting a Gene 216 amino acid sequence comprising: 

a. the antibody of claim 33; and 

b. at least one component to detect binding of the antibody to a 
Gene 216 amino acid sequence. 

46. A kit for detecting a Gene 216 amino acid sequence comprising: 

a. the antibody fragment of claim 34; and 

b. at least one component to detect binding of the antibody 
fragment to a Gene 216 amino acid sequence. 

47. A kit for detecting a Gene 216 amino acid sequence comprising: 

a. the antibody fragment of claim 35; and 

b. at least one component to detect binding of the antibody 
fragment to a Gene 216 amino acid sequence. 

48. A pharmaceutical composition comprising the isolated nucleic acid of 
claim 21, and a physiologically acceptable carrier, excipient, or diluent. 

49. A pharmaceutical composition comprising the isolated nucleic acid of 
claim 25, and a physiologically acceptable carrier, excipient, or diluent. 

50. A pharmaceutical composition comprising the antibody of claim 32, and 
a physiologically acceptable carrier, excipient, or diluent. 

51 . A pharmaceutical composition comprising the antibody fragment of 
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claim 34, and a physiologically acceptable carrier, excipient, or diluent. 

52. A pharmaceutical composition comprising the vector of claim 38, and a 
physiologically acceptable carrier, excipient, or diluent. 

53. A pharmaceutical composition comprising the vector of claim 39, and a 
physiologically acceptable carrier, excipient, or diluent. 

54. A method of treating a disorder selected from the group consisting of 
asthma and bronchial hyperresponsiveness comprising: administering the 
pharmaceutical composition of claim 48 in an amount effective to treat the 
disorder. 

55. A method of treating a disorder selected from the group consisting of 
asthma and bronchial hyperresponsiveness comprising: administering the 
pharmaceutical composition of claim 49 in an amount effective to treat the 
disorder. 

56. A method of treating a disorder selected from the group consisting of 
asthma and bronchial hyperresponsiveness comprising: administering the 
pharmaceutical composition of claim 50 in an amount effective to treat the 
disorder. 

57. A method of treating a disorder selected from the group consisting of 
asthma and bronchial hyperresponsiveness comprising: administering the 
pharmaceutical composition of claim 51 in an amount effective to treat the 
disorder. 

58. A method of treating a disorder selected from the group consisting of 
asthma and bronchial hyperresponsiveness comprising: administering the 
pharmaceutical composition of claim 52 in an amount effective to treat the 
disorder. 

59. A method of treating a disorder selected from the group consisting of 
asthma and bronchial hyperresponsiveness comprising: administering the 
pharmaceutical composition of claim 53 in an amount effective to treat the 



-294- 



WO 02/083077 



PCT/US02/12063 



disorder. 

60. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6, and contains at least one allele selected from 
the group consisting of: 

a. allele G at single nucleotide polymorphism F+1 ; 

b. allele A at single nucleotide polymorphism L-1 ; 

c. allele G at single nucleotide polymorphism L-1 ; 

d. allele T at single nucleotide polymorphism M+1 ; 

e. allele C at single nucleotide polymorphism Q-1 ; 

f. allele G at single nucleotide polymorphism ST+7; 

g. allele A at single nucleotide polymorphism ST+4; 

h. allele T at single nucleotide polymorphism T+1 ; 

i. allele C at single nucleotide polymorphism V-1 ; 
j. allele A at single nucleotide polymorphism 11 ; 
k. allele G at single nucleotide polymorphism S1; 
I. allele G at single nucleotide polymorphism S2; 
m. allele C at single nucleotide polymorphism S2; 
n. allele C at single nucleotide polymorphism T1 ; 
o. allele T at single nucleotide polymorphism T2; 

p. allele C at single nucleotide polymorphism V4; and 
q. allele G for single nucleotide polymorphism V7; 
wherein the presence of the nucleic acid identifies an increased 
susceptibility to the disorder. 

61 . A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which is complementary to the nucleic 
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acid of claim 60, wherein the presence of the nucleic acid identifies an 
increased susceptibility to the disorder 

62. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises two regions, including: 

a. a first region which comprises at least 1 5 consecutive nucleotides 
of SEQ ID NO:6 and contains allele A at single nucleotide polymorphism ST+4; 
and 

b. a second region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains an allele selected from the group 
consisting of: 

1 . allele C at single nucleotide polymorphism Q+1 ; 

2. allele C at single nucleotide polymorphism KL+2; 

3. allele G at single nucleotide polymorphism ST+7; 

4. allele C at single nucleotide polymorphism V-1 ; 

5. allele C at single nucleotide polymorphism V4; 

6. allele G at single nucleotide polymorphism F+1 

7. allele G at single nucleotide polymorphism S1 ; 

8. allele G at single nucleotide polymorphism S2; 

9. allele C at single nucleotide polymorphism V7; and 

10. allele A at single nucleotide polymorphism 11 ; 
wherein the presence of the nucleic acid identifies an increased 

susceptibility to the disorder. 

63. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises two regions, including: 

a. a first region which comprises at least 1 5 consecutive nucleotides 
of SEQ ID NO:6 and contains allele A at single nucleotide polymorphism 11; 
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and 

b. a second region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains an allele selected from the group 
consisting of: 

1 . allele A at single nucleotide polymorphism ST+4; 

2. allele T at single nucleotide polymorphism V3; 

3. allele C at single nucleotide polymorphism V2; and 

4. allele A at single nucleotide polymorphism V1 ; 
wherein the presence of the nucleic acid identifies an increased 

susceptibility to the disorder. 

64. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises two regions, including: 

a. a first region which comprises at least 15 consecutive nucleotides 
of SEQ ID NO:6 and contains allele G at single nucleotide polymorphism T+2; 
and 

b. a second region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains an allele selected from the group 
consisting of: 

1 . allele T at single nucleotide polymorphism Q-1 ; and 

2. allele A at single nucleotide polymorphism V-1 ; 
wherein the presence of the nucleic acid identifies an increased 

susceptibility to the disorder. 

65. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises two regions, including: 

a. a first region which comprises at least 15 consecutive nucleotides 
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of SEQ ID NO:6 and contains allele C at single nucleotide polymorphism V-1; 
and 

b. a second region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains an allele selected from the group 
consisting of: 

1 . allele C at single nucleotide polymorphism V4; 

2. allele G at single nucleotide polymorphism L-1 ; and 

3. allele G at single nucleotide polymorphism T+2; 
wherein the presence of the nucleic acid identifies an increased 

susceptibility to the disorder. 

66. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises two regions, including: 

a. a first region which comprises at least 1 5 consecutive nucleotides 
of SEQ ID NO:6 and contains allele A at single nucleotide polymorphism S1 ; 
and 

b. a second region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains allele C at single nucleotide 
polymorphism S+1; 

wherein the presence of the nucleic acid identifies an increased 
susceptibility to the disorder. 

67. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises two regions, including: 

a. a first region is complementary to the first region of any one of 
claims 62-66; and 

b. a second region is complementary to the second region of any 
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one of claims 62-66; 

wherein the presence of the nucleic acid identifies an increased 
susceptibility to the disorder 

68. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
forthe presence of a nucleic acid which comprises three regions, including: 

a. a first region which comprises at least 1 5 consecutive nucleotides 
of SEQ ID NO:6 and contains allele G at single nucleotide polymorphism S2; 
and 

b. a second region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains allele G at single nucleotide 
polymorphism ST+7; and 

c. a third region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains an allele selected from the group 
consisting of: 

1 . allele T of single nucleotide polymorphism T1 ; and 

2. allele C of single nucleotide polymorphism V-1 ; 
wherein the presence of the nucleic acid identifies an increased 

susceptibility to the disorder. 

69. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
forthe presence of a nucleic acid which comprises three regions, including: 

a. a first region which comprises at least 1 5 consecutive nucleotides 
of SEQ ID NO:6 and contains allele G at single nucleotide polymorphism ST+7; 
and 

b. a second region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains allele T at single nucleotide 
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polymorphism T1 ; and 

c. a third region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains an allele selected from the group 
consisting of: 

1 . allele G of single nucleotide polymorphism S2; and 

2. allele C of single nucleotide polymorphism V4; 
wherein the presence of the nucleic acid identifies an increased 

susceptibility to the disorder. 

70. A method of Identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises three regions, including: 

a. a first region which is complementary to the first region of any one 
of claims 68-69; 

b. a second region which is complementary to the second region of 
any one of claims 68-69; and 

c. a third region which is complementary to the third region of any 
one of claims 68-69; 

wherein the presence of the nucleic acid identifies an increased 
susceptibility to the disorder. 

71 . A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises five regions, including: 

a. a first region which comprises at least 1 5 consecutive nucleotides 
of SEQ ID NO:6 and contains allele G at single nucleotide polymorphism S2; 
and 

b. a second region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains allele G at single nucleotide 
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polymorphism ST+7; 

c. a third region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains allele T at single nucleotide 
polymorphism T1 ; 

d. a fourth region which comprises at least 15 consecutive 
nucleotides of SEQ ID NO:6 and contains allele C at single nucleotide 
polymorphism V-1; and 

e. a fifth region which comprises at least 15 consecutive nucleotides 
of SEQ ID NO:6 and contains an allele selected from the group consisting of: 

1 . allele C of single nucleotide polymorphism V-3; and 

2. allele C of single nucleotide polymorphism V4; 
wherein the presence of the nucleic acid identifies an increased 

susceptibility to the disorder. 

72. A method of identifying increased susceptibility to a disorder selected 
from the group consisting of asthma and bronchial hyperresponsiveness in 
a subject comprising: testing a biological sample obtained from a subject 
for the presence of a nucleic acid which comprises five regions, including: 

a. a first region which is complementary to the first region of claim 

71; 

b. a second region which is complementary to the second region of 
claim 71; 

c. a third region which is complementary to the third region of claim 

71; 

d. a fourth region which is complementary to the fourth region of 
claim 71 ; and 

e. a fifth region which is complementary to the fifth region of claim 

71; 

wherein the presence of the nucleic acid identifies an increased 
susceptibility to the disorder. 
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73. A biochip comprising the isolated nucleic acid of any one of claims 23 
and 24. 
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>BAC1098L22 sequence 

gctctaataaatttgcggccgctaatacgactcactatagggagaggatccgcggaattcccccatgtgccatgtccacg 
agatggttgaacagatgagacaacatggttgtgcagcttctctctttttttttttetcagacagagtctcactctgtagc 
ccaggctggagtgcagtggcgcaatttcagctcactgcaacctccgcctcccaggtcctgattcaagcagttctcctgcc 
tcagcctcctgagtagctgggattacaggaacgcgccactatgcccagctaatttttctaggtttagtagagacggggtt 
tcaccatgttgaccaggctggtcttgaattcctgaccttgtgatccgcccgcctcggcctcccgaagtgctgagattaca 
ggcatgagccaccgcatccggccatgcagcttctcttttctagagttaacaggagatgcggcaggtctgtagtagcccag 
aaattctcagacttgcactcttgaatttacacgcatgtacaaaatgcagctcagcctaatgcctcctgttcagttctgtg 
gtgctgcctacttcctgtgcccaaattagccttgcgtttataactaggaggaatgacttggtgtcctacaatattttggc 
tcctgggcttctttggtgttcattctattactagttgattttttttttcttggaaagaaaagtattcaaagagccagaat 
ttactttcattatatcttacagttgttaaattaacctatccttgttttcactttctgtgtacttctctctttggggtaca 
aaacagtgtttctttagattgtctatttctaaatattattttaaccagaacataccaaatatattttcctgccagtttca 
ttcttcattttacttcttaaccattgttacgattttttttttaacttcggtctcagattttgctcaactgaagttgtccc 
agttcatgagattttgttttctttgcagctcttgaccatgacagatgtgaccagcacacagattgttacagctgcacagc 
caacaccaatgactgccactggtgcaatgaccattgtgtccccaggaaccacagctgctcagaaggccaggtcagaggct 
gtttcttaaagatttcagaaaaatcccaacttgtcataggtttagcttttatagtgtatatggtataaataatggcccag 
agttacttttcaaatgggtttctatttggattttattatccctgaggttttcctttaggaagagatgttctgtatatttc 
aagggtcctactagccctgggaatttgtgtattgtgttttagaagaaggtaggactgttgtccctgggcatggctgtgac 
taagcactggatccttggttcgaggcatccgatagtgacctcactttaccagtaccaggttttgacaaagttagggtttg 
agagtgagtcaggtgtacagcggcccatattgaactgtggaaatgacagactttgcaaaatctccttgttttatatcttg 
gtttggcataatctcactgctcatcatatatagatttttaaatttaagttataagatgacccaggcttttatagtttttg 
acagcttacaagacttttttttttgtcagtctcatacagttcataaaatagaaaactttcagacttttgaagtgagcatt 
tgaaaagcaccaagttcaacattcccattttacagatgagcaggttgaggcatggtgtttaaaagagtgggctgcttcct 
tgccagttaaaagcatgtccttgagcagaacattccattgcagtgttccccactgcagagctactgcaacttatctatct 
atctgtctgtctgtctgtctatctatctatctatctatctatctatctatctatctatctgtctatctatttatttgcag 
tttgcttcctcaattagattttgttgtctataatatcctgttggtcctcaccataagttctctctgtggatacaaaagaa 
aagcttttgtccactgtgttctagacctccatttttaggtatgagaatcgccccaaggataaccccatgtactactgtaa 
caagaagaccagctgcaggagctgtgccctggaccagaactgccagtgggagccccggaatcaggagtgcattgccctgc 
ccggtaggccttgcagggtcatcttggtgtgtgtgggtccattacttcagcctgcttcccccaacactgtgcagcctaag 
ttgaacctagcagaggggaagagctaattctgtccattcatcccccacacgagtattatgggcttttttgtttttaacta 
aaatacagttcttaagtatttgttcctactgtcctttgaaataaagtgaaacatcctttgctgctctgtagaattgagtg 
acagcttggttctgtatcactgagcctgctccttgtctctttcccatccacctttgataacctgggactagaccatgatg 
tctcagcagtcagtctgctgagcactttatggagagtacttcttattaaccactgggatttaattgttggcacctgctaa 
tgggccttctctgagaaggagaggatagatacttctgtcagcagcaccttttaggggtgatctccagccctgaaaacctc 
aatatatcctgcttctgaggttcaggatgatgaactcagggcctgagaccagccagccatgtgatatatttggaccaggg 
tggtccagaaaggggaacttccttgtctgtgcacaacatctgagcttgttcagggaagttggtttggaccaggccctttt 
tgaatgtcccttggaagtttttgattcagcttgcagagtgggactcttttctatatctcagtgtgtctcaaattttaagg 
tacaggaaaagacctgggaatcttgcgaaaggcatattctggtgcaagtgtggggtggggcctagagtgtgcattttatc 
tagtcgactgctacactgtgaggagcaagtcttcgtctgttcttaaatgatctctttcccatggtacctttcttttatct 
cagtgactgttactgttaatgaacattgttgatgtctccaaagtactttggttctggtgaagttgctttgttctttcatt 
gtcttccaggaagcattcatagcttctgtgcagtaccttgtgtgggttcaggatgatcacaggtagcagattacaagctt 
gtcttgtatgctatagccatatcacttgggctgcttctcaagaaggaccttctcaccttgcttttgggatgctttgtaca 
cttgattgtaccttccacctgatgatatgaaaacagtgcagcttttggagactatagatttgttaattccttgattcatt 
tccattcttgcagtttttaccccagccctccaatatgcatattcatttgtctgctcttcacttaggattttagttttcta 
attgttcttcagaaggaagtgtaccagtctaacattggcaccaaactggtgttttcatctaagacataggataagtgacc 
tcagaatatgctttttaggatccgggagatatcaccagtaaacatttcaaaattcttgtattctgcatttggtccttaat 
aatgtgtcagaggctcccacatcctaatgaagtacctagaatttaaattagaaaggccatttcggtattcagtaatttga 
actcataatacagtagtttcgtctgatttctaaaattcttttctttctcttttccccttaatgaaagaaaacatctgtgg 
cattggctggcatctggttggaaactcatgtttgaaaattactactgccaaggagaactatgacaatgctaaattgttct 
gtaggaaccacaatgcccttttggcttctcttacaacccagaagaaggtagaatttgtccttaagcagctgcgaataatg 
cagtcatctcagagcatggtgagttaaaaccctcaaaacttaagtttctggttatccacctttctaccaagggcatgact 
gcagcttgcatgtggaaggctgtggatatgtgtaacgtgcttggcaagaaggggagtgctggtgaacgcagcctgaggga 
ctgtgggtttgtgctgtcagagtctcttcctcttaaaatttttaatactttgtatatataagatctatgaataattatat 
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gagggatgaattgtaacatgtatatgtgtacataatctggtgacaccagcagattacttcatacctgttttacctctgga 

ttctgctaggggagaaagagaggtcactgataattagctaggttggattaagccacctgagttccttggagttaaggtat 

tataatagtgcataagactgtataattaccactaagaagtgtacatctcagctggatgtggtggctcacacctgtaatcc 

cagcactttgggaggctgaggtgggtggatcgcctgaggtcaggagttcaagaccagcctgaccaatatagtaaaacccc 

gtctctactaaaaatacaaaaattagccaggcatggtggtgtgcgcctgtaatcccagctactcaggaggccgaggcagg 

agaattgcttgaacccaggaggtggaggtggcagtgagctgagattgcgccaccgcactccagcctgggcgacagagcga 

tactccgtctcaaaacaaaaaaagaacagcaaaaaaaagaaatgtacatctccttgggtctttcatagcccagccatctc 

aaaagaagagagcaccttcttgtcaagagtttctaagcccagaaaggctcaagttctctgcttgtccacccagtgctctc 

agggggcttatagtcaatattccatgatcacattttgtcatttttagtctggagtcataaattgtgatcccaccgtcagt 

taagtagactcatagaacaaagctctttcagcagtttcagctgtggtacagaaacgtttagtggaaatgttcttaccaag 

cagggaggagttgagggcaacacttccttgggtacagcctccttcatgtgaaggtatggaaatgtggcctgggtctcggc 

tgcctgtggcctctgtgtaccacctatagggccattctgagactggtaggaggtgccctgtatttagttttctccaatta 

gtcccttttttcagtgcaacttagatggggtatggacactcaaacattggtgacatattcttagtgtgtttacctcaggc 

tactgtgaccacatttggtatttcataatattttgatagctttttcagatttcagaatctccattggtgactgtctctgt 

tgtttctctttccatgtccaaatgtgggtctcttccagcattccatccttggctggcagctgacctttcccatagttatt 

cactccctcagaaaatggatggcacccagctttgcttattaccctggtgtctctaacaatgactctcgagtccacggaag 

ttaaaagggttcaaccaggtggccacagatatacttctggtaccctttctctcttccttaggttttctaactctaaacgt 

ttctgggtattctaatctgctgtggccacgtttatgaaacagaaattcacagtcttaatgataagactgacagatgagag 

acaactgaagtgtaatgtccttccacagctatacctctagatgtagcccagttagtagagcccagatttttatggaaaaa 

caagaaaggacacctagcctaacccttaggacagaggctggctggtagaaagctggaaggaggtgacccctgcaggtgag 

aaggagtgaattaggatgtcgaagacggaagggctttctgtgatttaattagtgcccccatctgtgagatgtagagggag 

atgattaagggagtggctctttgagtgagctgcaggtaagtttgcatggttggctgcaggctggatgggaggggattttt 

atagttgagcctcaggaaggaaccaaggccagaccctgcgaggccatggctacaatagtaatggattcgaattgtatcgt 

gaaggcaaagaaattattgaagggcctttaaaaagtatttttaactgtctttctttcttttggatctgtttatttttagg 

tcttttagtaggcatgtgtattttttcctctcaaaaatggaaataggctgggtgtggtggctcctgcctgtaatcccagc 

attttgggaggccagtgagggaggattgcttgagcccaggagttcaagactagcctgggcaacacagggagacctcgtct 

ctacaaaggaagtttttttaaagaattagccgggcatgatggtggcacatacttgtagttccggctacttgggaggctga 

ggcaagagggttgcttgagcccaggggtttgaggctgtagtgagccatgatcatgtccctgcactccagcatgggcaaca 

gagcgagaccctgtctcaaaagaaagcaaagggagggaaatacagtatatcttttgctttataactaccaaaattaggaa 

tacttaccatttcttggctaaactttatactttgatctttaaaacttgttaaaaattgcaatgagaaggaaatttcagga 

gagcagaagacagactgtcccaggtgtcactgtcctattattccctataaaatccagtgccaggatggatgaatggataa 

agcaaatgtggtataagttgaatatcccttatctaaaatgctttggacgaggagtgtttcgaattttagaatatttgcat 

tatactaaccagttaagcatccctaatctggaaatccaaaatgctctagtgaacattttccttcagcatccatcatgttg 

gcactcaaaaagttttagatttcggagtgttttggatttcagattagggatactcagcctgtgtttgggggtagccatct 

cttcatatagacatttcagaacttaaatattgctttgccataatttctgtgaatttttgatatattatcttctctgagct 

acatttttatcctttataaaatggccatattgaagtgatgatctatcctaatctaccatggctgagtcaagggataaaga 

ggttttcctgtgtctgtggggtatacttaacttggtggtttttatctagaagcttgttttggtcaagatgttggttatat 

tcaggccaggcatggtggctcatgcctataatctcaacattttgggaggccaaggtgggaggatcacttgagctcaggag 

tttgaaactagccagggcaacatggcaagactccatctccaaaatttaaattaaaaaaagatactatctgtattcatagt 

tgtgtctcttttgcctttagtccaagctcaccttaaccccatgggtcggccttcggaagatcaatgtgtcctactggtgc 

tgggaagatatgtccccatttacaaatagtttactacagtggatgccgtctgagcccagtgatgctggattctgtggaat 

tttatcagaacccagtactcggggactgaaggctgcaacctgcatcaacccactcaatggtagtgtctgtgaaaggcctg 

gtaagttcacaggtgaattaggtggtattcagagtttattgtgagagaaaccataggaggcatagttcattgctgagatg 

tgtgaagtagtcatgaaaacagatgaagtattgatttcaagcatgcaaagaagagtataacccagatttcagaagcagaa 

ggaaatattctgggaccctgaatagttttaattacaagcaaaactaaaaataactaacactactcgaagaaactgatatt 

ctaattaacaatgagattgataggtttattgaccagaaaaagtattgagaattgctctgaaaagcaaatttattggtgtt 

ggcagagaaatgctgtggaagaagaaacaaaaaaggaaaataaaaccaaagaaaagattagtaaagcaagcaagtgactg 

cagggacagtgttcagaaaggtagtgtcaacagggagaaaaatgatgagagcagttctgtaagcgaggacagaagcaacc 

cagaactacgcgagagctgcaggagagtactgagcagacagacactcggagtagcttcccagctttcagtctctctgggc 

tacattttggctaactaaaggcaggcccaggggtagctgctgcagcatgaagacaactcaaaataagccccctcctcctg 

cagagggactcacaagcagtcggtggaattgtgggttattttgggaggctggactccatttgcattgcgtccaattttgg 

gaaagtaatttgtctgaggaattacagaggtataggagagaattcatagcattgtagatttctaaatattgatttctaaa 

cttctctaggggagctgcccagcagcctttcagaattctgaatcctttgttgaacttaatgaatgccgcagaccttcttc 
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cctccaaaattgcaaacatgagattttacatattcaagtggattgtaaaacccctaaagttcatcctgggtccccaaatt 

ctaaggggcttacagccctacttttatggaaagattcactgtttcctcatctttgtccttaatgctctttgaaaagaaaa 

tttattttttccacaagtgtgtaaatacacttgactaaacagtacttgatgacttttactgttctctattttctctcatt 

taattgatttcatggcacattaatgggtcatcacccacatttgaaaagtcttggctgggcacagtggctcacgcctgtaa 

tcccagcactttaggaggccaagacgggcggatcacaaggccaagatatcaagaccatcctggctaacatagtgaaaccc 

catctctactaaaaatacaaaaaattagccagacgtgatggcgcacccctgtagtcccagctacttgggaggctgaggca 

ggagaatcacttgaaccggggaggcggaggttgcagtgagccgaaattgtgccactgcactctagcctggagacagagtg 

agactccgtctcaaaaaaaaaaaagaaaagaaagaaagaaaagtctttgaccttagcggacatggtgggagctctaagtg 

tctctcttgggtttcattcccagcaaaccacagtgctaagcagtgccggacaccatgtgccttgaggacagcatgtggag 

attgcaccagcggcagctctgagtgcatgtggtgcagcaacatgaagcagtgtgtggactccaatgcctacgtggcctcc 

ttcccttttggccagtgtatggaatggtatacgatgagcacctgcccccgtaagtgaaaaagggagccctaggcacttat 

gcatgccctctgtataggcaacaactcagccatgaggctgtgctgtcagcctctgaacattttagaaacaagactggaca 

tgacctctgctcaaacctgaccagagactgccatcgagaccttgctgcctattgagaaccttcatacagaatcaggcaca 

ttgacagtaaataaatgtaagatagatcacagagtacagaaataacttgtccaacttcagtgtcatattgctcaatccat 

gtaatatctccacatctgaatttcctaatttgtaaagtaaatgctttccaatagataatctctaaggtcccttttgcctt 

caacatcctgggattgagagaggagggaagggtcatctctgttatgtattgggcaaaatactgggctctttacattcatt 

atctcttttaaataatcaagacagaataatatttttgactcaagccagttgaatagtctgttaaaaaaaaagtaaataca 

gtgaattcagatctacctgtgatagtcaattgcaactttttttccttaatagctgaaaattgttcaggctactgtacctg 

tagtcattgcttggagcaaccaggctgtggctggtgtactgatcccagcaatactggcaaagggaaatgcatagagggtt 

cctataaaggaccagtgaagatgccttcgcaagcccctacaggaaatttctatccacagcccctgctcaattccagcatg 

tgtctagaggacagcagatacaactggtctttcacccactgtccaggtaagatgccttgcatatccaaattcaagtgttt 

cactactgatttatgaagaataaaaccttgaaagctacgt tgtgtatatgtaactccctgccctcagcccctt tccttcc 

tcctaatggttggtacaagaaggaatagaccagaagctggtccaaggcctgacctggacctgctgagagtggtggtgggt 

tcctaagaaaccaattctaagaaattggcctt tgattcagacttgaagtgaccactcagcaatgtgtctgtgggtttcta 

gaacagttgggagaggctgggctggtgcaaagactcctcagagattagcagtcaagaacttctctaagagcctgccattg 

acaacagggctgtttgtgaggactttgtaagggaaagtccactgtaaacaaagctaaaagggcagagacagactgggaga 

aaatacctgcactgcatgtaacaactgatgatcatccagaatatgtaaactcccaccctccagggaaaaaataagaagct 

aaatgtgaacccaacagaaaaagtggttattggagataaagaagccattctcagaaaaggaaatagaacaggaaaatgta 

aactaacaccaggaaccaattttttgtctactaaactggatcaaattttccgtgttcccttttttccataccaagatacg 

ggggacctgcaattctatttttaatctgctaggagttaactttttaacgaaatatttaaatctctgctttttcatgaata 

tcacgaatatatctggtaaaatgacaacccagagaatgggagaaaatatttgcaaagtatatatctaataagaatccaat 

gtccagaacacgtaactcttaaaactcaacaatagaaagacaacccaattaagaaatggataaaggatttaaatagacat 

atgtccagagaagaaatacaaatggccaataagcacatgaaaagatactcaatatcattcatcattaccaagaaatgcaa 

gtcaaagccacaatgagataccactttacacccactgagatggctgtaatcaataaaacaggtaataacaagtattggaa 

ggatatgtagaaattggaactctcatgcaggttggcaggtcctcaaaaagttaaatatagagttatcatacggcccagca 

gttttactcctaggtacatacccaagaaaattgaaaacatatgttacccaaaaacttgtatataaatgcttatgcttata 

gcatcattcttcatagcatcatgctcaaagcaacattattcataataagcaaaagtgaaaacaagccaaatacctgttag 

ctggcaaatggatgaacaaagtgttgtatattcatacagtg'taatgttatttggcaataaaaaggaatgaagtactgaca 

tattttctgacatggattaccctaaaaaaacatgctatgtaaaagaagccaggtgcaaaagattatgcgttgcatgatgc 

catttatatgaaatgtccaaagacagaaagtagatgtttagtggtttcctagggctgggcatggtaatgaagaataatag 

gcatgaggttttctagagtagctgcagcattattttctgacatggatcaccctaaaaaaaatgctatgtgaaagaagcca 

ggtacaaaagattatgcattgcatgatgccatttatatgaaatgtccaaaggcagaaagtagatgtttagtggtttccta 

gggctggggatggaaacaaggaataataggcatgaggttttctagagtagctgcagtattattttctgacatgaattacc 

ctaaaaaaacatgctatgtaaaagaagccaggtacaaaagattatgcattgcatgatgccatttatatgaaatgtccaaa 

ggcagaaagtagatgtttagtggtttcctagggctggggatggaaacaaggaataataggcatgagattttgtagagtag 

ctgcaccattttatgctctcaccaacatcatatgaaggttcatatctttccacatccttgccaatacttgtcactatctt 

tttaacgaaaaccgtcctagtggatgagaaatggcatctcactgttgttttcatttgtattttcctgataactaatgaaa 

ttgaacatcaacttcattagttagccttttgtatatcttctttggagaaatatttagtcaaattctttgcccatttttca 

gttatgttgtcttttttattattgagttgtaagagttctttatgaattctgaagtcccttattggatatattatttgcaa 

atactttctcccattatatgggtttcttttcactttcttaatatgccttttgaaatcaaaagttttcagttttgattaag 

ttccatgtatcagtgttttattttatcccatgtgctttttggtattgtatctagaaaatcagtgcccaagtaacccagga 

tcacatagatttgctcctaagttttctttgaaaagtttcatagatttgtgtgttacattaggcccttgatccaccctgag 

ttaatctttgtgtatggtgtgaggtaggggtccaaactctctttttgcctgtggatgcggtcctgcaccatttgttgaaa 
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agatttttttttttttttaaccattgaattttcatggcacatttgttgcagatcagttgactgtgaatctaggaactttt 

tcctaagctctcacttttgtgcagttgaccaacgtttctccttttgccaatattacactgccctgattactgtggctttg 

tacaagctctaaaattgggaaggctaagtcccccacctttgttctcattttgcaagactgttacagctactctgggtttc 

ttgcatgttcatattaattttaggatgagcttgccattttctgcaacaacaaaaagccaactggtttgataaggtttgca 

ttgaatctactgaccaatttggggtgtattgcctgtttgtttgtttgtttgtttgtttgtttgtttgttgagagagggtc 

tcactctgtcacccaggctggagcgcagtggtgctatcttggctcactgcagcctccgctttcccaggctcaagtgattc 

tccagcctcagcctctcgagtaactggtactacaggcatgagctaccaacacccagctaatttttatatttttttagaga 

cagggCttcgccatgttgcccaggctggtcttgaactccCgagctcaaagcaatcctcctggcctttgcctccctagctc 

tgggattacaggtgtgagccaccacgcctgggccccttgccattttaacaagggttaatcttccagcctgtgaacctgga 

atgtcaatttattaccctctttactttcttttggcattgttttatagttttcagtgtacatcattgtcttgttccttttt 

ttaggggaaaataattcagtctttattaaatatgatatttgtttagttttttttatacatgccctttattgggttgagaa 

agtttccttctattctagttttttgagggttttgtaatgaatgagtgctagattttgtcaaacactttttctgtgtctac 

ttagatgatcacgaagttttgttctttattaatactatgtattacattaattaagaatgttaaaccagttttgcattcct 

gggataaagcccatttagtcatgatgcacacctttttatcatgatgtatatttttaatctgctgttggattctatttgct 

agttggggattttgtgtgtacattcatgagggggatataagtttgtggttttcttcttttgtgattactatctggttttc 

atagcagggtagtactggcctcctagagtaagttggaacgtattcctcctcctctatttactggaagagttcgtcaagga 

ttagcattaattcttctttagatgttgattgaattcaccagtgaaaccaaatgggcctggcctttttctttacaggaaaa 

tttttaattattaattcaatctgtttgttatagatctattcagattttctttttctgagtgagtcagttttggtaatctg 

tttttctaggaatttgtctattctcagtaatcaaatttgttaacatacagttcgtagtattttcttttgttttttttttt 

ttttctttgagacagagcctcactctttcacccagtctggagtgcagtgacgtgatctctgcttactgcaacctccgcct 

ccagggttcaagtgattctcctgcctcagccccccaagtagctgggactacaggcgcacgccaccactcctagctaactt . 

tttattttgttttgttttgctttttttagtagagacagggttttaccatgtagaccaggatggtctcgatctcctgacct 

cgtgatctgcctgccttggcctcccaaagtgctgggattacaggcgtgagccaccacgcccggccagttcacagtatttt 

cttgcaggttgctctttagcaatatgtagtctcctgtctaacctgttaacgatttttttttctgtctaatttggttttat 

tattctcctacattttgcaagggctctaaatattgtgagggtttttttttttaagagcttgctctcttatattgtggatg 

caataatttaagccttattgaatgtactaaaattcgtatttttcattctctcatatctcttgcattaacccttccttcag 

gatttgttgctgtgcgtgtttatcctggtggtattcggagattgagagttttcttatactggttaatcctcaaggttaat 

ttgtattaataactaaatgtgtagatctgtcaatattggtcagtggttgggtgtgctgagctgttgtgaaagtgtggtat 

gatgtcctggaaaagctagcaagccagcctcaggacactccagctctggtccacttcgactgttgttgactggtgtgcgc 

atctccctcattggaagaaggaaggagcaatggacctgggtggctgggtgcaattagcatgttcatgggtgtggcaagtg 

cctcccagccttgggtggtagggccacctaaagatagctggcacattgccttgagttttctttcattcttgcttgcttgc 

ttgcttttatttttgagatggagtttcactcttgttgcccaggccggagtgcaatggtgctttcttggctcactgcaacc 

tctgcctcccgggttcaagcggttctcctgcctcagcctcccaagtagctgggattacaggcacctgccaccacctctgg 

ctaattttttgtatttttagtagagacagggtttcactgtgttggccaggctgatctcgaactcctgacctcaggtgatc 

cacctaccttggcctcccaaagtgatgggattacaggcatgagccaccacacccagctgagttttctttcaaacactcaa 

atactaacaggtgcataaacagaacaagtacaaggctatggaaggttgccaagatggtgaaaaactgcatcacactgatc 

aaccaaggcagatctgaagaatcagacctcgcttacagtgcaggttgtgcactatgtaaatgtttcccccatcctattcc 

cccctagagttctgcagtgcacagcctgtgcagctgtcctgagcagtcctcagaggtccaagcttcatatacctttctag 

ttgtaaagccctatatccattaggagtcttagtaatcccacttgtggaaatgtttatccgtaaagatggttatttttcta 

ataatgaaaaatgtaaagcaaactaaatattagaatataagggaatggttaaataactgttaagattcatcaaaatcatt 

ctaccatttgaagttttgtttataattgatgaaataaggtggtaaaatagttgtgatctaagttaaga^aggegaatatt 

aatatgaacataattgtgcaaattatgfctcttctggtatagaaaaataccagaagaaaatatatcaaatgtgaacagtgg 

tctctgagtggttggattgtatatgacctccccccatccttctattttttaagtatactgtcatgagcatacgttatttt 

ttagtgaataacaacaaaaaatattttttaatatcactatgggctttcaccttgctgtgatacattttcaattttcctgg 

actagctcttctttgatattttttatcttatttctcaaattagtttttagtgggccagttttggtgatttatcaaagata 

aattaactacagagatagttgcagataaatcttattgaacttaactacatgcagttcttccatggtacagaccccttgaa 

actctttttcttgcagcttgccaatgcaacggccacagtaaatgcatcaatcagagcatctgtgagaagtgtgagaacct 

gaccacaggcaagcactgcgagacctgcatatctggcttctacggtgatcccaccaatggagggaaatgtcagcgtaagt 

caaattggtcaggtttactcatggcaaatcggtgtggaacacagcacttctatttgacttgaatacctaggaggaaaaaa 

gccactttgttttggataccacatttcttattaaatcatactggtcagaagctcagctgagctggaagaaagaaccaagg 

tttcagtgtagctggttaaagcaacaagccaaaaatgttagagtcaagcctttgaccacattctgcagtggtataagcaa 

gggcccaaaaccagagaaagtgccaacttcccctgtttcctttcctcccattgaacagggcacaggtcaatagtcagagg 

gatcagctcagacatgggagtcttggggctcatctcactgaggcccggacaaaaggttccagcagtatatggacactaag 



FIG. 7.4 



WO 02/083077 



12/157 



PCT/US02/12063 



gttgtgggaagggcaaggtgtaagggctaggactgggaaagtcctgggttggagtcagagttaggaaagtgtcttctgaa 
ttccccctcctccccccatcaggaggccttggtgcctgaagaagacctcagcccaaggcctcagatagggagcctgggaa 
taaaggctccagggctaggaatgtgaacatgtgaaggagcgtgacccgtgaggtagacctgagtccttaactgcacctct 
ctcagagccccacacttaggcccacctcacaggtcatgctctgccaggtaaagcctgtggcattgcctgtggtcaggccc 
tgaaaaatcacacacgggctttttaaccaggagccagactcctcaatacctatagtgtattttaaaggtttaaatagtcc 
cgtgttggctgggtgtagtggctcacgcctgtaatcccagcactttgggagggccaaggcaggtggatcacttgaggtta 
aggagtttgagaccagcctgaccaacatggtgaaaccccatctctactaaaaatagaaaaattagctgggcatggtggcg 
g999 cct g taa t c caagccactcaggaggctgaggcatgagaattgcttgaacctgggaggcggaggttgcagtgagcta 
agatcacgccactgcactccagcctgggcaacagagtgagactcagtctcaaaaaaaaaaaaaaagaaaaaaaaaaaagt 
cctgtgtttccccatttatgtcaaatagaagcctgcaagcaataggacattttattatgagaacaaaaattatgacaagt 
gatattaatgaactgctttttttattctagtttccagccaaaaaacataatgagtactatagtcgaaagacttttcaaag 
ttctgagcaggaaaagtaaacaacatagggaaatctctacctactccccagatcccaactccaaactctttggcatggcc 
cttcagaatctgatctgatccaagaccagccaggagagaggcagcgttctagctgtgtgtcccttactctccctggggtt 
cagtttcctcctaataatagggtggttgttaagatgaaagcagagcagcacatgacacacgttccacaggatactagtca 
atacttagtagatgtgggcaattcttattacccttctaatgccattctccactactcctaagtaaaagttcttgctcaga 
ttaaaaggtatttggtgaaaatgttctccccactctttgtgaatagacttaatccgttagacagtaaccagagtacctaa 
cagagtggtggagtgtacaaggcattgtgccgttacccactcaagacacaggtgcttttattatccccaagatcaagtaa 
actgcccagggtctcacagtgagatgtggccaagctaagaaccacgcccagtctgtgctgaccctaattacttggctggt 
tgtgcttccagatgccatcatacccacatcagcagctgctatctagaagcatcacattttcctctgtgagatctacaggg 
cctgagtacaatttgccttatttttcagagtcctaatccacggtagaaagcgtggtgtttatagaatactgcagatggca 
ccaagtctttgatgttttcttcttaaaattgtatagtccttaggtaacgatagtagccatgatttcttgaacgcttactg 
tgtttgagacatcatatacatatcatcacaaacacccctatgagataggtactgttgttatccccatcatatggacaagg 
aagctgaaactcagaaaggttaagtagcattcccagagtgacagaagagccaagtagaggacgctgaattcaaacccagg 
cagatggacgtaaggcctttgcttttcactatacattacacaccttcccccagtctcaacagagccaagtgtcagacact 
catgattctgacttcagcatccattctgttaatctctgcactattctgttttatagttttctttgggtgattccaaagga 
tttatttttaaagcatacctctctccagctgaatgcctttcatttattcaccagcaaaacagggtataaagtgaaaaggt 
gttgaccaaaaaggctttcacttttttcaactggagtataatatttatcacggcttgtattacgttggatgataaaagga 
gagatgttgagttggccagatgagggagacgggaagagctattttctgtaaaggcaaaggaaaaggtgaagcaagggtgg 
catgtcctgagggcccctaaagcgctgaccttggcattggatcctggttctgagccctgcggtgattccaaatgtgacca 
tcgtggagaaggcacctcagccaagaccagccctccttcagaggacagtcacctccagagcacctccctaggggcagcag 
ctatgtctgtgtccccacacccagcacctggcattctagagatgcttcacaaatatttattgaatgaatcaaagaatcat 
agcagtgaaaaagagagtctattgaaaagatcaaaatagtcattgcttcagaaggcagtgggtaccagcattcactttcc 
tcctgtcctttgtttggttcctttaatgttacctatacattattatcattagctgaaaaatcatgaatcttacccattga 
atgctcgtactttaatctgtctttcctactgaactcaattcagataaaattgcttgttttggaaaaagtctccaatagtc 
agaattttttaagccagttgtgacctctgttcctttttcttctctgcagcatgcaagtgcaatgggcacgcgtctctgtg 
caacaccaacacgggcaagtgcttctgcaccaccaagggcgtcaagggggacgagtgccagctgtgagtaccatactccc 
tggaccaccagggaggaccaagaggctgtgcagctgcctgaaccccaccctgagagccacccacttccctgtgtcttgtt 
gctgtgggctctgagggatccctgggttgattagtttgaaattttgcccattctatttcagacaggtcagtccccaaaat 
gaggaggtcgtcgagttagcagcaattccttaatggctcttgaattcacatttttgtttaaatgatactgacatttcctg 
ggttgtccatttggagtagtcattttaacttcagcaactacttgatttttgtcatgtcaagagattatactctttcccaa 
agagtagagatggaagagctggtgttttggtgatggcatttatttggcgtttggttgtctgattgtggaaatgatcctct 
tacctttaacatttcccattactctagcattttccttgttgaagcatgtcaggcatcttcctggagagggtttgaagtcc 
ccatccatgaggatgcaggtgcagcagcatcaggcatgtttaaggtttcagacctggccctgcctccactgagcaaggtg 
ctcttgaggcaaatcactgttccccccatctgtcagtagtgggactagcagtaattgtttttggtgccctggtaggttga 
aacaggaaggatagttcctcaatcaatgtataccgggctgtctcactgggaaacctcataaaatgcatgtatccgcattt 
actcattctcaagaaaaaacttaaaaagtgtttagtgtcaaaacactgagctgattaatcattgtaaacatcattatttc 
ttaaataacagtagtaataatgccaccagtgattgagacatcattgctggtgctactcagcagggtttggtctgcctact 
ttttagtttatatctgcatatatgtttactagcttcatgttgccctaaagccaatatgtgaagaaaagtctaatccaagg 
ttaaacttaacttttaggtccacattcccttaaacagaaaaacgtcatcgagcaaaattaatcactcttaccctcagtta 
gctgatgcaacaaagacatgaggtggcatcaagtaagactttcaagttccttcgagatcgcagatgtaggcaggtgctgc 
tcatcaccctggctaaagggacacagcatacctgcctgggaaaggccattacccacttcccgccctttcctcttcatgca 
tagctgtcgtgtttacctaaatgcttcatgaagtgggaggctgggttttgctgatgtttatagatgatctgtatggggaa 
attgttttctgagtagaacatattctttttctacagactggtgaacacttgttccagggtttaaaaaaacagtgattctt 
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ggtatttagtcttctctcacttgtgtttaaagaagagaattagtttatccagggaacataggaaagaagtgagaaattag 

tatttgaaaaatgttttgactgcacttttagaagaatatgtagtccacacaaaattgaaaatgtttttcattatagtaaa 

taggtttattacatttggatttctctcattgctttggcaaatgcttgtcaattgtcctcatatagttggtgtgtgaactg 

ctgagcagtcagtattgaagcgttcatgcatgctcttcctagactgcctgagctgagttatggtgaaggatgcaattata 

atggctccagactatctgtactttatataaaaggatctgtttggttttaaaatgaattcagtttctgttttaaatagcag 

tataaaatagtcttttttcccccccagatgtgaggtagaaaatcgataccaaggaaaccctctcagaggaacatgttatt 

gtaagtggttttgcaattcttatttctagaagcaaagtagttcagtaaaacttcattgtttaaacgggtttgagaatagt 

aagtgctataagactatagcagccaccaatgaagtgttcccagacttgatatgtttacattctgttaagtttactacata 

taggagcactgttttaagctgttttaattgtgtttggggttaacgttaatgtgtccatagcagatagcaggagagagtag 

agaggcatgcatctttgtctatccacatttatgttctcttaaaactttactttatttgccattacctagttggggtcatc 

atatttcgtgtcttaggacgtagatcaaaaacagaattcttacaatatggttgaactttttgtcattctgtctggaactg 

atttgtgttaaccaccttgaggtgaaaggtgagtctgacaaggtgaccatgtttttatgggtaaatgtgttttctcttta 

tgggagaatccacatggtagaccagagtacgggaccagaaaaaaggagttaatgttatggcatatccattgtaattatat 

acctgctgtactgggttgaataacatcccttcaaaattcatgtccacttggaacctgtgaatatgacttcatatatatat 

atataaaacttctaaaagaaaacaagagaaaaacatcttgaccttgggtggggaaaagttgttagataggattctaaaag 

cataattcataaataaaaagtagactaatgaaactatataaaaatttaaaacttgtaggaaaaaaccatccaaaacataa 

aaagccacagactagagaaaatactcgcacaccatcatatatttgataaggagtttatatccagaatacatgaggaactc 

ttacaactgaataagatgacaattccagttaaaaacaggcaaagatttgaaaagacatttcatgaaagaagatattacag 

tggccaataagcacattaaaaacgacttaacatcattcgttatttggaatggagattttaaaaccacagtcagatacagc 

cctaaagtgattagaatccaacgctaatgccatggctttttagaagacagtggtaatctcatgtctgcttctgcattcag 

tctgttgcagtacatctttttggttaaacacatgaaaaaaacctggccttaccaggcatgcagttggaaaagggtatagt 

gatacccttttaatagccttttcagataattatggacgttacttgatattatgctgaaactggacaactggtcatttctt 

taaagcggtttgatgggttaacccaaataaaatgatactgcccacagacttgattttctttgaggaacctgggtacagct 

gagttaaagtgctttcctcgttacttaatttataagaaaagcagcctgtatctctaaagacctgttctattttgtgtgtg 

tttagtttttaaatatgcatttctttctttccatagatactcttcttattgactatcagttcacctttagtctatcccag 

gaagatgatcgctattacacagctatcaattttgtggctactcctgacgaagtaagattttttaaagtcttcctattttg 

ttttgaatttgtatggatctttttcttggtcattacggatggacgtactgccttaacagtgctctccagactggagtaca 

cgagatgatctctagaggtataggaaagaaatgttagactctacgttatctcctttccacataaaaggcaaaagtgatgt 

taataagatttaccaggatcttagacacagactgacattgattccacgcatacttactctgcctgtcagcccatcatggc 

ctcatacagaaaggggactccaccatcagagggcagatagcagagcctgggtttactctctcaagagtgaccagaggctt 

aaagacactgctgattgaaatgccagtgatgcagccccaatcagacagcaagggaggggaccccaatcagacagcaaggg 

aggggaccccaatcagacagcaagggaggggacactctgctcctagagtgagttcttagcctcattaggaggcaaaacag 

caaaggcttagttg-ggtccattaagaagttagccaggaataatctaaattgttaaaatatgtatgtaaaatgtggatttt 

tttatctgctgtcattaacgatgaagcccaacctgccttaaggtattacctagtggtagaaggaaggccacactgcggga 

catttaaaactgaaacatacagaacacgaagatgcacctgtacagtttcttcaatgaaatataaagtcatgcagtaccca 

cttcagtatttaaagaaattttggtaaacataatggtaaattatttaggaacttccttggagatttcttacttctcatga 

acatacacaaagccatttaccattacaaaattccattaacaataaatgtgacaaataatacatggaaaacaatatggcag 

taagacaaagtatattgctttgttaacaaatgttatctataaacattgctaaatttaattttaaaagtagaacaaagtct 

atagcgtggtatgtttactatacaagtaaatgaaaataggatttgtttttaatattctatttcaaagataaaattaagaa 

aaaatgtactagaaaagatacattctcaaaagtaaattgctttttaagtaaaaataataaattactttaaatgaattatt 

ttatggaaaaactataggtattaatatatcatttgagtggctgtattgactggaatttttttcttaacgattttagaata 

agattttaacaaaagtaccatatatgaaatgtattcactgcctcataagcaagcgtttgaagcgctcaaaatctttcagg 

atatacttcatgccattaatgtcattaaaaaaataaatatagtagaatctttgtaatacttcttaaccaggtgggaagca 

accatgaaagtatttggacctttctggatttcccagtttatcttacgacagaatacttactagagttatccaaatgcata 

tgttctgtcctctataaaagcacaagcattttaagtttattgattctttctgtggaaagacatatagttgacccttgagc 

aatacaggttaaaactgtgaggatccacttacacgtagacttttttcaaccaagcgtggaatgaaaatacagtatttgca 

agatgtgaaacctgagtatacagagggtggacttttcatatgcaagggttctatgggcagactgcggactggagtatgtg 

tggatttgggcatgctcagggggtcctggaaccgatgacccacgtataccaaaggatgaccgtaattgttagttgtgtgc 

tgccagcaaatactaaatacaaataagtaaacacttgagctgtcctttcaagatgaaggtgaggtcttatcagtaaatga 

gaaggtaaatgctttgtgagagaaacttctggtttaatgatcacattttaaaaatagctgtttggaaatgtttccattgt 

tgtgattttgtgctattaaaatgatcaaaacaacacccttaaaaatcttattctaacctctcaagatcttttaaaaatga 

ataatttcagtacagtcggatgcatctgtaaaagataaaaatataacattgattagtttgcaaaaataattgtttgaccc 

cagttaagagatgtactagtcaaatttcagtttgactaattattaatgttataatttacctaacatcaccaacagtacac 
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ttcctccactggcttaacagattcctcagcaatatctttattagtcattaagaccaaggatcaaaataaattaagttaga 
ttagccctgtgaactgctatatctctaagtggtagacaaggttttcaaaactaagaagccatactcaatcatatttctct 
tgaaataatatttttagtaagagaaaaatatttttcaaatcatgaatataataaaatcattttttaaattaagtacattt 
cagctctatatgccttttttaaaggctgtttccagttttggagaagttttacactgtatataatctatgagtttagaatt 
atatgggtttcagttataaataaatataattttgggcacttgatcaagtgttttattggtaggagtgaacttcagacatt 
tgaaaacagctgcccagagaatccttaggagggtgattccccagcacaggtcaggagatgcaaatcactgtgctctgata 
ggggtctttttaaaaggcactttctcatgcagttaggcatttgataaagcattaaattatgtatactttaatgggaggga 
ataaaattttattggcatatattgcttgtattaaggaaaattggttagaaattttgctaactcttctgtgagtttctcta 
aagacatcataaaatgtttgtgattaagaacatttagaggagtaaactttattgctttattttaaaatctagaaattgtt 
ttaattaattttctaataatttgtaccgctcatcagcaaaacagggatttggacatgctcatcaatgcctccaagaattt 
caacctcaacatcacctgggctgccagtttctcaggtaaagacacacctagagaagaccctgcaaatgaaggtgtggtag 
attaagaaatgtaatataggaattgagaaagcgagctcaggagacagattggtttgaaacccacccttgccacttactag 
ctatgagaccttgagcaagtatctaaatccctctctaaaccttagcattattttattcatctgtaaagtgaggataatga 
tacttacctcttagaattgttgtatagactaaattaggttatacctaccagagcttgccgtggtgtcaggctcagtgtgt 
ggttactaccctagcccaccaccaccatttctgttcttgctgtggccactggcactaccatcattgtctacatccgtgct 
tcggaagtgaaaaattcaaatgattcgtttcaataaatgaaaacattttaaataaaatgagattttagtaggtacagaga 
aatgtaacttgggaattacatcaagctctaaaagcacagctcttgctgtctgccttaccgtgattcactgaagatctact 
gtatagaaaatctaaagaaataaaggatgaaggccaagtgctgtggctcatgcctttaatctcaacactttgggagactg 
aggcaggaggattgcttgagctcaggagtttaagaccagcctgggcaacatagtgagaccctgtctacaaaaaaaaaaaa 
aaaaaaacccaggtgtagtgactcatacctgtaatgccaactactcaggaggctgagatgagaggatcacttgagcccag 
gggttggagattgcactgagccatgatcgcaccattgtactccagtgtgggcaaacagagcaagatcccatctctttaaa 
aaaaaaaaaaaaaaaagaaaaaaggataatcactacttaacttgataactcaacaagtagatatgggtttgaaatttgtc 
cattaaatttacttgcaccgtgctgttaggcaagttacttaaggtttctgagccagtttcctcctgtataaagtaggata 
gtaaaaacaccgtcctggcagggcgtgatagctcatgcctgtaatcccagcactgtgggaagccaaggtgggaagatcac 
ctgaggtcaagagttttgagaccagcctggccaacatggtgaaaccctgtctctactaaaaatacaaaaatcagccaggc 
gtggaggcacgtgcccgtaatcccagctactcaggaggctgaggtaggagaatcgcttgaaccttgaaggtagaggttgc 
tgtgagccgagatcacgccactgcactccagcttgggtgacagagtgaaactccatctcaaaaaaaaaaaataaaaaaac 
accttcccaagtagagtgatgtgagaat'taaatgagataataaatgaagtactcaatatagtgcttgaaatgtggtaaat 
ggtaactatattttatcattattactattacaatactgggtttttaaaaatcaaaaacacaaagcaatgagattgatgca 
aaataagaatattgccttgtgcacgccacttacgtttatcatcttaaaacattgtgtagaatttgagaaaagttcagaaa 
ctcccaatgaggagggacttttaagaaaaagtctgaattatcagagtatctggagaaaggcaacatctccaggcatgtga 
aagacttgcaatgagccgggcggtggctcatgcctgtaatcctagcactttgggaggctggggcaggtggattacctgag 
gtcaggagttcaagaccagcctgaccaacatggtgaaatcccgtccctactaaaaatacaaaactagctgggtgtggtgg 
ggcgtgcctgtaatcccagctacttgggaggctgaggcaggagaattgcttgaacctgggaggcggaggttgcagtgacc 
caagattgcatcattgcactccagcctgggcaacaaaagcgaaattccatctcaaaaaaaaaaaaaaaaaaaaaaagatt 
tgccatgagtgtctcaatgaagacgtgataatgtgggctctagtcacagggtctaactcagacatggaaaaaagtccatt 
tcattaatctttatcggcacttgaattcctggctaagggagaatgtggaacattgaaggactctctgggaataggatgga 
gttataccagattagggggacttaaatactgtggtagctggtggtagaagggaggactgagtgaccccttgaacccctcc 
tccctgctacagtgggttaggcagtgagcggtacatcagcattactggcatgggagtctggcgcattgccaaggaggtgt 
aaaggggaaatgcaaaggaattgaagtggtgtgggcaaagtgaatgccagtgcttgttaataggattctagtggtatctg 
tattttcatgatcatgtgtgtcacctgtttgggggtggggcaagggtggaagggagttacatggattcctggtaaaacca 
tttttctttctttctttttttttttttgagacggaatttcgctcttattgcctaggctggagtgcaatggcgcaatcttg 
gctcactgcaacctctgcctcccaggtccaagcgattctcctgcctcagcctgctgagtagctgggattacaggcatgcg 
ccaccacgcctggctaattttgtatttttagtagagacggggtttctccatgtttgtcaggctggtctcaaactccgacc 
tcaggtgatccacccgcctcggtctcccaaagcgctgggattacagatgtgagccaccgcacccggccagtaaaaccatt 
ttggttaggggcataggcttgtatatcagcctgcccagctttaaatcttatctccatttcttgttggctgtgtgacttga 
gggaagttacttaatttctgtgaacctcaatttcccagtctataaatgaagataataatagggcagctgtgaggattaaa 
tgagattgagctcttaaaggctattactggaacacaggaaatgtttgataaatgctattgtccttattattaatgaggcc 
agattctgttctccccctagccccccaaaaaatgtctcttctctttcatgtttttcttttaacagctggaacccaggctg 
gagaagagatgcctgttgtttcaaaaaccaacattaaggagtacaaagatagtttctctaatgagaagtttgattttcgc 
aaccacccaaatatcactttctttgtttatgtcagtaatttcacctggcccatcaaaattcaggtaagaagaggcttttg 
gtctcatacctgcaaaggtggtgaaatctctttagtaagactaaatttactaatttggagcttgtggtaaatgagatgtg 
caatgtggctttgcctttgtaacgtgtatggcagaggagtgctgagcacatgcatgctgcacagaagattggagtgggga 
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tggactgtatcactcatgaaagacatttgcaaaagcactgttgaaagcaagttggcatgtaacagatttggtcattataa 

acgtattcacttcttcagtgagcatttgccatgtgaaagcctgtagggctacacaaagaacttcaattctagagtaaggt 

ggatgtaagtgaaacaaacccacatataacaactgaaagccagagtgtggaaagtgacatgagatgatcccagaaatgct 

atcaaagtttaaaggaggacaaatggggagactatgttgaagaacatcagccttctagtcagacagaggtggattgattc 

ctggctcotatacaaatcatacagcctttccaagtctccagttctctgtgcatgtgacctgaagggtagttgtaaggggc 

tgtgcagctgctgcagtgtcttgttagcctgctcctttcctctgttcccagggggccagtgtactccctcttgtccgaga 

cccatggccccattttaactttttatactcatgtcccctggggcctttcctcaataccttctgcttcttaccttcttcat 

ttaggtgaatgtggaggttagggataggtgggctttcaaggactggttcacctttaaccatggaagcatggtcactggac 

ggaggctgttgctgtttgccaatgttcagaagcataatcaacctcagaagcaagtcaccacaaacatatgaaaaaagttc 

aacatcactgatcattagagaaatgcaaatcagaaccacagtgagacaccatctcacaccagtcagaatgtctgttatta 

aaaagtcaaaaaagaacagatgctggcaaggctgtggagaaacaggaacgcttttacactgttggtgggagtgtaaatta 

gttcaaccattgtggaagacagtctggcaattcctcaaagacctagaggcagaaataccatttgacccagccatcccatt 

actgggtgtatatccaaaagaatataaatcattctgtaacaaagatacatgcacatgtatgttcattgcagcactattca 

cagtagcaaagacatagaatcaacctaaatgcccatcagtgatagactggataaagaaaatgtggtacatatacaccatg 

gaatactatgcagccataaaaaggaatgagatcatgtcctttgcagggacatggatggagctggaggccattatcctcag 

caaactaatgcaggaacggaaaaccaggtaccacatgttctcactcgtaagtgggagccgagaacacatggacacatggt 

ggggaacaacacacactggggcctattggagagtgatggggaggaaggagagcatcaggaagaatagctagtggatgctg 

ggcttaatacctaggtgatgagatgatgtgtgtagcaaaccactgtggcacacgtttacctatgtaacaaacctatacat 

cctgcacatgtacccctgaacttaaaaattaacaataacaaaaaaagcaagttatcactcatcaattaggatgccttggg 

actgtgactaaagaacagttgatctttccattctggaaatatggaggacaaaaactatgttgtagtttttctcaccaccc 

tctctcctttttcctgtctccatggtctaagatttgttaatcctctcatcaggttctccctttgccct.cgcatactccct 

gtgtccctcctcagccagtcttgtaagcatcagccacgcttcttattcctgttttctcttgtctagtcacacatctgctc 

acaatggtctggcctcttccctcaccacatcctgaaactgtcctgtcatagtgggccccagtgatggaatttttcagtcc 

cctttttatctcattgcatggctttgaatcatcatctttcttgtcttcctggatttttcttttccccgttttctggccag 

tctttttagggaatcctcttcctcttccttaaacactgggcttgtctaggattcagacctggtccttttctctcttcact 

ctgtttcctgcatgtgggacttgtggtgccacatctattctggtgattctcagtgctgtctccagaccggcactggcagt 

tgctgcctggacagccctacctgggtagtcaaagccccccgcgttcggtgcatattccttcttgcgatcttagccccaga 

ct tcca tgct t cc t t tc tctgtaaacagcaccaccatcaccctgttgtacaagctagggcct gat ga teat ttgeat tec 

tgggataaacccaacttgagcatgacgtattatttccttactttacacgttgttggattcagtttgctaatatttaggtt 

agaattttgtatctatttcatgagtaagaatgtctgataattttcccttcctgtccttgttatggttttgatatcatgtt 

aaactaacttttaataattataatttgaggagttttcactctttctcatctctggagatgtaagattagagttaactgaa 

cctcaagtacttggtagctctctcctgtaaaatcatttgatcctagtgtcttatttgtggagatactttttagctgctga 

ttcagcttctttaatagttataggatattttgaatttcctctttatttgattcactttttaatattttttctaggttttt 

atatcatgtttttaaatgtatttttaattttactttttaaattgatacataattgtacgtatttatgaggtacatgagat 

attttgatacatgcatacaatgtataatgatcaaataagagtaattaggatatccatcacctcaaacatttatcatttct 

ttgtgttgagtacatttcacatcttctagctattttgaaataatgaataagttattgctaactatagtcataatgactat 

tgtgccattgaacactagaacttattccttgtaactctatttttatacccattaacctctatctccccagccccccagca 

gctggtaaccaccattctgcactctacctccatgagatcagtttttttagctcccacatctgagtgagaaaacatatcta 

tctttctgtgcctggcttgtttcacttaacataatgacctccagttccattcatgttgctgcaaaggacaggattccatt 

ctttttgtgactgaataatatttcattgtataatatatacaacattttatttgtccattcatttgttgatggacacgtag 

gttgattccatgtcttcactcttgtgaatagtgctgtgataaacatttagagcatgcagtatctctccagtatactgatt 

ttctttcttttggatatacacccagcagcgggattgctggatcatgtggtggatctattatgagcagtcttcatactgtc 

ttccatagtggctgtactaaataatttacattcccaccagcagtgaactagtatcgtctttctctgtatcctcgccagca 

tctgttattttgtctttttaataatagccattttaactgggatgagatgatttctcattatggttttgatttgcatttgc 

ctgatggttactgatgttgagatttttttcatatgcctgttggccatttgtatgtcttctttggacaaatctctattcag 

atcatttgcccatttttaaatcaagtttttttcctattgagttgtttgaattgctggtatattctggttataaatccctt 

gttggatggatagtttgaacatattttctctcattctataagctgtctcttcactctcttgtttcctttgctttgtagag 

ctttttggcttgatataatcccatttgtcgatttttgcttttgttgcctgtgatttcgacatcttacacaaa&catcttt 

gcccagaccaatgtcctgaaggattttcccaatattttcttctagtagttttatggttttaggtcttatacttaagcctt 

taatccatttaaatttgatttttgtatgtggtgagagagaggggcctagtgttatttttctgcacatggatatccagttt 

tcccagcaccatttattgaagaacctgtcctttcccccactgaatattcttgactccattattgaaaaccagttggccgt 

gaatatgtggatttatttctgagttctttattttgttccactggtctgtgtatctgtttttatgctggtatcatgctgtt 

gtgggtactatagctttgtagcatattttgaagtaaggtgatatgatgcctccagttttgttctttttgctcagaaatgc 
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tttggttggctgagtacagcagcccgtgcctataatcccagcactttgggaggccgaggctggtggatcacctgaggtca 
ggagttcgagaccagcctggccaacatggcaaaaccctgtctctaataaaaatacaaaaattagccatgtgcagtggtgg 
gtgcctgtaatcccagctacttgggaggctggggcaggagaatctcttgaacccgggaggaggaggctgcagtgagccaa 
gattacgccactacattctaccctgggcaaacaaagcgagactctgtctcaaaaaaaaaaagaaaaaaaaaaaaaagaaa 
tgctttggctatttgaggtcttctgtggttccatacaaattttaggattgttttttctatttctgtgaagaatgtcattg 
gtgtagagattacattggatctgtaggtagcttttggtagtatggtttttttcacaatattaattatttcagtccacaaa 
tgtggtgtctctttcaatttttttgcgacctcttcaatttcttacatcagcgttttatagctttccttgtaaagagggct 
ttcacctccttggttagatttattcctacggttgtttttggaagtttttaaaaaatctatcgagaatgggattgctttct 
cgatttcctcttttgctagttcgttgctcctatatagaaatgattctgatttttgtgtgttgattttgtatcctgcaact 
ttactgagtttgtcagttcttagtttggtggagcttttagggttttctgtatataagattatatcaactgcaaataggga 
cactttgacttcctcctttcagtttggatgccctttatttctttcttttgcctagttgctctggtcaaatatgttgataa 
cttgttgatgctgtcctctaatgctcgcagcgtctgtgttcatgaaaccctttgttgtattccaatgttgatagcattat 
tcacagtaatcaaaatgtggaaacaacccaaatatctatcagtggatgaatggataaacaaaatgtggtatgtatgtgtg 
tgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgcataataaaatgttattcagcattaaaaaggaatgaaattctga 
tacatgcgacaacacaagtgaaccttgaaaacatgctaagtgaaataagctagttgcaagaggacaaatattgaatgatt 
ccacttaacatgaaatatctagagtagtacacagattcttagagacagaaagtggattggaggttaccagcagctagggg 
gagcgggaaatagggaattactgcttcatggttattaaagagtttccctttggggtgatgaaaaagctttggaagtaggt 
agtgatattggttgcacaaaattgtgaatgtaactattgccactgatttgtacacttaaaaatggttaaagtgattttat 
gttatatcttactacaataaaaaaagtcttcaaaaatctcagcaagatgtttttgtagatatagacaagcttattctgaa 
atttatatgaaagggcaaaagttctagaatagctataacaatttagaaaaagaagaataaaatgggaagaatcaacctac 
ccaatgttaaggcttactatttaggtacagtaatcaagacattgtttcaggtagaggggtagacacacagatcaatggga 
taggctgatctataggtagatcccagaaatagaaccacataaatatgtccaactgattaggatttgattttgtttgtggt 
gtttttttgagacagggtctcactatgatgtccaggctggtcttgactcctgggctcaagtaaccctctcacctcagcct 
cctgaataactgggattacaggtgcacaccgccacgctagttccagctgaattttttttttttttttttttttttgagac 
agggtctcactttgtcgcccaggccaaagtgtagtggtgcagtcacagcacactgcagcctcaacctccctggctcaagt 
gatcctcctgcctcagcctcccaagc Cgctgggactacaggtgcataccaccatgcctggctaattctt ttttttttttt 
tttttttttttttttggtagagatgagatctccctgtgttgcctagtccggtctcaaactctcaaactcctgggctcaag 
tgatccCcccgccttggtctcccaaagtgctaagattacagacatgagccgttgcgcccagcccccagctgatttttgac 
aaaggccctgcaaaggcaattcaatgaagaaaggatagccttttaacgaatagtgctagagcagttggacaaccataggc 
aactaaaggaatctctaaatctcacatcttatataagaattaacgcaatattgatcacagatctaaatgtaaaacataaa 
actataaaacttttagaaaaaaatataggagaaaatctttgaatcccaaaacgaaacaaaagctttttagactcgatact 
aaaaacgtgattcataaaaggaaaacttgataaactggaccacataaaaccaaaagtttttgctctgtgagaaaccctgt 
tatgaagatgaaaaggcaagctatggactgggagaaaatatttgcaaaccatatttctgggaaagcacttgaatctagaa 
tataataactctcaaaactcaacagtaaacaaacgaacaattcagttagaaaatgggcaaaagacatgaatagacattta 
accaaagaggatatacatatggcaaataatgacataaaagatgatcggcatcaataaccattagagaggccaggtgtggt 
tcacacctgaaatcccagcactttgggaggccaaggcaggcagatcaccacttgaggtcaggagttcaagaccagcctag 
ccaacatggcaaaaccccgtctctactaaaaatacaaaaattagccaggcatggtggcacatgcctatagtcccagctac 
tcaggatactgaggcaggagaatcgcttgaacctgggaggcagaggttacagtgagctgagattgtgctactgcactcca 
gcctgggcaacggaatgagactgcctctcaaaataaaaataaaataaccattagagaaatgcaaattgaaagcacaatga 
gataacactacacacttatcaaaatgactaaaattaaaaatagtaataacaccaaatgtcggagagagtggggaaagatt 
ggatcactcatacattgttggtgggaatataagatggtacagtcactctggaaaatactttgtttcttaaaaaactaatg 
ttatacttatcatataacccagcaattgcattcctaggcatttatcctagagaaatgaaaaccagtgttctcacaaatct 
ctgtacacccgtgtttatagcacttgaaagcaatatttaacagctttatttataatagcccaaaactggaaacaagccaa 
atattcctcaacaaatgaatgcttaaataaatgatgctacatacacaccatggcctaatacccagcaacaaaaagaaatg 
aactgttgatatacagaaacacaacaactttgataaatcccaagcaaattttgctgattgaaaaagaaaatcttagaaat 
ttatatactgcatgattgcatttatataacatttgaaaatgaaaattttagatcttatatacctatgcacatattgaata 
agtatcccgaagcttcaaatttctctgacttcttaggactctgatctctagaccctgttttaaaggagcttatctgatta 
ggtcaggcctacccacacaggagtagggaagatacagagcggtgcccatcaagaaggtggaaatcttgggggcattctta 
gaattctccctgctgtaggctgcatttggaatagagcttgcaggggtttcggtagtttctccgatccgaaaaaggttgag 
ctctaggtcccagtcagtataatttcagtttggcaacatctcctgacactattaagatatagattatatatatttgttag 
gttattttcatgtaaacacttgattagggttctagtaaataaaatttatttatttactcattttaagttgagattgtttt 
tgtactgacatacctctccatattctcaatctaattgaacatcgtgctttgttgcttatagccgacaaatgatcaatact 
ttttagattcttagtgaaaatcttgacttatgctgctgacacttttcccagtctgtgcctataagcaggcttttgtgctg 
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gagatggccatacttgttgccaatgccgagtttttttcctcccagttccctattgctccaccctctgaaagtcactgaag 

tggttttagagctttcattctagagggagctcttccccacacttacttctaagaagtaataattttaagtttcataatca 

aggagatgttgattttcttcctggttttctggtgctccttatcttttctggatcactaacttaaattaagatccacttat 

gaatgctgcattcaggagcacttgtgaaaataaacgatggcatgcccaccatttagtttgccaagctttccttgaaaacg 

caggtaagttctattgcaaacgtgtctttttgctgtttggtacaaatgactgcactattcactgtgactaggcccgtgtc 

accgtgtgttctcagtcctcgcatgagagcctggttgacactggttctctctgctttggggaccgtatctggctgtgcgc 

tatgtggagggctgagctcgtgtcttcctgggtaaatacatcatgttctcaacaagggctgcatgtttctttattgctgt 

tttttccccagagacttagcttttctaggtgtgtagcatggctcttaaatacacacatttagcaactgtgcaaattcagt 

aaaaatggagagacttatttaaggaagacagtagggaaatgtctaaatagtgcagtgtaggccattcagttaaatttcag 

tatggctttcgatgcaggcaaagccgttttgccttcgattcccattttttaaattagattttaaaaattgagattctcct 

tttttaaatgagattttaaaaattgagattctccttttttaaatgagattttaaaaattgagattctccttttttaaatg 

agattttaaaaattgagattctccttttttaaatgagattttaaaaattgagattctccttttttaaatgagattttaaa 

aattgagattctcatttttttaaagcaaagtaactgaggaaatctaattctctacttggaaatttgtgtaagcattgggg 

ccttaaaaatggtaatactaaacccaagttccttcactctgaaccaatcccaggattcttttccataattaggaacagat 

caaaacctataactgctgagggaatttgaaactgactataaaaatctttgtagatgcagctaaaaaacaaacaaaaccca 

cagactgaaaagtcctgtttctgggagggtggcgggtgtgttggggccggttgtgcctgttttttgattggttgttcctg 

tgggtgagcacccttccctctgcccctgctttctgaaagctatcctagttgttctccccagtaagccggggctcagccgg 

cttcagctatgtccatagtttgggggcgtgggcggtaggtattcaagatagaagatggttggacagaaagtgtgtgaggt 

gggaaggtggagtggtgaggctctggctcgttcccgactctgcaacagaggttgttactgtaatgtggagctaatataga 

agtctgcgtgctgggtgtttcatctactgctttatgtaagaggccagtgttcacaagtaaaggcaacacagaattggtat 

tgagaactgggcattcatttagattttagagttctgtttatgggtgctaacaggatcgctatactgagcaagggatgtgt ' 

gaaacttttgttcacttggatgtgctttttttcttataacagctttattgagacatcatttacataccatacagttcgct 

catttaaagtacacaattcagtggttttcagtgtattcacagaagcgtgcagccacccgtcactacagccaattttagaa 

tattttcaccacttccagaagaagcccctagcagtcagtcctcattcccccctcttccggcaacccccagtctgtatttc 

ctgtctctgtggatttgcctattctggacattttatataaattgagtcacatgatatgctctcacttagcatactgtttt 

caaggttcacccgtgtttagtatgtttcagtacttattccttgtcatagctgaataatcttccttatatgtatataccac 

gttttgtttatccatttttctatcgatggatgcttgggtggttttcactctttggctattatgaatagtgtcagatatac 

ttttaatgtttatatttattgtataataacagtgcatatccgtacctcctggtgttaattgtctgtatcttaagagtttt 

aagaattatatggttgtgctgtaattctagcacttagggaggccgaggcgggcagatcgctttagctcaggagtttgaga 

ccaggctgggcaacatagtaaaaccctgtctctactaaagtacaaaaaattagctgggcgtggtgaagtgttcctgtagt 

cccagctacttgcaaggctgaggcacaagaatcacttgaacccaggaggcagatgttgcagcgagccaagattgcaccac 

tgtactgcagcctgagcgacagagcaagactcctgtctccaaaaaaaaaaaaaatttttctttatgatgaaacccacaag 

gctgtacaagaaccttaaatatagtttgaaacaaagaaagacacatgcttttaagtacagtttaattcttttcttttctc 

ttatttcattttttttttttttggagacagagtctcactctgtcgcccaggctggagtacagdggtgcaatcttggctcg 

ctgcaacatctgcctcctgggttcaagtgattctcgtgcctcagccttgcgagtagctgggactacaggaacacttcacc 

acgcccagctaattttttgtattttagtagagacagggttttaccatgttgcccccatcattctcagcaaactatcgcaa 

ggacaaaaaactgaacaccgcttgttctcattcataggtgcaaactgaacaatgaggacgcatggacacaggaaggggaa 

catcacacaccagggcctgttgtggggtggggggaaggggaagggatagcattaggggatatacctaatgttaaatgacg 

agttaatgggtgcagcacaccaacatggcatatgtatacatatgtaacaaacctgcatgttgtgcacatgtaccctaaaa 

cttaaagtataattaaaaaaaaaaaaagaaaaatgtgttattcctttgtctcgaaagtcccctgcttttcagcttttgct 

ttggttgtctcttgtcagctctttaatgctattgttcccttgggtttcatctggactctttttccctgggcaatctcatc 

cacaccatctgggaacattgcctttggctggacaggtaaatctttgtctctagcagacttctctcctgLgatctccagag 

ccaggaacccactagacttccttaaaattcattcagccagcagactttctggcccagagcagcagctgtgagtggacagc 

aggcacctcgttagaagcttggtgttggaggtgtgcccatgcattcctgggagccatgagaggggaggggagtgccctgc 

gtcacctgagtccaagttaagagagatcacaccagagctgattcctaaggagcatgggggcgggggaagggggtgttcgt 

ggtggggaaaggaaatgaccagccctttcaggaactaccaggagtttagcagagctggtgacagtgctatgagcaagaag 

gtggagagctggggagggccattatgaagagactggagattgcaggctgtgaacttgagtcctgatgtgactgggaattg 

ttgagatgtcttaggccgaaaagtcaccgacgatcagtgtttagaaagaccccactggcagcagttgggagtgtggattg 

gaaggagccaggcctaagtcctgggaggacttaggaggtccaggagtccagggagtccaggtggaattcttgtgtagaaa 

cccagggaaggggaccatctgatgacacacatcccatagcaacttcctttcctcctctgcacctccacagcagagtgtct 

gtttccactatacgaagatttcttaatggcaaggatcgcatttttatccctcagtgtcctcagtgtcatgtgctaatcag 

ttggagacactccctaaatgtatgccagatgagtatatagttgttagtataagggctttacggtaggtatcagaactttt 

tcttttttacaacttttataacttttataaaggagccgacacctttttctgatattgtcagaacaattgctgaccagtgc 
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tatgggctggctgaggttggcaaccaagaatctgttatctcagtctacacagcgtggagttagaagattggattgttgtt 

ggttagaattttgtccagagagttagaagatagaagaaaagggagggctgaggacartaaacatccaaattttacaaagaa 

gaaagttcagtgctcctgtctatattgaaatagttaagagggaaatgattgggtctggggaatgcttcaaaataatatga 

tcatagggaaagtaagtgtagtgtggatgaaacaaggttggccttgagctgataatttactaaagcaggatgatgagcgt 

gtgaggttcattatagtgctctttctactttttaaaaagcttaagattttccaaaatgaaaaagaagctgagcatggtag 

ctcacgtctgtaatcccagcactttgagagcctgagacggaaggatcacttgaagccaggagttcgagaccagcctgggc 

aatatagtgagacccccatccctactaaaaaagaaaaaaaattacacaatgaaaagaataatacaggtggtagaagaaga 

aataaaggtatgagtatatgatttggcctcctacagacagctacaggagcatgcctctgctgggcgggcctggagcatag 

tgcaccctggagctgaagttctgttcgcctttacaggaagtacacagcattgtacgggggttttaccagttaacagtgga 

cagtctctagaacactcaggggtcaggggcaaaactccctctccctcaaaccagcccacctgtcttggtgagaatggaaa 

acggagcttcccagcaagcctcctgccttcaccccgacgcacaggtgcatcagggtctgagggctcctccctgtggggcc 

tgggaagccgcccttcagtgggctatggctgcattgtcttctgatcttgtagctgtgtgtgggtaacaggcacatcgtct 

ccccatatcccagagagtcatgtccacttcctatctgtggtggcatctttccatcactgtcaccctcaggataatggcac 

aggaactaacagaggcactaagagcagtgatatatactggagggaggggcggggctgatgaccaaaatcaccttctgttt 

cagtaagaggttggttaaagtgatgcctaaaatggatagatcaggaaataacaataaaaacattaatgataaaaaggcaa 

ctgccagaagaactgaaaagtaacctagttcaaggtgggggtccctggagagaagccttcagaagagcagaggaagagca 

gggaacattgcttttcatgactagcctttcagtgccttttgatatttcagctaagtatgtagttctctaatgatttctaa 

attgttaatgagtgtgatagaataggaattctggagaatggtcaaaaggaattgctaggagaaaattagggaaagaaagc 

acagagcagtattgaggtgggaggttgggagcagacagatggtgagtgttcgtttgcatggcaggaatgagtggctgaat 

ggcattaactggccatggggctctgaccagagcttgggtatacagtttctgtgccagcacaaaccagcattaggtgcgtg 

gtgtgtgaggccgggtgatggtcatatggatctggtccagagctgtggtcttgacatagagttccatgtatgaactttgg 

tggtggtggtggttttgtttcgttttttaagaaaaaaagaagagttgttttatgtcacagccaaactcaaggaggtgaca 

gcacaagacccaggttagaatgtacttagccagggctcacggaatcttccatcctttgcaaagccaagaagagtagaggc 

ctcaaggagaatagaggagcacatgggctccttctgggctgagctctcacccttggtggggccgctggtcccagggggtc 

gttgggagcctgcggagattctgcccagtacatacataccttgcgttttctcggagcctgacaggtgctgccttgaagcg 

gaaagacttcacctcacagggtggccagctgtcctttctcttcctctccctgtcctgtgtgctcatgataatggatagat 

agacatctccatggtagaaataatgcagccctttctgcagtccatggttccacagcatatggtcagtagacttagatgag 

tgctacataaagtgtaattagttctgcaccacggtgtcttaccttctgaaaaggaaactctggtattgacttagaaatgc 

cagttttatccttccgaccagaacgcaagtcctgaatgaggccagaggccaatgaaaggtcctttcttqtctcttttcaa 

atctgctgtattaaacagtgacttttcctagtcattcagtattcattcattcatccatttattcaagtttttgagtacct 

tctttgtggatggcacatagtagatgcaaaaatacaatcatttttaaaatggaaagcatttgttatgtggtagtggaata 

taagaaatgccatgcttccatatttttatggtaatcttggactcaggttctacccaccatacttaattagaaattcattc 

tcagcctctgttttggcagagacccattctcttccctcaggaacatacagaaacctcagggctgaactgagctcagaggt 

tggggaaggtcccctggtctgatcttcagtcgctgactgctgcgcccgattccctggcccctcactcttgagggcagcat 

cctcactcttgtgtggcacaggcttggacatagaatgacacttaagcaccagagcctccctagcctgccacaaggacacc 

ctacattcccaggcactttcagaatgcaaaacctcccctccattccctccctttccctgcctccaggtttgggccctgcc 

tctccttttgtttcaaagcactgtcctctcttcctctacccttatccttccccatctccctcccactccccaaaagagcc 

aaactaattatggggatttaaccttccaattaaaggcgaggatttttttttttcaatgcagttcctgaaatgctaaagaa 

attgtccagggaaaggagcacaggaggagttttgttttgttttgttttgttttgttttgttttgttttgttttgttttga 

gacagtctcgctctgtcacccaggctggagtgcagtggtgcgatcttggcttaccgcaacccctccaccccctgggtttg 

agcaattcttgtgcctcagccccccaagtagctgggatcacaggtgcacaccaccacgccagctaatttttgtattttta 

gtagaaatggggttttgccatgttggccaggctggtttcgacctcctggcctcaagtgatccacccacctcagcctccca 

aagtactgggattaggcgtgagccactgcatccagcctgatgtttaaataatttcttacttcagagtttcctgaagtgtg 

gcctatggagactagcatcagaatcactggaggagcttaaagatatagagacagggatagggcctgagattctgactttg 

ttgtaagctatccagagtttgagagccatggcctagttcttggaatacaaaagtgtatctgctcttgtatgggaggtagg 

aagttaccttt teat teat tcttctacaacagtaatttaaaagttaggaattggt at cttcatagacagaagaatgcagg 

aatcctcttctcaggatgtctgtctttacacagccatggtatatggtagaccaagtttgtccaacccatggcccgcgggc 

tacatgcagcccaggacggctttgaatgcagcccaacacaattcataaactttcttaaaacatgatttttttattttatt 

tttttttaagctcatcagcactatcgttagtgttagtgttagtgtattttatgtgtggccaaagacaattcttccagtgt 

ggcctagggaagccaaaagattggacacccctatatagaccataattatcacacgccctgagagggaagcagatcaatgg 

tgagggagcgtagaggagagcctgccagagtttccagtggggccggggatgcttaaaaacggaggcagtgatgtgctgag 

ccagaaaggacagggaggacccccatgcacacaggcagggaggctctctggtgacatctgccgtcaggtacagtagtaca 

gtagtagagtgagcggaagcacttttggctgaagtgcaacattagtgattgaggaagtgcaggtccgtaatatcttaccc 
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caaaccctggggctcactgagcttccgagttcagagtttttcagaattgatgaacacatccagaaggcccatggggcaga 

tacaccttagacatattgatgaaagtggagttggaattgggttgtggaagtcgtaggacgtgaattctaggtttcttgga 

gtgtggagtatggtgtctactgtttggaaacagagaagaatgacattatgagttttatttataaaataagaatttcttct 

ttgtattaaaatagatttttatataggaaattttcattaccttcagtatcagaaaggcccccaaatctatttcaccgttt 

ccataataaattgtgaaagaaagatttgattgggatattgtctttaaaaatgaaaccaaaaagtatgggaaaatacttta 

attttagagttaacaacatttatagttacaatcatgaagcaggtatggtgctgcttcatgagccaagttcataaaagaaa 

aaaaaatcagatcttactgagttctgtgagacagatcattatttttcttattaaca tat teat ttattcagcaaatgttt 

attaagtgccaattaatgtatcagtggctgctttaggcactggagatatatggtcagtgggataaggggaagtctctgtc 

ttcaatcctcacagatcttattttccataagtaagtgtacaaatacgggctatgaattaaatagagcaagataaagagat 

gggggtgtggcttcagttgtgcacatggtgcccagaaatctgatagctccatcatggagtccttcacttgagacttcaca 

gcaggatgtaaaaagcaacagccaggaaactccttagttttcccaaagtcctcccttaatctcgtttagagaattatgaa 

acacttcctatggcttcatcctttatcccttcttcagttgataactgaagggctaaatctgccaagtcttcctttgccaa 

tggttttgtgcgagcagttctccaacagcgcttccactaaaatcttgtcttttacaagatggattttacagttggttggc 

attgtatgacagtctgtgagtagggacagcccaatgtatgtttgttaatcaagtttatgccttttacattgagtaatttt 

taattacctcgtgtaatttttaacagcactctccttcatcttcagaagtgttcagtttgaacaataagtttgtcatcttg 

tcagtcaaaggttgaccaacaaagactgaccgcagtgggcggggtattcagcagggttagatgtttctaagtggtcagtg 

ctttcataaacattttcacgttatgacctttgcctcctcacacaatcttgtaactatggagacttagatgataaagattc 

aactaagaggatccctttaaagtcttgggtaccagccataccaaaagctgttttatatgaggttctttatgattataatt 

tggacatgaacatggagtggtcctgcttcagcaagaaatcaggctaataaccattagcatccagtgtgggtaaaggcggg 

ggctgccctgaggagtgttgtgtggcacagggcgcatgaacaccagataagtcctcagacagtgcccttagcatccagtg 

gtttttggcatctagcagggagtaaaagtatctcatagattctttaagtgtaaacaccgaggacacacttaacgttttaa 

aagattgaaatccagacagggctggggtggactctaatctgtttccttctctctctttatgtgttataacttggattctt 

cattccccaaaatgaccttgtttatataattctccagtcttatctaaaacttttatgtgaggaccatggctcatattccg 

cagctttccttctctcaaggagctaggactggttattgtgataaatgaaggaaaagaaaagaaagtgggtttctgtttat 

tgtatgaaattgatgtaactttgtatttctgattataaatggtagccagagggaggacacaaaaacttcaaattccaagc 

cccataatatattatggcctgaatagtgttggaatttatttatttatggaatgtcaaaaagaaaatagggccatcaaaaa 

agcaaacagtggaagtctattttctggcccctgtgagtctggctctaaaaacagtgcaaggggttggggtttggtgggac 

taggaaagcagatgctcgtcatttaccttcctgtgtttgtcatgagaattgtctgtactgggattgtcaccttggtcagt 

aattgtttcgtttatagtaccattgtctctccaaagagggctgagttccctaaaaaaaagcaggcagcgcaccacaacat 

ctggctagtagcacgtatattatcggtgatagtaaggacacaattctcgtatttaatcaatttggaggtatgcatttttt 

cgtgggctaatatccctgaaatcatgatgcatcttatggtcagtgacttaaaaagcattatgtcatagttaaattgacag 

tgtttttcctaaattatacacagaataatgcctttttcagccaatgttattttagagttgatgaaatatgttgcattttt 

atattcttggtatgaagactattaactgtataataaaattatttaagttctgatgacgtagtgtcagatgatgttcagtg 

tagcaaaacgcgaggcacaattttttgtaaaaaggacaatatcttgcagtcagtacacactgagtacataaatgttgacc 

aaatgaagagcacagaatggggctgcccacctggaggcagagccatgaattggtactgaggcaaacaaggagagctttgt 

gtgtgggtggaggcgcagggagggagcgccaggcttcatggaggcatgcgagggtctccagggtcaaacagacagaacct 

cagaactggtgtgaagcttagaaggggctgcaggaggctgtgctaaagcctcagtgtattgctatgggactggcaactaa 

acaagtaaactaaaggggtgaggtacttttccttaagtgaaaaaagtttccttaacttctttaaagtgtaagctctgttt 

tcagcttaactcttgaaatgagaagcgtgaatccgtggtggtttcatgacctgcataatcacactacctgtttttctgtc 

acagattgccttctctcagcacagcaattttatggacctggtacagttcttcgtgactttcttcaggtaattttgcttat 

acagactctctctgttcttttgatttaggcactagatccattaatagagtatcagctcctaaaagataaaaagaataatc 

ctgtgtttccaccacagattaaatttacttcatatgtgttcctgaaaagttagatgtcagttacagtaatataaactcaa 

tctagcatgacaaagtgaaaacgtttggtttgtatgttctggcttccagcggggggctttgcatgtatcctgtattaatt 

ttgcatgcatcctatagataatttggctcttcatatggggagaatttggactaaaacaacatttaattaataatggttac 

tgatgtgagctacacatatgcttatgcttgttcaagagatgattctggtgattatccaaagttttaaagctgaaagtgat 

tttttagtgccctgggattttcttttatttccatagtttttttcgtagtctaattttatctttatgaaaatactatagac 

aaataattgaaaaagacaaaaaacgtgctgcagtt Cttaaaacagaaagcagcaacagcctgcct tgcgccttccccacc 

ccacgtcccccaatttaaattgttttgttttttacttctctgtgtttacacttggtttttcagttaagttctcttttgat 

gtcctactgtagaagatgaggatatactgtgttacatggcctcccctagcatctcccctcacaccacaaaagcatacaaa 

agccccatctctttcatttaataccattgtatcataatgcatggctaaatcagtgatccctatctatgttatggccataa 

aactcttattcacagctgcattgaacagtgattacactaccacattttttgggtagaaggatcattctttgcttatccta 

aatgtagtatttgccatgttttggctttttaattgtttgatttttctatgtgcctacccctgatttatcactatactctt 

gctgatgtacctttagaacagtaaaaaataaaataaaataaagtatcattgtattgtgacagaactgagaaatctcacaa 
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tggtcacacacaggaaggtgctctcttggttcgaatttctctcggagttgcccaccctagaaccttcccacctcctgccc 

ccaagtgcacttgttcctgaggcctgctgcacaactgtcatttggaaacttcctttgccttcctcctatattgaagtctc 

tgtttcccagatcccatgtcttcaccttcctatccattccctctttttactggagcacattctctagatttatgagaaaa 

gatgcctgggagataacttttctgtgagctctcctgactgaaaatatatttattttatcctacaatttgattgaatttct 

gggggcaggtgctatggctcacacctgtatttctagcactttgggagaccaaagtgggtgggtcacttgagctcaggagt 

tcgagaccagcctggacaacatggcaaaaccttgtctctacaaaaaatacaaaaattagctgggtgaggtggcgcaagcc 

tgtagtcccagctaatcaggaggctgaggcaggagaatggcttgagcctaggaggcagaggttgcagtgagcggagattg 

caccactgcactccatcctgggtgacagagtgagaccctgtctcaaaaaaaaaaaaaaaaaaagtctgactacagaattc 

taggttcaaaatcattttccattagaatttagaagacgttatactattttttctagtgtccagtataatattgagaaggc 

tagtgctgttataattcccactctttgtatgggatccgttttctctcttaaaaagaaaagtatatttaagtgtaatataa 

agttacagaaacatttgcaaatcctatgtgcgtagctcagtgaattatcacaaattgaacacttctagagtattgataat 

attatctcttgttcacccagaacacttcaggcttcttattccttctctgtcagtttcggcaagttgtatatagttttctt 

ctaatgatcccattccatctgttccattctcttgttcactgagaacaagagttaatattaccaatactctagaagttcct 

cttgtgtttccccCcactcacaacctcctgcctctcccccaaaggtaccatttacctgacttctcaaaccatacattgca 

tttgctatttttgaaatgcatatgaattatttctttttttttttttttttttgaaacttagtctctctctgtcacccagg 

ccagagtgcagtgttgtgatcttggctcactgcaagcaaccgccgcctcccgggttcaagcaattctcctgcctcagcct 

cccaagcagctgggattacaggcacgngccaccacgcatggctiaatttttgtatttttagcagagacggggtttcaccag 

gttggccaggctggtctcgaactcctgacctcaagtgatccacccgcctctgcctcccaaagtgctgggattacaagcat 

gaaccaccgcgctcagtcaaaatgtatataaatgaaatcacaacataaattttaagatgatttggttgatgaagaaaata 

ctgaaaacttgtgggttgcagttaaagttgtgcttcgaggttatagctgtaaatctgtggctctccaccttagctgcacc 

tagaatcacttgcggagcttttgatgtccaagccactcttcagaccatttgactcagaacctctaggggtgcaccccaaa 

catttgaaaactctccaggtgattccaatgtgcagctaaggttgagaatcactgctttaaaagcacatattagaaaagaa 

aaaacattcctctcaataactcagaactgctaattaaacctaaagaagtagaagaaagaaaaatataaataagagcaaaa 

attaatgatctaagggaagaagtacagtagagaggatcttaaaaactgtcagttggatctttgaaaagagtaagaataat 

attgacaaacccctggcaagattaaccaaggaaaaaagagagaaggtatatggtatatgtagccaatgtcagaaataaaa 

aagggggcattcctccaaatcccgcagacgtttttttcctccagaagggtatttgatgaacaactttatgtgaatgaatt 

tgaaaatgaaacagatggaatgggaccattaaaagaaaaatatgcaacttgccaaaactgacacagaagaagtaaggagc 

ctgaatagctctgtatccatctttaaaacaacactaatttaagttttgtcatagtgaaaactctaggccccaggtggctt 

cattggtaacataacctttttaggccgggtgcagtggctcacgcctgtaatcccagcactttgggaggccgaggtgggtg 

gatcacgaggtcaggagatcgagaccatcctggctaacacggtgaaaccccatctctactaaaaatacaaaaaattagcc 

aggcgcggtggtgggcgcctataatcccagctactcgggaggctgagacaggagaatggcgtgaacctgggaggcagagc 

ttgctgtgagccgagatcgcgccactgcactccaacctgggcaacagtgcgagactccatctcaaaaaacaaaaaggaga 

atactgtgatcatcttaatcaatggaggagataaagcatttgataaaatttaatacccattcatttttaaaaaaatttct 

tagaaaactagaaagagaaacttctggaacactgataatggaacaacataaaaagtcaacagcaaaccactttcttaatg 

gtagaatgttagaagtttgagtttgggaacaagacagagatccactttccacagtttcaattactcagggtcaaccgtgg 

tccaaaaacattaagtggaaaattccagaaataaacaatttataagtcttaaattgtactccattctgagtagtgtgatg 

aaatgtcgtgccatcctgcttcgtcccacccgggataggaatcatcccttggtccagtgtatccacgcccctatgctacc 

cacccactagtgtcttggttatcagatctaaaaatcatagtatatgtagggttcagtactatccatggtttcacgcatcc 

actgggaggctgggaatgtaaccccacagattaagcaggacactatattgccaaatccaaaggatactttttggttctcc 

tcttcttgacttctcggtgatatcaaccaactgactgcttcatgagcgagactcttccagccgtctctgacccatgctgc 

cctactgctgcttcttagcctgcctattatgcaatcttttctattcagcctctcactgttgattttttcccagaccttga 

ccctggccgtcttctcactctgtgttctctgactggactctgccttccccatctgcagcacgacatgcccagaagtgacc 

aattccatccctcctcaccggtgttccccacctcactaactggcaacatggctctgctggttccaagccacttctcttgc 

tttcactcagcaggcactcccttaccaggccccagagctgtggccgtcagtgcactctccacctccactccccaccccac 

caaccctagcatcacagtctcgcctggaccttttcacttgactttggcccctccagaatccacaaaaagtccagagaaga 

cctttaaagatcacgcatctagccccttaccttcctgttgtatctaaaataaaattcagcgtcctgggtgaactggcgtc 

cgcttttccatattcagcttgcactcttctcttccacatttctgctaccaggaccatctcctatgtcttcaaaagcacaa 

gctctttccctacactgagcatttgtacatgcttatcccattttctggaacactcttctcactacccttcattctgtata 

ctttcataccttgcttgatttttccagacaacaacaaattctccaattctctgacaccaactcagtgtctcacagttcag 

ttccattcagacactacccatagttggagcaaatcctgcaggttaggggctcagtcccacaagactgctcccacttcaga 

cactaagcacaaatgggtccccaggctacctcactgctactgggccaactacaaattcagggcttctggtgactgctcac 

tccaagatttgacaatttgttagaacagctcacagaactcaggagagcactatacgtagaattacagttttattatacag 

ggtacaactcaggaaccgccaaatggaagacatgaataggacaaggcatgggggtgggtgagcacagctctttcatgccg 
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tctctggacaagcctcctccctgcacgtggatatattcactatcctggaagctcccccaagcctcatcattctgagtttt 

tttattgaggtttcattacataggcatgatggatgaaatcactttctattggtgactgaactcatctccagcccctttcc 

ccttcttggaggttgggggatggagctgaaagttctaaccctctaaccacctagttgggttttctggtgaccagcccccg 

catctgaggctctttaggcccctctccccatgagtcattcattagcacacaaaagacactcctgtccctctgaaaattcc 

aaggggtaagctaaattgtctccccttgttcctcactcctgtagaactgtattatttttcatcgtggtattcatcacaat 

ttgaagaatatattcatatgtttacttgcttaatatctgggtcggtagcccaagcttcaggggagcagtgcactgtctct 

cgttcactcctatacacacacagcttcagcatctagtacataatctagtaagatatttgtgggaagtaaacttagactct 

tcagggtaagtgcagagtgaggaggaaccataggcaccaacctctcagggaaaaaagcttgtgtcaatggcttgctttta 

agggatgtaggggctgtattgtagctccagtagacatccttaaattcttggagcatgtgtatggaataagatggccctgc 

tgccaagtacgtacagcctggtgaggaccggccacttctactggacattcagtccctgggcgaccgaatgacactatcag 

aataagtcgaggaagtacttttcaggcccatggcctcttgtgtagtaaacaaggacagatggcattttcatcacttct.ee 

cttgtgggggttgaacttctggaaaagaatccggaactttggtttttctagacgatgtttactaactagcgatgataaac 

ttttagctaagaatagagttactttcatggcatggggaagaaaatattttcccaggacaagttaaccatgtctaaaagaa 

tttatttagctttttttttttttttttttttttttttgggatagggtctctgttgcccaggctggaacacagtggtgcag 

tcatagcccactgcagcctcaaactcctgagctggtctcaagcagtcctcctcacctcagtctcccaagtagagctggga 

ctgcaggcatgcaccatcatgcccagctaatattttcttttttgtagagttgggttctcactatattgcccaggctggtc 

ttaaacttctggcctcaagcagtcctcccatctcagcctcccaaggtgctgggattaccagcgtgagccactgctcccag 

cctggtccaaaagaatttaaaatggagcctataaggaaaaggggaaaggtatccagataatcctaagcactctgaagaat 

ggctcatataatatatggaaaaaagaacaaaagtcggggggaaattcgatgattcaaatgagaaaatcaatgtaaagtga 

gttagaaaaaatatttccagactccgggcacagaaaagtaaactctaggtcctgacacagtctgttgagactgtgtccta 

atgagatgcttttccagcctcaggaaggagcctccttaaagagggaaataagagactctcctctgcttgcttttctgtag 

cttcttgtagctggagcagcaagtagaagggactggggaactggtgttgtcaacagatcttcactgaatacagaagggcc 

ctctcaccttccctgaggcatagagggaagcaggcacagactgtattacgggtggacccaagagaggtttctgagtgaca 

ccagggtcctcgcctctcagcagcagcggcaggctcttgaccaacttagaagctgttgccgttggcctcatgctcttcct 

tgtggatttgctgcctccttatgtttatctattagaataaaaataacaaaaatcaggattatgcgtggaagccaagaaag 

actgagaaggttattctcagacctctttactggttcttcctgttctcctcaacttctcattgttagggtggccccaggac 

attctcatctatctgtattcactctcttggtgaatgtggccaggtttattgcttgaaataccatatatatccttatgcct 

tccaaatgccatgcccagcccagacctggcagccagcctccaggttagaatatctgcccaggacatctcccctgggctgt 

ccggtagatatctcagggttctgcatgtctaaaatggatctactgatatgtccagattccaaagtgcttttcacacctct 

gcttccacagctctggtttgagccaccatcatctcacctagactgctggtctccctgataatgccattgcacctctaatg 

gttttctcagcaaacgttaaatcacaacactctgtttaaaaacctgcactggctctttcccaatgccctcataatgcttg 

cagtgccccacatgatctgcccttcaccctcacccttctctctgcctttgtctctggccttacacaccccactccaaaca 

tgctggcctcctgaggctccccaagccatcccaggcacggccttgcctcgggctttgcactgactgctcctttttccaga 

cgcacttttgcaggcgtccatgagccacctcccataccttctccaagtctttgttcaaatgtcaccttctgagaaagaaa 

ggcccacacactctgaccatcctgctgtggagtgccactggtactcctgaaccccttgctcttagcatctttttatttct 

gtagcacttacctaccttttggcatgctctgtaatttgcttatttacttactgtttgtctccacctgctggaatgtaaat 

tccatgggacagtaatctttgctctgtttatcagtgttttccatgagcctagaacaggacctgacccaattaaagtattg 

attaaaaagacaatgtaggccagtcacgtgaggcctgtaatctttgccaaggcaggcagatcacttgaggttaggagttt 

gagaccagactggccaacgtggtgaaacctcgtctccactaaaaatacaaaaattagctgggtgtgatggcgcacacctg 

tagtccaagctactcaggaggctaaaagaggagaatagaatcgcttgaacctgggaggcagaggttgcagtgagctgaga 

tcgtgccactgcactccagcctgggtgacagagcgagactgtgtctcaaaaaaaagacaatgtagactgtcagaagagtt 

ggcagtgtcctgatagcagagaggcccagagatgaaggctttccaccaagtgacgatcctgggggggttggaaggcaatc 

ctattgaattgcctgttgtcagggcacccattggaggagtctgcacccttgtaccccttagggaggaggcagtgagcctt 

gagataagggcagttcagtcatcctagcatcttagagtccctctctccagccagcccaggagggctgtgttggtgtcagc 

agttgggtgcactgaggtgccattcatgaagctgaccccatcctacaaacgcagagctcagccagcaagagagtaccttg 

atgacactggcagagagccttatgtctacatgtagcactctggaaaaataccctgtagctagccccttttacaagttaat 

gagtaaaaatgcagttatagcagttatctatttttttacttgttaagcttattcttccttttaaccgcaagtcagctgtt 

ctctttgccatcactgtagcttttcaaatcattccacacccggtttatacatgagagagcattgaaatcctctgcaacta 

ttaaggccactgggagttattttctcttaatatttttaagaaagttgttggcttgggtttgatcatttcaaatgtgtttt 

attttacttaaaaatatgtatttgtaaaaatattgaatatacactaataattacacacatttcttctgtttgcctcttcc 

ttatgaaaagttgattgttactttaataattaactttttcaccttcttagattttaccagtaaatatctgctaacttgaa 

taaacagcatgactaatatttgttataattatcttagaattatttttcttatttgggataagccactgtgggccattctg 

atgaaggccaagtggaaggtcagggctctctctgaataccacatgggaacgatacagtctagaggcctggttaggaagcc 
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tggcttaccgaagttgtctcactcattacctcatgagccggcacattgttgtgacaaagcatcagaagggagcccggagg 

tggcatgtgggagctgttttcactgcctgggcctgaggtgtgggcaacggtgggagctgattctgggggctgggataata 

actcagtactttcctctctcacagttgtttcctctctttgctcctggtggctgctgtggtttggaagatcaaacaaagtt 

gttgggcctccagacgtagagaggtaagcttcagtgggtaaagattaaagaatccctggaagagctttttttccttcttt 

ttctcttaagcaagtgggttttagctatttagtgataatggacagacagacatctccacggaaggaataatgcagccct t 

tctgcagtccatggttctgcagcatatggtcagtagacttagatgagtgctacataaagtgtaattagttctgcaccaca 

gtgtcttaccttccgaaaaggaaactctggtattgacttggaaatgccagttttacccctcagaccagaacccaagtcct 

gactgaggccagagatccaagggcgtattagcacagccaccgcgtgattaggcaccgccttcatgagaacttgtcatgcg 

agagggaaatcagccatttaagcatctcaaaaatttttcattattcaaggaaagataaatgtgtgtgtagttaagttttt 

aacttgagttttatttttaaataagtcttgatttgtttccagagcttcattccaaaagtaataagcaaatatttacagct 

gacaaatttaaataaataaacttggttgcaaattacaacatatcaaatgcccatcaatcaatgagtggataaagaaaatg 

tggggtgtgtatgtgcgtgtgtatgtatatatatatatttatatacatttgcagcaacctggatggaattggagaccgtt 

attctaagtgaaataattcaggaatggaaaaccacacttcatatattctcacttacaagtgggaactaagccctgaggat 

gcaaaggcataagaatgatacagtggattttgaggactcgggggaaagagtgggagggggatgagggataaaatactata 

catcgggtacagtgtacactgctcgggtgataggtgcaccagaatctcagaaatcaccatgaaagaacttattcatgtaa 

ccaaacaccacctgttcccccaaaaacctattgaaataataaataaataaataaataaataaataaataaatattttaaa 

aataaccaccattaaaataattacaacataaaaactttgtgtattgcaactttcacttgcatcattactttgcctttcat 

ttagtcatgcctgtggcattgttggcattgttgctaatggaaatgctcacgtgtatataagatctagacaaagaagctgt 

ctggggtttttttggatctactcatgcctttttctatataaaaatgtatttatagataagatttagaacagaagggaagt 

atgtaaaatcacaactcacaagcactcagcactgttgataggatattcatgtctgaatagggttaagacaggctccagga 

tatgggcctcttgttgtgggccacagtaccacctttr.ctgacccacataaagacgatgttatcatagagggaatgttgca 

cactctttattattattttttaaataagtggataacatattcaaagggttttttagataactgtattactgatatacttt 

aaaagcctaatgaattacgtgatgttctgaaaaaaattatattttcaatttaatttaaatgtatatttattcacaatgga 

gtaagaaagaagtaagaagccacagtaaaccagttatcaagaaaggagttgaaaacttttgagtgctactctctccaaga 

aggacattagacccagggagtttcaaggttgagatgagcttatggagattatgatagcctgttcctccctgagtgcttat 

gagaaataattgtttctaaaatctgtaaagcaaaaggacagatggaaagcttgaaagagtccgaattgattttatacaca 

taaatcttggtaccaaaacctgacaaaggtagcaccaaaatacaaagggcagctgcatagtgaatgtatgtgcagcagtc 

taattgcagtgtaatgaaagagtcacttaataccagaaaacctattagaatatatcatgtcaaaaaaagaatatatcatg 

tcaatggggaaaaaataacacctgaattcttcccagtggcccagaaatcatttcgtcatttattgaaactcaatatccat 

tcctaatttgaaaacaaaactcacagtaaaatcaaaaatacctttttaacgctatgaagaatatctggataaccagcatc 

actcatgataaaacaaacaaacaccagaggttgtatccaggctagttagttttaacaagagctgacatctgagcatttaa 

aaattccaggcactgagttctaagtggttttacgtagattaactaatccttatctcaaacctatgaaataggtattgtcc 

ctcgttatgttttatggatgaggggactgaggctcagaaaggtgaagtgacttacaccacatagttaactaagaggcaga 

cgaggctttaactagggacttctgtcttcacagcccaagttcgttatcttcgtgtaatcctacagtcattcagctttgtg 

ctagatgttctagactgtgcactaagtgaaagggcactaagaagtgttgttactattggaaaggaaacaaaattattatt 

ttcatataaagttattaaaacaatgtagtagagatttgcctgcataaaaatgaaggcttaagtatatcaaaaaataaatg 

aataacacagaagctaaacaatcaactggaaattgtttgcagtgtgcatgagagaaagtaagatgcacagcctacaggaa 

aaagattaaaatatatatatgtgtttagtatggagttggtcatctaataaactatgcttttacccgtgtgatttggaatt 

tctctgtaacCgtgaagctgaccattcagaatatatttcatcatttttaaaatgtcagccattatttattccctctcaga 

attgtagaagcaatgatttgatgttatttttaaaaacaatagtagtagcggctgacgctgtttagtccttgctacacacc 

agccccggttttaagtgcttttacacatgttgcctcctgaaccctgtcttattgtgaccccagcacttaagcacagaccc 

caaggagcattgctgcatatattagtcgagtaaatggacGtacagctttaaaaaatgtaagaatgttggcatCttaagaa 

aatattcataaaaatgcccttctagaacagagccatatctatccaggccaagattctgtcttctacagtagttccaagga 

ggcagtttgaagtaaggcctacttgtatcccaaggcataaccccaaagcttcagcatcatgcccagaaagatctctaata 

tccttaatgatatgtgaacggctgttcaccatgtgctaggcgctgtgctgtagtatctcatttgatcccagtgacaacct 

tatgggctaggaacacatttcagagggaggacacggatgcccagagactccaaacacctttcttaaggtcacacagctgt 

agaccattggagcaggagttcacatcaagcctccctaaatcccgagcccctgtttgtccctctactctgcaaaatttcac 

atttgtttgtttttagcttataccacctctctgaaggaagggttctttcagttcctatctgccgtataatgaggcatagg 

tttttgttgtcctgtgaatccttggtgccagtttcagtaggaagaaagcaagcctgactttagtagtagaaatacgagga 

gatagggccacaccattttatgtatttgacaagtcgacgtcatagcattaaatccacagtacttttatttttatctttat 

tttcttattttactttaagttctgggatacatgtacagaacatgtagatttgttacataggtatacatgtgccgtggtgg 

ttcgccgcacctatcagcctgttatctaggttttaagccctgcatgcatttggtatttgtcctaatgctctccctcccct 

tgctccccacccctgacaggccccagtgtgtgatgttcccttccctgtgtccatgtgttctcatggttcaactcccactt 
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aagagtgaggacatggagtatttggttttctgttcctgtgttggtttgctgagactgatggcttccagcttcatccgtgt 

ccctgcaaaggatatgaactcatttattttatggccgcatagtattccatggtgtatatgtgccatgttttctttatcca 

gtctatcattgatgggcatttgggttggttctaagtcttcgctactgtaaatagtgctgcaataaacatcgtacgttaat 

atgtatctctgacaggggcttttaaaaaatataaggacaatggtgttatcacatccaacaaaatttagtaatccctaatg 

ttatctaatatcaagtctaaaaagtatcattttggacttgatttgtttgaatcagggaggatgaggtgtttttaacatgt 

agcattaactaataaaaccccttcagtactcattccattaatggctttgccatattatttatcagcaacttcttcgagag 

atgcaacagatggccagccgtccctttgcctctgtaaatgtcgccttggaaacagatgaggagcctcctgatcttattgg 

ggggagtataaaggtgagaatgtgactcagaagtccctataacttgactttttaaaacttaggctcctaagtctgggaaa 

ccagagagagcaaaagccctattccattacctcctttttttcctctctttctttttaacaataagctaaaacctgaagtt 

gctgggtactcttttgcttttgtttttcaatctttgtttcatgtcgctgagcagtcttgtgt.tcaggggattttaaaatc 

taaatgatttgtctgcttttcttttaagtattctgcggcatacaaaacttttatttgaaaagagctacattgaacatcga 

atcattgtttttttgtttgtttgtttgtttgagacagagtcttactgtgtcacacaggctggagcgcagtggcatgatct 

cagctcactgcaatctacctcccgggttcaagcgattctcttgccccagcctcctgagttgctgggattaccgatgtgcg 

caaccacgcccagctgatttttgtatttttagtggagatggggtttcaccatgttggccaggctggtctcgaactcctga 

cttcaagcaatccacccacctcggcctcccaaagtgctgggattacaggtgtgaaccaccgcacccagcctatttttctt 

tatattttggggaattttttaagcagcaaaatgataatacactgtgtattaatactctgtggaaaatatttaccaccccc 

ctccccaaaatgatgaagaaaatggtaagactttctccttgggcaaaatgatggaatatttaagatacgctggagaaata 

gctatgtatcttgaataaaacatacttacttaaaatgttactgagctcacaggaagtaggtaaaatctccagggaatact 

ctccttccctaccctaaaaagaacaaaccataaaccaaaaccagaaccatgatttgtcctaagtggttccagatggtaaa 

cacaacgcccacctgaatagatggaaatcctctctatagaaaaacatcttcaattcagcgctgtcgaagtcccacagatc 

aagatgaagtaattaaagatcaccagggacaagaggcaagccgccataaataagagtagcagaaacaacagatgattgat 

gtggacaggactccaaatgttagaattgaaatctaaagaatataatataactgtgtatgaaatgtttaaagaaataaaag 

atgaaatcataaagatgagagtcaagtatggccaagcagatctgaagaagaaccaaatggaacttctagaaatgaaaaat 

atacttgttgaaattttttaaataaataaactcaatttacatgttaaacagcatatcagacataagagaatttacaaagt 

ggaagatgaacctgaaaaaagtacgcagaatgcagctcagagagacaaggagatagaaagtacaatagatactaagaata 

tggaggatagaatgagaagatccaactcatatatatttccagaatcccagaacaaaggagaggcaatattcagaaatgat 

agcttcccagaactgatggaaaacatgaatatacagattccagaagcaaaacatactgcaagcatgattaaagaaagaaa 

gggaaggcaggaaggaaagaaaacaagtcggatttctcataatgaaattgtataactccaaagacaaagagaacatctta 

aaatcagccagagaaaaagtatcacatacaacgtggagaaactgaacgttagtgcactgttgatggaagtataaagtgat 

ggagccactgtggaaaacagtaccactattccccccagaattaaaagtagaattatatatgatctggcaatctgggttta 

tacatagaagaactgaaagcagagtctcaaagacatatttgaacatcagtgttcttggcagcattattcacagtagccaa 

aaagtggaagcaggcccaagtgtccatggatggataaatggataagcacaacgtggtctatgcatccaatggaatgctat 

ccagccttccaaaggaagcaaattctgacccatgctacaatgtgatgaaccttgaagacatactaagtgaaatactccag 

taacaaaaaaataaacattgtatgattacacttttaagagatacacttgtggtagtcaagctcataaagacaggaagtag 

aacagtggttcacaggggctgggagaaggggaaaatggggagttagtgtttaatgggtacgaagtttgagttttacaaga 

tgaaaagagttctggagatggatagtaatgatggttgcacaacaatgtgaatatacttaataccactgaactgtacattt 

ttaaatggtcaagatggtaaatgttatgtgtattttatcacaatttttgttaaaaatgggaaaagaggccgggtgtggtg 

gctaacacctgtaatccgagcactctgggaggccgaggcgggcggatcacttgaggtcaggagt.tcaagaccagcctggc 

caacatggtgaaaccccatctctactaaaaatacagaaattagccaggcatgatggcacacatctgtaatcccagctact 

tgggaggctgaggcatgagaatcatttgaacctagaaggcagagttcacagtgagctgagattgcaccaccgtaccccag 

cctgggcgacagaacaagactctgtcgcaaaaaaaaaaaaaaaaaaaaaaaaaagggaaaagatcacctgcaaaggaatg 

acaagctgaccgctaacttctcagaagtcagaagagagtgagagaatgtctttgaagtactgaaagaaacctccccacct 

agaatagtgtgcccagtaagcaccttccgagaacaaggatgaaataaagatatcttcaaattagaataaaaattgagaat 

ttgccaccagcagacctgcacaaaagaaatgtctgaaggatgtgctaaaatgaaggaaaattacccccaggaggatagac 

taaagtgtaagaaagaatagtgagtaaataaaagtataatcatatatagaaacctaaacaaacattatcctggtaaaata 

atatttgtgggttaaaaaatagacaagcctaaagtattgaacaacatgatatatgtcagcagcatatgacacagttagtt 

gctccttaaaacattttcttgattttcttccaggacaccacacacttgccttctagctgttcttgctcagtggtctttgc 

tcagtcctcttcatcttttgacagtggtgtgcgtcaggccctgcctttgtccccttcttgtgtcttttacacccacttct 

tcagtgatcttcaccatcacaggtttaagtaccatgatcctctcacctggactgctcccatgagcctccaacacaaacat 

cagttgcctactcagtgtccccacttggatatcttaacacggccatggccaaactgagctttggatgtttatttctcaga 

cttgctcctctgacataccttccgcctttcagcacataacacctccatcaccctagttgctcaggccaaaatcgtggaat 

tgttcttgactcattcttgatgtcatgttctcattggacagccagtctatcagcaaatcttgactccgtattcaaaaatg 

tttcccagccactgtgtcccatctctgccactattcccacttgatctaggccaccgctgtctctcacctggattcttgca 
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gttacttcataactgatctccctgcctctgccattcacccatctagtcagttccttgtgccacagcagagtggtactgga 

aattattttcttcagccaaactgaagaaactgatgacatttgtgggtatagtcagcagaattctaaaggtggccctcaag 

atttccatcccacgtttattcagtcacacactactctaggtactgctagagggattttgcagatgtaatgaaacaaggga 

gttgactttaagatacagaaagtttctctggctggtggcagaaagaggaagccagagagattggaaacatgagaaggact 

tgacacaccatccctggtttagagatggagtggccacagaagtttttttgtttttgtgagacagagtctggctgtgtcac 

ccacactggagtgcagtggcatgatcatggctcactgtagcctcaaactcttgggctcaagggatcctccctcctgggtc 

tcccaaagtgctgggattacaggtgtgagccactgcgtaagccctgaatgcatttcaaagcagagtcttccccagagtct 

ccaactaagagtcccatccagctgacaacttgacttcagcctgagagacctcaggagatagctgataattccccagaact 

catggaagacatgaacagacctgtctctctcggcagagaaccctacagagcttgctgggacttctgacctctaaagctgt 

gagataataagtgggtgttgttttaagcaactaagtttgcagtagtttgttacactaatataagctgggtgcggtaggtc 

acacctttaatcccagcactttgggaggctgaggtgggaggatcgcttgagcccaggagttcaagacctgcctggacaac 

atagtgagaccccatttctattctttaaaaaaaaaaaggaaaggaaagaaaaagagaagaaactaafcataggggacaaat 

ctcatctaaaggctgcaacagaagatgaaaacatattttccagagccaaatgcatccagagagtagacaaagaatctact 

tggtcagccaacccagcttctacaagcttttacatccagcaggcttgtctacttagtgtctaaaggacagacaaaataga 

atgttttttaagtctccatgatatctgcatctgtgtttgagccaggtatgcagaactatctgacaaatctaccaagcact 

caccactgccattcctgtgacctacagttatgaattagaatttcaaatcgagccaaagtacagcatccagaagaaaccca 

cttggcgtggccccttccctacagttcaagcagccctctggaaggggactgcgagacctcagtggaaggtagcaagcagt 

accacccgtgtcactggtgctcagagagtactgttagctctctcctgccctttttcattatcttaaaaactgcaatgtac 

cttttttatatacctgaaattgatggttcttataagcacaaatgaccgttaagtLgtccgtatacattaaagcttgagta 

tcacttaaggtaattttgttttcttctgccagactgttcccaaacccattgcactggagccgtgttttggcaacaaagcc 

gctgtcctctctgtgtttgtgaggctccctcgaggcctgggtggcatccctcctcctgggcagtcaggtgagtagatgcg 

gtccagcgaaagacaccttctaagcatgtgagggagctaagcatgggatacttccctttcctagagaacaaggttataaa 

ggtgataaccaacagtgcctgcccatgccatggcaaggatitcagtgagataacccatttticacctiatatgcccaaaactg 

ttttcccgaggtcatcgtcccttcctggggggacgtcccgggacctcctcaccgttctggcctccccagcctctcagtgc 

tcgtggatgcctctgaccacaccttcctttgctgtcggagtttctggtccccctttgcctctgctgctctttccctgtct 

cttttgccctgtcctgcccctgaatctgggtgtgctcccttcaccctgcatgttcttcctcagttgccttctccactcca 

caacttccagccatctcccctgggctggtgtttcttcaattggtatttctagccctcctgcatttcttttgctttttcac 

tgttttaaatataaggcacaaacacaacgataacataaaggaaacgtctgaaaatccacgcacagttctgctatcccaca 

acccaccaattttcattgttctgtgttctttccagccctggccagcaatttagttgaaatgattgtatagatctaatttt 

tattccgcttttttcatataactatcttaaacatttttatagcctactacataatattttaccaatgcatgtgacacttc 

tccatcaaatatatgaatcatgattatataaacttctattttgaagacagaaaatatggaatacagttttgaaagtatct 

cgcttaattacagagtgctttggaggatgcctcaatgtgatcattttagacaactttcagttgagatgattctggagtca 

gggttaccttagccagagagcaccctgtgtctggtggaaaggctgccccaggatactggtccaggccctgagctccactg 

tcccttcagtgcaccctcagacccagaagatcctccagcccagctttagaatggttaggatctatagtatcttcacgagg 

gaggcctttcgacatggctcggccaggattcactgtcttcatctcacctagggattcctgtgagttgagcccctaactcg 

gccttgcctgtgttgaggacaggacaactctctcgtctgtcactcaagtgcagtttgtgtctcctgccaccgcccagccc 

agcaaacagtggctggtgtgccctgagggcagggtggcggctggggcggcaaggtgacaggtgtgcagccctgtagcctt 

tcctgctccagcctttgcctgacatgccccaaccacctgaaacacctaataaaggtctgctcaatgcttgagacccaccc 

aacagtggggccgtccttgctgccccttctgtaccccctgttttaagtggtgcccttttctaggctcccagagcattccc 

cactgtgtgccaggcacttacagctgtctacccccacccccactccccttcacaggctcaacaaggcaagtaacacgctc 

cactaacacaactcctggcaggtctcagcagagctccgagcctgcgctggccatttcccacgtacattaggaatccagct 

cccccagaaccgtgccttccctaatggatgtcagattactccgctatcaatacaggaaggaaaatgggattgcaagtccc 

caaagcaaagtgggcggcggggcagtgatgagcataaagggtagttgctattgatcgtgggttgctgcgcgccagatact 

aagactttatgcatgccaggtactaagactttgtcagttcactcattgattccccacactagccctgtactactattatc 

cccattttacaaatgaggaaactgaggtgcagggaggttaaagaaccttgcttattccaaagcctggccccagagcctgc 

cctgcctgtatcgtggtatggaatggaagtgactaatgctgactgagtgataggggtgaatctgaatctcgtcataaagc 

gtgctgctatttctcggctccttttggaggaaggcaggagaaattgggcacttagcacacgtctacaatctcatttatat 

gttcccaccaccaagcagagcactgtgcttggttgatgtgctgaaaagcaaactgcaccctctgtgtaaaaagtagggaa 

aggagaagaatgtgtatttgtatttgcataagtataccctcaaagaa'tacataaaaattagtgaaagcgtttacccagat 

ggaggggaaggcggagggagggtcagggtggaaggaagacttttccctgtgtatgtgtacctttttgggggttttttgtt 

ctggttttttggtttgtttgttttgcaccatgggattgaaaaataaatgtttaaattattttttaaaacaaaataaaagc 

cagctgctcccagcatatgttctctctgtggttctcccaaggtcttgctgtggccagcgccctggtggacatttctcagc 

agatgccgatagtgtacaaggagaagtcaggagccgtgagaaaccggaagcagcagccccctgcacagcctgggacctgc 
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atctgatgctggggccagggactctcccacgcacgagctagtgagtggcacaccagagccatctgcagggaagggcgtgg 

cggggaaatggctgtgcggtgcgggacggaagactggaaaccctcaaagcatctg^ctcacctgcatgatcacaagcttt 

ctttgacggtttctcccatccgtgttccagcatctaaccttttacttttgcataggaaatacttgatttaattacaggtc 

cagggatgagctgatggttgctggaggaggccagtgtagagccagtgagagaactaggaatgacactcaggttcactgtg 

gaaaactgttcttgggactgtctcaactgtgcaaaaaacaaaagatggagtgtttacaagtagacattcgtcatcagttg 

ttcttgaacatggtcttttaaaaactagtcagatgaat.taacttgttttcatctgaagcctgctatctt:tttt.aaaagat 

gtgctatttattcttgcacgatttaggcaattatctctcttccagggagtacctttttttctagttgagaattaataatg 

gtccatctcttttgatcatatcaagctaggatagaaggggggctattttaaatgtcaaggtcagcagtgttactttgaat 

gtaaactggtataataggtagttttctatagtaacttgattaatttagtcttaatccatttgaaactctctcttcctttc 

tctctgcctgtccctctccttctccatctcaccctccctctctcacacatacacacacaaacacatacacacaacactaa 

gtgcccagactctaaatagatccagcaattggaaagttagcaagcctaagtttttacataattgcattcctacattcttg 

taaaatttaaatagctaccattggcaatctgctttttttctaaaatctgatttgcagccaggaaagaattttctcaccca 

aggaacatttgatctagcagcagggatgagaggaaagcagaaatgaatgaactgtgaaagctcctgtttttattatcaaa 

aaggacactgtcaagaaggcgccccctgcccccacccccgtgtcaccccaggcctgataagcgatcagaggaaaggactc 

attcatgtcacgcttccttgagcagaaaagagcactgagagcacttgggacccctggatcagagagcatctgtgtgtcct 

gcagcctcctctgaacttgtggttcattctcaggctggggtggactcagatgccaggaaagggacagcctcccattgtca 

ggcagaagctgcccaaagcctggagaaggacttgtttgccccctttcccccaggaggggctcgacccacccaccctccct 

ctcagaccaaggtggtggctgtgaggagggcagcaaatgctgacaaggatgaaaagcacatggaaaaaaatggacyagga 

gggaaaactctgccaaatggaaaatgaccaaatttaagagggtgggacagtcccctgctcctctcccagagggcactgct 

tggaaattgtgttttccccatttatggtgctctgtattctggcattatgcagcagcctcccagaagctctcttctgcttc 

aaaacctgggatctctggcattaccctattgggatggaccgctggacagcaatgctcgagtttgtgaatttggagagata 

ctcaaaagagctaaaactgcagcattttacctttaaatgcagtgcctagagagagagtattgtctcttccccaacactaa 

cccca'ctcccatgaagaattgcctggaaagatgttttcaaggaatttgaaccataaaacactatctgatgcacagaacac 

ctctactttgagactcacctctcataaagcttctttttcacattactgttaaagaccagacgttctagaaaagacccctc 

ctctcatgagcccccccatccctgctacagaacacagcacccatggcgcctgcagtggactggccccttaattcccacag 

gcccccccagcaaggccaaagggaggcccctgggtattgtcctcctacaaggaagatcctctttgtttgttcaaaggacc 

agttttcctaggccaaagaagtctcttccccatgttagtcctatgccttgaaatatcatgcaccatgacccacagccatc 

tggttatgtcttattnttttcctaaaagataatgtttattttxaaaaaggaaggaaggagcaagtgaagtttcattctgc 

tccagcggtggggaagccgctgaatccacctgcttctcctttgcaaccgacagcaaacagctttctccggcctcagggca 

gaaaaagggaatggcagggagtaagaggcgctgggctcggagcctgtttccaagaaggaattggttgtcatctggcagtg 

ttgcgcgtcacaagagagcctgtatataaattaaaatagtcaagacaacactgaccttgcacttgtacataactatacag 

tagtgtccagaatgttcagacattcggagtgtacataaaacagaaaaaatcttcatgtatttttattaaatataacaatg 

tctgagtttcacctaagatgtttttgtgccatatgctggatatccaggttctcgccaggccccgatacatgaataacaaa 

cccaagaaacgcatccccattgtgtgatgtgttcagatgcatctggcaccaattaggtatttcttaaaacaggactcatc 

tgtcagagtgcacatgaaaaatcaggcagggaatcgaaacgacagcgctggaggagactcaggaagcagaggcgtccctg 

ccgctgcccttggccctgcaagcacatcatgaccctttctggcagcctcttggtgctctgggtagtgagggatgaccagt 

cttgtcctgagaaatgtttctcttagtctt:t:aagt.tcaaagactaacctgtagcaatcagactttccaaaagggggttct 

ccattttttgtagttttgtctaaatttttaatgaccatttcctggaatcagtttattatactgaaaactgggggtgggag 

tagggagctagtttgttgataaatagttcccatttccccgtggagaatttgacataccctggactcctgtgtgcctcctg 

ccatccctgcacacagcctggggagaagcctgcgcctccccgtgtggagagaaggcaaccccagatcccctgagctaacc 

cggaggaaaggcagtcctggacagaagactgtcagcagaaggaaagtactggactacccgtgggtaagtcctgccattca 

agactggagacacctgggaaataaaaagagcagggcactgctggtgggaagaggcattttaccttccagtigcaaatcctg 

ctcctttgatttaatggggtgtactggggccaggggctgactcacttccttgggagatggtggtgttttcatgaacatct 

ttgatccttccatttcatttattcatccatccattcaacaagtatttgctaaacactaacttaagctaatgctagggtag 

tgactgagatgtaaaaatagattttagaattaaaacaaaatccaagtcctcacacccctgtcatcccaggagatctttcc 

ttgtggtggtttctgtgagaattggccatcctgaggacacagccaggacggcagaggcctcctggcctcagggcatgccc 

tgcctaccttctgaaatgtttaccccattgaccaaacttggctccagccattgcggtggtttctagatagccaggcccac 

caagagatattgccccttgatgagagtcaaacaccctgcctacaaggagatgttttgaaatggagaggaaaattggcacc 

tcatcttttaaaggcagtaatggaattgatcttcagtaactgaatttgtgcacaaaacattctaaacactagtgaagcct 

gtttcgttgaactaattaattctggctctggaaatgtttttgtttcatagtcatttacgacttcgtttgtttggattcaa 

gcttagtctgttaatatgtataatttagcatctattacactcatgtaaatatggagtaagtattgtaaactatttcattg 

cggggattgtgggtgttatacatacatttaggactgcaattttttggtatttttttgcattgtaaaataacagctaattt 

aagcaggaacaagagaactaagggaggtctgtgcattttaaacacaaatgtgaagaacttgtatataaacaaaagtaaat 
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actataatacaaacttccttctgaaataaaagtagatctggtaaaaatgtggcttttgttctgagtgtttcattttgatt 

ttgcattgttttgcctatatttacattttggtcattagaaccttaagggaaaaaaaaaagcgtttaccagttttcagact 

gcgtcagagggagcacttgacagttaacggaggtgagggtgaaaacccctggcaggtacctcagacagccctccaggcag 

cttgtgagccacagaccttccacacagcctttgtgtgcctacagggcagctgggcccgtgggggcagggacagatctgag 

cagtgaggtatggacagccgtctcaccgctccactaaccacaccactgtctccagtggtgctgttatccctccactttgc 

gctctcctcacaggctcctgaatgtcctctaggttgtctccaaactgctaaatctcaggtgtgctgacaggtctcagccg 

tggccagtgttttcctcggccttgaagttttgaggaagttactcccattggcaaggccttgtttgggggcctcctgtgtg 

tcctgcccccttttatttggctcctgccctcagaatgggtcagagtcatgtatttcagtgcagagtgcagattcccagtg 

ctagcagttccttccagaaggatgaggtcattgaaagaaggaggaaacacaagacggtcactcctgtaggggaagatggc 

agggtactagggccagggcctgaggtgaaggcaggagcctggaggaggcctgcctaccagagctcaggtctggcttaggg 

agggaaatggctggactagtggctgcaaacaagttgagaaggaccttgaaactcttccagggagggttatgagagccacc 

aacagtaacaggatagaaaccctgtgatttggtggtactgtggcagagacaatgctgtgtttaccaaacctactgggccc 

acagctcatcatttcccagcctccctgcagttaggttgggaccaagctaagccatggccaaggaaaggttggaagaagtg 

atgggcaccaattccaaacgtggcacacaaatgctcccacaggggactctcctacggatcacatgttaaagactggaggg 

actgtgggtccctgagtcaccccttagaggccggccgcctgatcagtaagagctgtgctggcctttgcatcactggaaag 

caaactcactctgctgatccactgagatttggggctttgttacactagttagcttttcttattctcactaatatacaaat 

cagcactctgaagtgcagtgcacatgtaaccaaacccttaagtacatggcactggctcagcgtgtggccaggtggcgagc 

agtgataagacaggtgcagctggcaggacaactgaggtccctgccaggcggagcaaaacatctggtaaggccgtttcact 

cgataactcggaaaacaaactccatgtctgctgactttttgacttgaggggaataaacaacaacaacaacaacaaatatg 

tgttgattgttactggctgcttttggcgaggcatcacgagaaaaagtttacttcaagaaagaattagccagtttacaagc 

agaagggaaaaagatggaaagatgttaaccagggccttgcaattctggaaaagccaactgcttctagtctccaaccagta 

agaggtgcaggtgagcaacaaaggccccatgagacttagttgaacacagtgactgggcctcatggtgaccatccaatgaa 

gggtgtggtcttcccactggagcatggaaactgcaaagccaccagcttgagggaggggcacgtggttaaggaagcaaaga 

aaaagctgacatgagaactgtttccaggaacaaaccatgaatgtagtcactgaccctggacctgcctggaaataaataaa 

gatgtagtaaatttaagaaaccattagcaaagccttaaaaaaaaaaaaaaaaaaaaaaaaaaacggtggcaattgaggaa 

gtcttaaatccaccttcaggcaggaaaatggttgtgaaagctgtgtccagcgagggctttacaagctcagatgtgggccc 

aagagcagaactgggcctgagcgaggactccccccagcccaggacagggcccagcaggattgcaaaaacaccgcagagca 

gcaattgctgcatctgcacccgctccttcctgggtaggagggcctaggagggtgtcctgcctgttctcatcactacctgc 

tgggtgggaggtggattaagctgtcttgttaattcaggattgtggacttcaaagagccacctgagggaggctgcatcacc 

tggaggtctgggcggggtgtgacctcgtattgtccccctaggagagggtgcctgtgacccatgtaggagaggaagcacaa 

aactggttttggtgatcagaatagcaaaccgggccagatgcttaccacacatttcctcttcctgagtccacagcttcgac 

tgcatggcctgggcccgtggcatctacccagatgccttgggaccagccctcaccaatgatgtgtcatttcaaggccattc 

acagtgacagtgtctcctccactgtctccatcaaccttgaaggcttatggtgaagatggtaatacaacaggattgaagga 

gactgggtccctgaatgactttgtggaacagagcactataggccaaaggtagtagaattaggtatcttgttttatgctgc 

ttaagtgaattaattaattctctagatttattcacctagttattcaacaaatttttattgagcacatcagggctagggca 

ttgtgtcaggatcagagattttgcaatgaacaaaacagaaaactccatgtcctccaggggagttgataataaaatgtcag 

ataagctctgtgaagaaaggtaaaagtagggttaggggtagaaagtgctggggggattgctacttaacatagtgtggcca 

ggggaagccattctgctaaggtgacagctgagcagaacgatgagtggagggggcaggaaggaatgtggaggtctgggaga 

agagcattccaaacagggatggcaggtgcagaggccctgagggggcagcatgcctggtaggcctgctggatatttctgag 

aatgaaaaagtccattcccagttgtttgcgtggagaacctaacccagttataaacacaaagtacttgagagctggtttaa 

tatacagccagctttccagcagtgcaactaccgtgtacatcaagggaaaactgaacttcgttttccttaaaacttatcat 

cagctggtcatcattttgacaaattctgtcaacaacagcagtgtcattcctggcatccgtatgggtcacgtctgaacaga 

cacacgccctgcagccctgcaggtaccagctgtataacaagaactcccttccaccctgtgtcctggaaacaagaaagcca 

ttagaccggaagatcccgatggctatctcaaatgtgctggatggagttgccagggcccactggcatgccctgtaagcctt 

tccttccacgtctggttcctgccccttgaagactccatttctgagtttgtgtgtgttttactttctagtgtgtgtcctca 

tcttaacttttctctctctcttctgccttgaactgaaggttcgcttgggcgtggagagacaggcccccagcagagcagct 

tcccgagacatcctccgatccagggcttcccagcagcccggcaaggcagggctgtgcctttctgcttcagctcacaagca 

tgccaggctcactggcaagctgctgtctggttgagggactgctcctaaagccctgcacagcccctgtcctcctggccctc 

tggaaattccacccccgtgtccacatttcatgcaaaaatgagctggttctgtgagcatggcccggcctgactcgcttagt 

gggcggtaagtggtttccacttcaacctitgcacccaatcaccgggctccacaccaggatggacattcatgagccgtgaag 

tttccagtaataaatccacagatgcttccagcacctgccttttcgcatcacctccactcccagccacctgccaggcaaca 

ggtaacagagacccagtcacaggagggcagtgtgggggcaggactgcagtctcccaaagcccatgcacaaaaccgacagc 

gccctggcaggacaaggaggctgacattcagatgtggaggaacaaggcatgacccattcctggtcacgggggccacagct 
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ggactcagccttgaggcttggccagacttaacaccgtgtataaaccaggacctttttaggtagagtaatggaaaccaaac 

tctaatgatcttagacagtgctattagtctcctggagctgccagaacaaattaccacaacttcagtgtcttgaaacaaga 

gaaactgattctcacagttctggagaccagaagtctgaaatgcaggtgttgccagggctgtagtctctggagactctagg 

ggaacctgtgcctacctcctccagcttccagtggctcctgacattccttggcttgtggctgcatcaccccaatccctatc 

tctgtcttcccctggtcCtttgctcaaaatgtctgtgtttagtttccctgtagacacctctgcatcactctcataagatg 

cagaggtgcgacatacaggtgttgagagcccacttagataatccaggataagctcctctcaagatctgtaacttggctgg 

gtgcagtgtctcacacctgtaatcccagcactttgggaggccaaggcgggaggttcacttgaggtcaggagttggagaac 

agcctgggcaacatggggagaccctgtctctactaaaaatagccaggcgtggtggcacacacctgtggtcccagctactc 

aggaggctgaggtaggaggattgcttgagcctgggagtttgaggctgcagtgggctatgactgcaccactgaattccagc 

ttgggtgacagagtgagactgtctcaaaaaaaaaaaaaacataaaacataacttaaatcacatctcttgccacagaaagt 

aatactcttttgcctacatataaggtaatatttacaggatccaggggttaggatgtggacatatctttgggaccactgac 

agccatgaagcaatctcataattttcaaataggttctgtcctttttatctttccagtcttttggaaagcatatgcctata 

ttttcaatccacaattctatttttatttgaggtcatttcatttctggcctttattttttattgagacagggtctcactct 

gtcacaggctggaatacactagcacaatcatggctcactgcagccaacttctgggctgaagtgatcctccagcctcagcc 

tcctgagtagctggaactacagacacacatcaccatgcctggctgattcattttttaattttttctagagacaggctcta 

tgttgcccaggctggtctcaaactcctggccccatgcaaacctcccgcttcggcctcccaatgtgctgggattataggag 

taagccgccttacccagcctccagttttattgtgttttgttttgcttcgtttgagacagagtcatgctctgtcaccaggc 

tggagtgcagtggcacgatctcggctcactgcaacctctgccttgcgggttcaagcgattctcctgcctcagcctcccga 

gtagctcggattataggcatgcaccaccacgcctggctaaatttttgtatttttagtagagaccgggtttcaccatgttg 

gtgaacacaaagtatttgagagctggtttaatatacagccagctttccagaaatgcaactactgtgttcatcaagggaaa 

actgaacttcgttttccttaaaacttatcaccagctggtcatcattttgacaaattctgtcaacaacagccatgtcattc 

ctggtatctgtattggtcacatctgaacgacacacgccctgcatgcagccctgcaggtactggctgtataacacgcctgg 

caagaattcccttccaccctgtgtcccggaaacaagaaagtgattgtgatccactcacctcagcctcccaaagtgctggg 

attacaggcgtgagccactgcactcggcttcctcccgttttttttttttttcaatgcttatatttcactctaattaactg 

agtcaaaaattgagaatagttgaatacactttcatgtaaggcgaatcatttagccgatacttaactctgcatttgggcta 

ccatgccgctgtggttagcggggcaaggtgatgagccctgcctcaacacacacaccccgcctctccccagcccacttaca 

cgcgtccattcccacgcaggcgtgggggccttagaggattccctcttcttcgtaaagtgagaatgggctggactcggctt 

cactgcccaacaactcctttttttcctttgggaaactgtccttcccattccatataaccttcatgggctgagatcactca 

ccgagctacaggaagaggcccattactatggccccatcagggttctgcctgggacagtccataaatgctggaagagagag 

gtgcctttccctactgaggctgctaaaaggagacactgcaaactggggctgctggtggcaatcttacactctcagtgaaa 

gcctgcctggctgcaggggaaaccaatgcacagaccagcaggggcaacatcatttgaacccctggagacagctgtgcctg 

aagcccacatggctcaggcacatgctcaagccactctgatttgcttgttacagtcaagagagtccttggccgggcacagt 

ggctcatgcctgtaatcccagcactttgggaggccgaggcaggcggatcacctgaggtcaggagttcaagaccagcctga 

ccaacatggtgaaaccccgtttctactaaaaatacaaaaattagccgggcatggcggcatgtccctgtaatcccagttgt 

tagggaggctgaggtgggagaatcgcttgaacctgggaggtggaggttgcagtgagccaagattgcaccactgcactcca 

ggctagctaacaaagcgaggctctgactcaaaaaaagagagtcctgactgaagctgagaggctggtggacagctgtcagc 

aagcagtgtttgtgagtctgatgtgagctgcgaagtccacacctgatttcagagctggtggctctttttccaatcaggac 

agctccaggcctgagttttggtgtggtctgtacctaaccgctgtgtcttaggtcaggatctctagaagccaagcccaaga 

caggagttctcaaatgccgtgatttactgagggaatgccttcgggggaacctgccagtgaggagagctggaggaggcggg 

cag99g c 9ggagctgagccaggtgggggtccctgctggagtctagcctcagcttgaccccacaggggctctggagcagga 

acagcacactgcattgtcccttgaggcagtcactggctgcagttgcctctgggagcagagtaaaagtggacaggcatttc 

tgggagtacaattccctagagaaggggacagctctgagctgtgttaccagccaccatttccaggggctgggggatgcgct 

gcactggcccagagaggtctctgagcaaggcccccacaacctccacttcaacctcctgccaaacctcaagcagattgaga 

aggagccgctagagacacaggaaccctttcctcatcagaagtctccaagctttggaggaagaaggaggeaggcaggaggc 

ggg a 93 at cgtttcaccccaggggtt:cagggctgcagtggactatgatcatgccactgcactactgcctggtgacagagt 

gagaccctgtctcaaaaacaaacagacaaaaaaaatcaatgctcttagcatctgctgggtccacattgggcagtgtgtgt 

gacgtggtccagctgcaagggaggctgggaaatgcagtttcattatctgagtactagtggagaaagaagagacgaaagca 

atgagaatccctgccatgaggaagtgagagctgattcatttcagcttcctaaggtgaggtgatggggcccagacaggttc 

tgaggccactggaggtcacacaggccagcagggatacacccagtggtagaccctggtctccaggctcccaggacacagac 

cacataaaagcaaggtcagtctcagcacacagatttaataagcactgtatgttatgcctacagctgcaggaaaccagggg 

gaagatggccacagggccaatgccaggcagggcacagggtggggcccctcaagcagcatatggggaacaggaaagactct 

gccagcatggtcatggtgcacaggtggctacaatggtgggtaatgcccagttgggaggatgcagtgttagacggggtctt 

gtccgatggattccttagggggttatatgcttgggaggctgatacaatttcctagaattctgggattctggatggtctct 
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gacctgctctaggggatctgggtctccagagaggtctcagctacaaaagtgaccctctcctgaccccaccttgtgggaga 
ccccagtggtctcaaggtctgtgaataaagggacttacaaccaagatgcctcctgacagggagacgggggctttacagtc 
aggccccagcctacatcccttgccccaggggacggcacacagggtgtccaaacctacaaagggcagcggagtgaaagcac 
cagggccggcttggggggtaagccagccccttctcaaggggccagtaggccacagaggcggcccaggcaggcagggtgga 
gtagctggctgtgctatccgaggtcgctgtcctaatcagagcagggccgggagagccaggacaggaagtatggagagcag 
ggagcgtctctccagggccctgcctgtggaggacaccttgggggtgggagccaagtgcagacttgaggcaaaagcctgtc 
tctgccctctgcctgcacccccaccccaggcccttcatgcctccacccaggcccaccagggacggaagggtgtgaaggtg 
acagtgggggaggtgccagctcaccctcccctccctgcacctacctgcaggaggctgtgctccgggtctccctgggggtt 
cagctggtccagcaggactgggggccacccgctggcaaaggcctgaatggcaccatctatggagggaggggtccagggtg 
gacttagctgggaaaccctagcacagggctcagcacaggggacaaagtgagagaccccctgctgcaaagactggctcctt 
ggacagaaggtgtgtggtgaggaataacaaaataataagcacagctgtgacagagtgactcaactgtacagtcttccatt 
ctcccctctaaaatggctcccagccaggcggcttgggagggggcagtaagcgccgcccccactccccctccactcccccc 
cgggcccctcacccaagcagcggttcctggtaaagagcccccggaaggcttcgcagtcctcacgccggttcccgctggct 
ccgcagtcgcaccagggcgccacgcgcgcgctcacgttgtccacgtagttaggggtgacggcggtgcctgcggggaccct 
gagggcgaagtcatcggcccacccgggcggagatatcccctggagaacccccgccctcgcccggatcccggccgcgcgta 
cccacgaggcccgcgtaggcgcgcaggcagcgggcgccctggtccagcaggcagccgtcgggggcgctgggcgctggggt 
gcacgagacctgaaaggccaggaggcgaggcctgcgcggcgggagggcggtgagccggagagaggccccgtccccaccct 
cgccacggccccgccgccgcccgcgcgcacctgcagacccggctgcgctcgcagaagtttaagggctcaaggcaggaggg 
cggcgcggggccgggccccgaaaaggcgcaggagggcacgaaggtctggcgfccgacgctcggcgcacgcggggcccgcgc 
acgggcagaagsgcagtgcgtgggtgagcgcgggcggcccgcgggcgaagaagcggcgcagggcccggcggcagcgggcg 
cggggacagcccccctgcgcagcccggcccaggcactgcgccacatactcggagcgcaaacgctggcaccgcgcgtccgc 
cgtgcaggcttcggccgcgtccacacatcggttccctccgaccgagctcgccgaccctgggaaaggcgcggggaggctgc 
aggtctcagtgcgccccgcatgcacacttcacccagcccatccacgcccctggaaggtgggggacactgaaaacccgaga 
aagcatagagtgtggcccaaaggcggggccgactggcactgatcggctctecgtcgtggggatgggcagcggcgggctct 
ggagggctctgcaacacgtcagggatggagagggagcatggaggctgagccaggccttccccaaaagctgaagcccctcc 
ttcccaagcatcgccaccattcccagaacggccacacacgtccaaagacgggtccaggggctcctcctacactcaatggc 
actacctgacgggggaacaatgaggctttctggggatctggccaatggagactttaaaaccagagatggcatagaagaca 
gtggctgctcctagtctcttaatgacaggctgtgacgccatacctttcctcctctaccccctccacctcgccgcttatcc 
tccaccatggagaccctttgggtgggtctattgatttctgtttctccctccttcctgacctcctgctcctcctcctgccc 
cctgcatgggtgtctcccctctccagcccttcctgatgctcccttgagtcgaggtcatcctgccccatctgcactggctt 
cctccagcccctccacccttcatcagccaggggagagcggatttatgcaagagctctggccactgaagaacccactgagt 
cctggctgccagggtgtggattcagagaagtccctgacgacacaccgccgggctttgtcctgagccacaggacccaaggg 
ttcatcagtggttttagtgaccaggccccatgataggtgacaccctgctggagacagagaaggtgtctggcaccccagtc 
acacaccaggggttgggtagacagagggtgagctcLgcccacatgcccacaggctgtgcctaaatagagccaggataact 
ggaagttgggacttggcagccccttcccaggatttgagtcacaaaaaccaggctaatcgaatcccccccagcctccacaa 
gttctccagtctggagcatctcattcagactagaagcctttctggtgcccagcctgagtggacgtgcacagcctggcccc 
tcatccatctccctgtgtgtttgtgcacaggggctacggggtggtatccccccccagggctgggagcagggtgcctctct 
ggcaatggctgtgtctttcaggctgtggattggaccgcgtggtgagagtgattcctgctgtgacttcagcacctgcctct 
gcaggaaggatgaacatagctcgagggcttagaatggtccctgagtggaggggatagatggcagcggggtgagtcctccc 
ttcctcccctccccaaagggcacactttgctttcctctcacagccttgccaggcacatcctcaggtctcagctgcacaca 
ccagcacctgatgttgcctaagggtttgaacccacctctgcctgttgcttacctgtcacccttgcccagaacacccttac 
ccagtctccaccctcacagcagctcatctcagctgcaagatctcctccaccttaaacctttaccagattccactttccac 
actacactcggtaaagtgacccctccctccgcgacatccatgcagggacggagcagtccaagaaggtggaacaacagatg 
aacaacaccgaagattcaccaaggaatctgtgtgatgggtaggagggctggggctgaggggagggtgagagagtcgggag 
atggcagacaataggcaaaagttcctggggcacgggtgccaaggctggactcgggagggggatggcggtggtggcttgac 
ggggctgggaggtgaggtgctcagcacatagtggggtcaattgccaggggtccccaagccgtgcgcacaggcctctcact 
ggcaacccttcctgtggttcagcttttcctggaatatcttgccctcccaccttgtcactgaggagcagggacagagaggc 
cagggttcggggtccaggggaagagaggggcgctcacccagtaacagcagcagcagcagcgcaggccccaggcagcggac 
cacgctggaccttcaacagagaaggatggctggcagagtcctagtctgataggcgcccccttcctagtgcctctggagca 
ggagcacagtgaaaacccagctgtctgcgctgatacccctgggaggagcctttgctgcagcaactccccgccctcattaa 
gtctccaccccaggctgcaccgtcaggtaaacctgggagccacgggattcggcgcctgaatgtgcctagacggaccccaa 
cacactagccctgccacgcacacccagcaacacacagacaaccacccagggagacctggtctctcccatgctaaactagg 
aacgctatgaaacagcatttttcttttccttttttatataattctattttaaacttaaaaaaatttttttgaatagagtt 
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ggggtctcgctctattgcgcaggctggtcttgaactcctgggctcaaaccatcctcccacctcggcctctcaaagtgcta 

ggatgacaggtgtgagccaccatgcctggcctattttaaacttcttattgttgactaatJ:ttagacttgcagaaaagttg 

gaaaatgtaagcattttccttacacccctcacccagcttcccctgatggtccatcttacttaatcatagtccaatcatta 

cagctaggaaactaaccttggtacaatcctgttaacgaaactgtagactgttttgcatttctcaCttttccactaatgtc 

ctttttctgttccaagatccattctaggatcccacatcacagttgtctccttattctcctcaacctgggagttccttcat 

ctttccttatctttacccttgacacttttgaagaatcctggccagttattttgcagaatgtttctcttgagttgtctgat 

gttttattctgatcagaatgagacacagcattgttttgactaaccaaaaagttattctataagtaaatattgaggttaaa 

aatctcatccaacctgggcaacagagtaaggccctgtctcaaataaagtctcaacactaagatttaaaaagtgaccagaa 

aagcccccactatgatttgtcttgacttttttttaaaaaacaaacaaacaaacaaacaaacaaacaaaaacgatgtcttg 

ctccctctccccctcaggctggagtgcagtggcacgatcttggctcactgcaacctccgcctcccaggttcaagcgattc 

ttctgcctcagcctcccaggtagctgggattacagacgcccgccaccatgcccggctaatttttgtatttttagtagaga 

caaggtttcaccatgttggccaggctggtctcgaacttctaacctcaagtgacccgcccacctctgcctcccaaagtgct 

ggaactacagacgcgagcaactgcgcttggccccatcagcatcccaaatctccacacaggggcgcgttccttactgttat 

aaggagcaaaggatcagtttgaggacaggtaaaataaaaatgcgcttgctgcctagagggagaagtccctgctgaagata 

gctttgcttgaatgagctcaattgcaatgccagtgctgaggcttgttgactgtacggtcaccacagttgctgctgcgcgc 

ctagaacatggtcactttcttgactacctatcctgtctcagtacatctgtctgtggtttgtggtggtccatttcctaatt 

tttttaatgaatcagaagactgtgatgtgctttccgctgtgctaaccatggccgctgaagcaaaatgtaaaccaagatgc 

ccctgcagtggttgtgcttcactctacgacatctgttaccggaaaggggtccagattcagaccccaggagagggttcttg 

gatctcgtgcaagaaagaatttgagacgagtccataaagtgaaagcacatttattaagaaagtaaaagaataaaagaatg 

gctactccatagagagcgcagccctgagggctgctggttgcccatttttatggttgtttctggatgatctgctaaacagg 

ggtggattgttcatgtctcccctttttagaccatatatggtaacttcctgatactgccatggcatctgtaacctgtcatg 

gtgctggtgggagtgtagcagtggggaccgaccagaggtcactctcatcaccatcttggtttgggtgggttttagccagc 

ttctatattgcaagctgatttttttggttggtttggtttttgagacggagtctcgctccaggctggagtgcagtgacatg 

atctcagctcactgcaacctccacctcctgggttcaagcgattctcctgcctcagcctcccaagtagctgggattacagg 

cacacaccaccatgcctggctagtttttgtatttttagtagagatgaggtttcaccttgttggccaggctggtctcgaac 

ttctgacctcaggtgatccgcccacctcagcctcccaaagtgctgggattacaggcgtgagctaccgcgcctggcctact 

gcaaactattttatcagcaaggtctttatgacccgtatctcctatctcatcctgtgacgcagaatgctgtaaccgtctgg 

aaacgcagcccagtaggtctcagccttattttgctcagcccctattcaggatggagttgctctggttcacaggcctctga 

cacatcctcttgtgttttggcgtgtgggaggaaagaggggtgagggaaggaaactcaaaaccaagctctgaccacacagg 

gcaggtacactctcccacctgtctgtgggtgccacaagtcaagggaggggcagagagagaagaaggtgtgacagatggcc 

gcaggccacagaatgtcagaggaagcccagttcctcccggggcagcccaagtagctggtagttgggtggccaaacagagg 

gcgtcacagctgagctgggctcgctcgctacccccagctcagcgtccactctgcccctcagtacctcctgctcagcctca 

gggtccatgcctaccctcccgcttcccagtcacttctgcgtgcctcctgcttttctgctgttggccccatgccagctcct 

ctctgctgagcttctcttctccagttctgcagcacagccaggtgatcctgggctccagacaggcctctcccccagtctgg 

ggcctcccctcttagagccctctttccttcccacgtggcctccccagggttcgccactgaatggagaaggggtggagggg 

gtgctgggcagtcttggggattagccaagagggcagagttggcctccccagggtcccttgtagctggagtcccgcggggt 

ctagacaccccctcctgaagggtaagagcgggggaggtatattaacgtgtatttttagagtctctcttcttttttttttt 

ttttttgagacggagtcttgctcttgttgcccaggctggagtgcagtggcacgatctcagctcactgcaacctccacctc 

ccatgttcaaacaattctcctgccgcagcctcccgagtagctgggattataggcacgtgccaccacaccgggctaatttt 

tgtatttttagtagagatggggtttcaccacattggccaggctggtctcgaacttctgacctcaaatgatcctcccgcct 

tggcctcccaaagtgctgggattacaggcgtgagccaccatgcctggccttagattctctattttatgatggtttaacat 

ctcggggtgggggcttgttggctggagagaaactgcttgattcctggagatcagaaacaactcatgcctttcatatgcaa 

accgaccagtcttgagtccatacaccaaccacccccttcaaggaactctcacatacgaaaccagtatttcccctgcccta 

aaccagctcagggccaggcacctacccagacaattagagcccaaccccacgccccaaaccccaccagaatgattcaaaat 

gccaatcctaccctcttcccctggcctgccttgcctccccagcggaaacggcactgtgggctgtggcctgtgccttccac 

tcgctcctgattctgtccctggaccaaacctagtgcctccccactgtggccctgcatggtgggaaactgtgagtaataac 

ttatttcaacagcattggcctctgtgtcatcagtcaccttcataaattaaaattttgcaactacaactgaggcaggaggg 

ggctctagagagcactctcacctggccccttgggcttgtggagtggagcccgaggtgaagttgcctccctgcactcagtt 

ttgggatggttttgttatgcggcaacagctggctgatataggccactcagcagctcttgcatgaagcaatgggagaatgt 

gaacgcccaagggaggcaggagtgacagagcaaagagggtgttcaaactaggcaacaccgtcctgtgcccagacacatgc 

ctggggctctggctgccatctaggctcggcctgccagccaccaccccgtcagccaccaccccaccaaccaccaccccgcc 

aaccaccaccccgtgtggtcctcagggcaccccatgggttgaatttacatacagaaaagactaaccaaggtccaagtgta 

atatgtctctctaaaatttattttagagacagggtcttgctctgtcacccaggctggagtgcagtggtgccaccatagct 
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cactgcagtgttgaactcaggctcaagtgatcctcctgcttcagcctcctgagcagctaggcctacaggtacacaccacc 

acgcccagctaattttaaaatttttctgtagatgcggtgtcttgctgtgttgcccaggctggtgtggacctcctggcctc 

tggtgatcctcctaccccagcttcccaaagtgctgggattataggcatgagccaccacgcctatagccaacatgtctttc 

ttttgacttctactttggtatcttttcttaaatggttccctctgtccccccgacacacacagaatgggggagaggctgtc 

agattctgagctccagaacctcaggtgtagcactgggattgggggtgggggctcaggaaccacctaggggagaagacagg 

gtgggaagaaacaggaaggaaggtccccaaaattatgtttgtttgcagaggccagccaggctccaggggagtgtggactc 

agtcgaaccatagggccccaggaccactagcttctggccagcagtcatgccctccacagagctgggtccgtggaaattgc 

atgtaggagacacaccagactcccaggacagagcccttttgggatggccagcactacccagcctccactggtgagggagg 

tcaggggctgtgtgacctttgcttctgggactgatggtttattgagctggagagtgtgcccagcagtgttctccagccct 

caggaacttctagtgtggctctgggttcctggagtgggtgggtcgaagctccactcggggaagaaacttccaagctgcct 

gcaggtgctggaggtccggtgattcactggctctgcccctgcagttcaagttcctggagtggctgtcagtggccacctgt 

ctttaaatctgttcattttaggagctacctctcaccagaggcaggatcttggcatctggacttgatctgctgagaatgag 

gaggatatgttgtcccctaaggactggggccccaggctgcaagctgtgtggcagagagcccatcctcactcagtgaggac 

cagtgatccaggaaaagccacagcttctccctccccagcccaggggcttccagcatcctggtctccatgataaccaagag 

gtcataaactcatttccataataacctgagcccagaaacctgattagggggcagcaaactgaggggtgggagaggtggga 

gggtgggcgatgagagggggaggctttgaatccaggtccctgcctaccttgggggtcaggcgagactgctggcagaggct 

tctcagggtggctgctgggctcatgagagttctcagggtctgggagaaatggtggagggtaaatgttgtgaatatggtca 

gcaggagaccctggggctggggaggggcataggggactcaaggtgactgggtgctgcccatctggaaggaggcaggaggc 

atgagcccttcccttctcccttccctctccacctccccctggtgcctcactcacccaggggccagggctgtccagtggct 

gtggggcccaactccatggggtgaacgccgcccagggggtggtccctgtgtgggccatctttggggctgagcaacgtgat 

aagagtccaggaggttggcacagtgatcctgagtgggttattgcctccccgcagcatggtgtccagcccagggagttctg 

cgtttactgagtttcttggggcacccatctgctccaagtcaccctctcagctcccttcctgctccctcttcaggggagcc 

ttgggatccaggctccaagtgagcctcatgccctcggctggcacctcctctctctagtcctaacatttcctccaggctct 

gacaccacccagcagcctggcactctccagatgctggcatcgctcagcttccaaagaaccctggatgtccgccccttcgg 

cagctatgtctgctctccttgcccctgggtgccctgctgcccttgatgattccaagccatctttgactgtccccatccca 

tcccccaaggccttgtcatttcctgtgatgttccttcaaaacattctcccctgccctgagactcccgcctggggatgaga 

agcagccgccaccctctgcagcgccccctccgtgctgacacgccaggctctggccaccttgctcctctgcccacagaccc 

ccaatcacaactcgcttcgtcagggtcctcttagctgccacccggggcccaggtggtgccctgcccctgtccgctatgcc 

ctctgcccccatctctggcccaaatcatgccatctcccttggcctgcccggagcactcccaagaccaggctatgtcagac 

atggccacagagtgcctgccctgcctagggccctggtgcagggtgagtcctaggacagccatgcttagtattatgtgact 

ccccactccgccaccacccaggtcacagagaactgggttaaggcagggccctggcacaggggcagccagcaccgcagctg 

accagtggtatggagtgaaaagatgtgctgggcccagcatttgggaacttcaagggggtgacagaggtgatttgtgcaga 

ggaagtggcaaagggccgaaaactggtgagacagaggctggacaggcctccgggggcagcatggtacagggactgcaatc 

tgagccagggaaaaacagggcgaagtcaagggtgaggcagccagctggtgggagaagcaggagagtggacaagaggagct 

gtactgggaggtagagggccatgccttgcggtgctggtggtggggcagggatccaccccctcgcttgactgggaggccac 

tggaacctcctgttcaaagctacttctttccatggcctctggggctgctctctgcacctgggggcaaggctgagggcctg 

ccccagctcccacagccccagcagagctctgaggaggggaaccgcaggagtaggctcaggaagcaggcgctcggagccta 

cccactgcacgcagggtcccttctgcagccccagctgcatcgctgcagatgggctcctgggagtcggtagcaacaccagg 

ccaggccggcccctgggagcagaggcagcaggacgctgaggagcatggccagcaggaaggtgtcatggtctgcggggatt 

gggggaaggggcgctgagtcctgagcaggtgcaccaccccagctcctgcccacatgccccccactggcatactttcagcc 

tgcacagggccactgtccacgctgccaccaaagcctggcttgtcacagaagggtggagcccagcctggagcacagtggca 

gttatggttgctattgcaaacctgcagagaagagaagaggagggtcacgtaggattaggaaccccaaggtcacccccact 

cctcgggctctcaccccgtggctgtggcaggcagtcaggcagcgctgaagctcctggaaggcattcttcctgcagcgcct 

gctctggcacacctacgggcagtgcaccaggcagtgagggggacactggcctgcgggattcaaacggcaaggaggggtcg 

gstgggcagagctcaccattctaggtccacactgggtgcctggctctaccaggcccaggccaagcaggtccagctgggca 

ctggggagtgccaaggctccccgacaagtcacttcctggccatctaggtgaacggtagagtccactggcaccatgtgcgg 

tgcgagcaggctgggctttccaccctggcactgcagcttcccacacagggcatccctggggaggaagtagaggggggtca 

acagctgcagtaccccccttccccaaacccactccatagcttctgctccctccactcagctccactccctacctccctgc 

acagggcaggaagtggccctcgctgtcctggccgcagtttccatgagcatctcccgcagagttcaccacctggaaacagg 

ccccgggagctgggtgggagcctgaggaagcatgggccaggctgggggcagctcggagaggggctgcgctcagcgggcct 

cagtttcccctgcccatcttccccacagtagaaaactggccccaccagaggatggggggcagggtgcaagggtgctcgtg 

tcctctcaccaggcccccagagctgctggcactgctgctccagcgtgggacatgcgccatcccagcagtagccactgccc 

ctggcacagggtgagccgtccagtaggtaaacgtctgggggacagtgggaggaggtgcccgtgcaaaactcagggaggtc 
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acagtcacccacggcctggcggcacagcgctccagccggcctcagctgcgcaggtgacgggtggtggggaaggcagagag 
aggccacgtgcagtgagaggtccatgccgagagcgcggctcggagctgggggagccaggcctacccaagcccagcaccca 
acgggggaacctgagggcaccaattaactaaggccaacaggccggctcccaagctccccgaaaccctcaccctgaacctt 
ccatgccctcaccaggcagcgcacgcagcagtccccgtgggcgcactgggcccccgggcgcagcgagcagttgtgagcaa 
agcagcagaggtcgcggcactcctgggaccagaaaggcaagaagggcccaggtgagggcgcagcgccccagacctgagcg 
gagagggcaagtgggggccgggcgagccgacttaacctggccagggccgcagtcacactcctcgcccgcttccacgaagc 
cgttcccgcagagcgccggcggcaccgggagtccggggtccggggcattggagaggcaagcgccgccccccttgcggaag 
aaggcgcgcagctggcggcggctgcaggcgctgaacacgcgcggaaacgggtgcctaccggcacggggagggcattgggc 
atggagggacagtcccccaacccccgcgctcccctgatccccacccctgggcttggctacagccgccagacgcgcagagc 
ccagagaggggaagtaacccgcgcaaagtcacacaacaagcgggacaggggacgatgcggccccaatagtgagcagcccg 
ggacccaaggtggaatcgcgacccgacggtgctcctcccggtgtaggagtaacctcgccaggttacccggaaaataatct 
tcataccgttgagaatccactttgcctgagcttcttccctttaagcctcataaaccaccctgaagcggacactatgatca 
ttatccccattttacagaagaggaaactgagggacgaccaaagaaacgcagcggaggaagtccccaggactagccgcccc 
gccgcagccccgaccccccacccgcgtacccggtggccgcagccatgacgcagcctccggactcggccgcagcctccacg 
cagcagccgtcggggtcgtggctgaggccgaggctgtggccgatctcatgggccatggtggctgcggcgccgatggggag 
ctccgagtggtcctggggggccgtgggagggcggtcactgcggccgtagagcctcctgtctctccctcgcccccgcccgc 
ggggctcaccgtgctcacgcctcccgagctctcggcgcggcacatgccctcgacgggcgccaggcccactgtggcgccct 
ggaaggcgcggcccctgggggcggagcgcggcgtgaccaggcggggccgggaggtgaggccgccccacccgggacccgcg 
tccgggtcagaggcacccacgtgagcagctgcgcggagtcgtggggccgctgcgcccacagcccccggcgccactgcagg 
aaggcccagagcgtggcgttggcgtcctgcgtgacgcggctgcggtcccgctcggtccacacctccaggccggtcagcgc 
cacctgaatgtccagagtcctgagaagctgagggcgaggcggggctgaagccgggacagggcgccccatcgcgccggtgg 
tccttcgtggggcgcccttcctcttccccaaaccccaccagcacctgcctgtcctgccgccgccacccccatcaccgctc 
tctccccgccgcccccaacctggtccacgtagttggcgacttccaggagacgctgtttggtgtggttcaagtttcggtgc 
cgagtcaagaactgggaaggcagaaatcccggtggcttgaggggctgagctggccccatccctgaccccgccaacccctg 
gggtctctcctcaccagggtgtggtctgccacaatgtacagttccaggtacttccgggtcctgcgcgcttctcgcctgcc 
ctgcggaggtgcaaatggggaccctgagtggaagctgctgggcttgagccctgacccccaaccccagctcccagaaggaa 
gtttaacatgttttctggaacttgtttcttcagacttcaataaaaatactgggactcgaggcctgtgaattcctgtctct 
tctgatttggagggctatagatacagcattcccactcccatccgatcgatgcccctgaccccgctctggggaccaccagg 
aaggctggtcatgcccgctttgttcccaggatccctgtggccacaggttcctttccaggtgagcagctgctccatccgaa 
agatctcgtgggttgagaagtccttggagccccggggtggccagggacgcagataatagctggcattcctgctgagggtg 
atcaggccactagggtgcagaggggtaggagcgggtgtgagggagctctttccccatcccaggcccagcctcctctccca 
gagctcacctcatcccagagcaggtgcagaggactacccaggagtcggggaagccccttactcgcccttggtagtggcaa 
tgatcctagggaggaaggggccagccccaaatctcagccagggctggagcaagaggggcaagagggagggtgtggtaggg 
gctggctccaaccgccccttaggaatgcaaggaggagtaggggtaggaatggtgggggggtacctctggcggtgcatccc 
agagcccatggaagcatctcaccgtgtggttgggggccagcaccactggctgcccatctgggccgtagtgggtttctatg 
tatcctggggccagcagcctgctgagagggggtgttacagggaacactgaattcagcttcctcctgcctcctccaggatg 
tctcccagccttcctccctaaatgctaatggagcagctttatgagtgagacactcacagtgtgtcttagggaagggacag 
gagcaatggtgacttgctcagatcagaaactcttggggctagaggaaggagccttggtgatggcttagttgtgggagatg 
tgaatatgggaagcaccagggaggacgccggggaggagtgggaataggggaagagtttgtggttcccaggggacctgcag 
caggcagcaggatccacaggatcgggaggggaggagtcaggagacactgccgaagaatgggacttggagttggggaaatg 
cggtgacctcccccagttcccctgcctgctgccctcctttgttgggcatctggtcgaccctcttgcccccacctgcccta 
gatccctgaaatattttcctcagacttctagaccccacatacctcccacctgtccttcagtgattgatgctcaccccctg 
cctccagagaaaacagaatcgccacctgcccacgctgcttccaccctccctgccttctccacacccactccggtaatgat 
tccatcttcaggctccatctcaacaggatcttccccacacggatggatcaatcataagtcaatctgtcttctttaaagaa 
aatccttaacccaacctcaccttggcctcattacctccagaccacccgctaatgatggctgcttcccccctcccaggcat 
tccaccacctgccccagctctgccccctacccctgccccacacacacccgccaccctaggaggtaggtgatgtgaccacc 
ccattgaaagggtagggacgtcgggaaaatatggttgggcacagtggaactagagtttgttccctgtccatccgactcca 
cgagggagaataaaatacgtgtcaagtgctcagaacagcgcctgcggtcaagcactcagtaggtgatatatactgataac 
ataatctgggtggttttaagagcctgcgctccagcccggacacccaccccacccagcccaaggcaccttcctgagcacag 
gtctgcctgtcccttccccagctcaaaatctttggctctggatggtccaggacactgagcaccaaaatggccttctgtga 
tctggccctcctgggactctccaaattcattcccccctgctcccccccctgtagatggagaccttccaggcaggtcagac 
aaactgctgcccccaagtgtagccactgcatctttttcttttttcttttctttcttttttttttcctttttctttttttt 
ctctttttttgagacggagtctcactctttgcccaggccggagtggtgcagtggtgtgatcttggctcaccacaacctct 
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acctccggggttcaagcgattctcccgtctcagccccctaagtagctgggattacaggcgccgccaccacgcctggctaa 
ttttttgtatttttagtagagacggggtttcaccatgttggcaaggctggtctcaaactcctgacctcaggtgagccgcc 
cgcctcggcctcccaaagccaccgcatcttggtccctgccattcccttagcctggggtgccggctcatcttttccctcta 
ggatttctttagactcagcatatcttgcaaatgtccactaggtggtgctcactcatcgccagcagggagctaacaagccg 
ctcctggggttgggagggcggaggtgccccacagcggggctgacagcctcagcggtcctcttcagcctccagggagccaa 
ccacaggcctgcgtgactctccctgtcatctgcaccctctctggggtcctctgcccatccagccacccgcacagatctgt 
gtcagtccctgccccccaacactgatcccctcctcccagccctaccccagcctggcactcactggttcttctccagctca 
agcaggagctcctggccttcagcctccagggccaccagccccatgtctggcttcgagacctgggcaagaaagagtgtgga 
gctgagatggtggcctccaggcctcctgcctgccagggagtaggtggcctgtggagccggctggggaggaagttcttggg 
gagaacgtgggctggggagtcagcaggaccccccacatactatggagggcgtggaggaggtgagaacatacaaagatgtt 
cccaaactcaggatgtttgcagtcctgacaacagccacttggaagggcgttggcacagcctgccaggcacaccagcatcc 
tccctagagaccagaggtcccagaaaggtgcccctcccctggccccgccctcttctttcatgcccagaaggggcatcaaa 
a 9 ca 9ggg a agacagaggggtgctgaggacattatgggggcatcgggtagccatggtcagggcctcctcagagcctctgc 
tacctgaggcttggttccaaatgagctgctgctcatttcctatagaattcaaatttgactcctccacttccaattttggc 
aaactgctccctcttccaaagttttcctgggcctccagcagcccccgtccctccggctccgacacctgcttcactggacc 
cacgaagtaaacatggacgccattccagccaagagagcacactggctctcagctaggtgtcaggaggctggcttggacgg 
ccagccctctctccttcccccaccctcttggcgtctcccaccctgtgggaacaccccacttcccccttgtccactcagcc 
tggctgggggcccagagttggagccggcccaggagcttcctgggaggctgctgcgccttcggaatgtttaacccccgact 
ccttttctccaaaaatgcactggcctggggccctgtccaagggtctcagagtctttggagggagttcttccttcgcaagt 
ggggagcagat^gtccttgcctccctggccacaggccccacaaggcctccagcatgagctcatgaggctggaatgccact 
tgctttattggggaaaggtctgcaccgggaaaaaggccatactcgaggtccctgttcctctgcagcccctgctatcttta 
ctcttgccctcctggtaccctgccccttgatatatacccctcatcttgaaatgtgagtgtttcctgccttttggagggga 
tacctagcctctactctttcttctgtaccatcttggcaggcttcctgggggcaggggcccaccggtgggggaagcagagc 
ccctttggggctctcctcttggtcacagcccaggccagacagacagggaggcccagaggcagagtgaccccagtgtgtgt 
ccagccttcccctcctggggatggggagggcaatctcaaagctcaggccagtgccgtgcttgaccagtggaatgggggcc 
ttatgggcctaggggatcccagtgagggccctgggttgggagctgctgggtctctgggggcctctcagccttcatggcaa 
tgctcccctgccttccctcttgctggattcggacagtagggctgaaaattccaaacaaagagggctctctaggaggggca 
ggggtgtagccaatggtttaaaatcgttcagaccttagtgggtctcaggcccccagcctaaagagctgtgtgaccatgga 
caattcccccaagctctctgggcttccgtttgcccctctgtaaaatgagcatatcaaggctactgccctcttagtttgca 
gcacagatattatggcacaaacagacggggcatggttattctggaagcgtgtgaagagcgggattgggaagaggctgggg 
cagagcgtcctgcagaagaagcacatggggtggtcttacatctgggggacatcaggagagtgaccactgccccccccata 
ccagaagtggattccacaggagccagtgaggctgaaggttcaggccttcgtggcagggccctgagagggacagcagtgtg 
tccacagggtcacatgttctggtcaactttgcaaaaggttttctttttggtgcttttttttttttttttttttttagagg 
ctcctgaaaagcttcaggacccacaaactctggacccatttctgcctggtgggggtgggggtggcccagatcatccaggg 
agggagggaaagagggaggtggggtggagaaagctgaaatgacttccatgtgtgcgggctcacgagatccagatgtccaa 
accccagtgccttcttctgcccacttgaggggcaggggaggcaggggcctataggagtagtgacttggtggttctgggga 
ccccagcaaaactagaagctgtaatgtagggagagacaaaagggctgggaggttcagggcccctgtggagggcggggaga 
catggcactgaccggctcctccaggctgacggtgcgccagggttgtccatccaggacccagtgcggggtgactggctgcc 
cagggatatgtcctggagtaaagacagagcacagggtgagggggacctgaggaacacaggggcatgggacaaagcagagg 
gaggggggtagaggacatccccagggaggcactggaggccttttggggcagacttcaccttcaacacgcgtgggctcagc 
ctggagaaggagggacgcccgtgggcatccttggatctgaggagctatcaaggaggaggaaaagagaaggctggaaaggg 
acagctcagctggggacacgggagccccctgacctttgtcggggggcaggcttgggctcggcgatcacaaggaagaggcc 
aaggccgccagtgcagaggggaaggaaagagcgcggcagccttagggatttttagatgggcagcagatgcctttagggtg 
agagatgtacgaagagaggacacttgtgccccccccatcatctgagaaaaacaacagccagatgttgccttgcgaggtcc 
accttgcccagagctccctcggggactctgtcctggtggcagggttttggtaccctggcccagaaggcccctcctcatct 
cttcaaggggaggggacgcttccggacggagccttggtgctccctggccgggtgtgcctaagggggctctaggaggaatc 
ccagagccaagcattactcagagggcgcctggaatgttcccctggaatgctcccagcccctccactggccccaaccactc 
tcacaggcccgccctgcaggagccaggccccaggcacccagagcctgcagcagccctccttcccccggtacccagtccca 
gctcccagaacagacagcctcccccctccacgcagccctggcctcagtcctgccgggctgatggctgcctgtggaagtga 
ctcagctcctgctaggccaccccaactccttttttctcctccaccttctctcccagactacaaacatcaaagacccttcc 
tccaagaagccctccttgattggatgagtgaattgccatcaggcagatgagggccgagaggagtctgccaccttggaaag 
gaggctagaggggccagtgcagggagggctctgagtggatgtgggggaggggaaggaggggaggtctctcagcccagaga 
gcacttaactgagagtagagaaccaagctttgctgcccctaggcctctaagggt.ttggggaagaggtagggtgggcccgg 
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gcacaggtgtggtgtgggtgcagtgtggtgtgtgggtgctgtccacatggccttgcgcgcacgtgctggccacgggcacc 

ctgaccccaatgagggagagaggggcagagctggagctggagctggagctccggtgaccgggtgaatgggggtggaaccc 

gagggagccaggctggtattgggcacatagacgcccctctcccaggggtcccatcacctcccctgaccccaggatagggc 

tcagaggggagggagcagtggaccgcctggggccctcccctggggccagaacagaccaggcccctgtacctgtttggtcc 

ccacacagtgctgtggaagccaccgcccagtctgcatagcacagcccagccccgcatgccccctccctggttgccctccc 

tgttcccggccaggcacttgctgtgcaggactggctaatcctcccacccgcttgcagaggttgttccagccccatcttaa 

catctttgtttggaggggttaccccgaggagacagctgcagtctttccagagcactgctaaacagacaccttctatctgg 

agaggcccttctctatctcaccaaacaaggcaacaatataaacaacatacacactgccctgctgccctgggaggagggac 

gaggggtgagcagggtggaggccacagctagttctgcagcctgagagcaaagcagggactctgggggactcttgggcatg 

ggggcttcctagaggatggagccccgctgagtcctaaggggtggaggagcaggagcgggtcacacggtggcctgcggatg 

gaagctggttgtgagagcgagaatccaggcagagggggctacggctatgggctgggggctgggggctgggctgtccccga 

gggggagggagccatgccctctgctttgccagcggagtggcagccgggcagtgtgggcaagtccgggcccggggccagcc 

caagcacacttgagcgtccctgggcaggtcccacggagacccccccaaagagtccccacgccctgacctactggccgtat 

ggtgccggggccgtgagaccctccgcgcgctgacccgagctctgagcagaacccatccccgccaccaccaccgcgcctag 

cctgcccctcagggcgcaccccgcccgcgtcctcaccttgaagcaccccggcgcctggcactggccagagcagcagcagt 

agcagcagcagcagcaacggggtcccccgagctctccggggcctccagcccatagctgtgagctcctcggcctctaggca 

gcggctcgcaactccggccccgcccaggctggattgcggccgacccgtgcccggtgcagcctcaggccgccgccttcgga 

ccttcccgcccccacctcccaccgcccgccctcgctcccgcctcccctccccgccaaccccgctcggagcctggccaggg 

gccccgacggcgcgcgccatgggggagccgggtcgccactcccggaccgccgcccctcgagggggtggagctgggcggag 

gagggaatccgtgcggcccctcggatgaccggcccgagccgtccctccccgbcggtctcagagggcctctactcctgaga 

ggaggagagaaccgctgggaaggttcttggaggaccgcggcgtggtgggatgaggcggtgggcaaaggccgcctctcgct 

gctgaagttggccccaggagcgcgatcttccgtggtctcctggggccgatctctgtcccctccttgctacccgtcctgcc 

ccgagggtgccctggcggaggttgagtcgggtcatccacctgcactgggtgcccccaaggataggaaggttcaggcaacc 

ggctgccgctgtcttgggggcttcattgctgggcaaaggcgatgcagcagacggagacaacctttcttccctggcggtgg 

ccagagggcagaattgcataaaagctgcagactcccaggcctgggagaccctttcggcctcagtaacatctgtttcatgt 

tttaaacttttgttttcctactcggtgcaaatLttggatgagatgttaactttttttttttttttttttttgagatggagt 

ctccccctgtcgccaggctggagtgcagcggcgcgatcttggctcactgcaacctccgactccctggttcaagcgattct 

cctgcctcagcctcccgagtagctgggactacaggcgcgcgctaccacccccagccaacttttgtatttttagcagagac 

gaggtctcaccattttggccaggatggtctcaatctcctgatctcgtgatccacccgcctcggcctctcaaagcgctggg 

attacaggcatgagccaccgcgcccggccggagatgttaacttttaagcaaatctttttttttttttttttttttgagac 

agagtttctctcttgttacccagactggagtgcaatggcatgatctgggctcactgcaacctctgcctcccagattcaag 

tgattcttctgcctcagcctcccgagtagctggcattacaggcattcgccaccacgcctggctaattttgtatttttagt 

agagatggggtttctccatgttggtcaggctggtctcgaactcccgacctcaggtgatctgcccgcctcggcctcccaaa 

gcgctggaattacaggcgtgagacaccgcacccagcct.actttt.aagt:aaatccatttgtt:tttgagaatttggaatgta 

gtaatttggttagtgaaagttcgagcagtgagagaaacctacattcacatatctcaaaatcaaaaagtacagaaagcata 

gggaaaagtctccgtgctcttagccctcctcaccaacaggaaaccaatatgattagtttctttcataggcttttagatta 

ttttttcacactcaagacaatacagacatatttttttctcttattaacgtttttctgcactttgattttctttttttttt 

tggtcgcttaatacaccttagatatcagtgcgtttagagggtccttgttgttcttatgactattatttagagacagggtc 

tcactctgtcacccacgctagaggacagtggcctgatcatgcctcattgcagccttgaaatcctgggctcaaggtatcct 

cccacctcagcctcctgagtagctggaactacaggcacacggcaccaggcccagctaaaatttttaatttttctgtagac 

a 9ggggtctcactttgtttcccaggctggtctcaaactcctggtctt:ggccaggcgcagtgtctcatgcctgtaatccca 

gcactttgggaggccgaggcgggcagatcactggaggtcaggagttcaagaccagtctggccaacatggtgaaaccccat 

ctctactaaaaatacaaaaattagccgggcatggtggtgagcgcctgtagttccagctacttgggaggctgaggcaggaa 

aatcgcttgaactcagaaggtggaggttgcagcgagccgagatcatgccattgcactccagcctgggcaacaagagcgaa 

actccgtctcaaaaaacaaaaataaaaataaaaagaactcctgatcttaagtgatcctcctgcctcagcttctcaaatcg 

ctggaattacaggagtgagtcaccacagctgtccagctacgagattattacttattattactactttggattttcaaatc 

aacttcattaaggtataatttacacacaataaaatgcacttattttaagtggccagtaagatgagtttcgataagtgtat 

ataactacataagcatcactataatgcagacacattccctcactcacagaaagagccctgtgcccttccagccaaacttc 

cccactcccaaccccagacagccactgatctgttgttctctgtctatagataagttttgcctgttctagaatttcatata 

aatggaatcatgcggcatgcactcttctgtgtctggcttccttccctctttccgatgtttttgagattcatttacactat 

tttgcatatcaatagtttgttccttcgtattgctgaatagtgttcggtggtttgagggaaccacagtttctctactcacc 

agtgcaccatagggttattttccagttaggggctcttataattggaactaitatttgcacagagagagagagaggaagaaa 

gagggagagagatatttattatagcaattggctcacgtgattatggaggccaaaaagttcccgaatctgccatctgcaag 
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ctggagaacgaggaaagccagtggtgtgattcagtttgagttcaaaggcctgagaaccaggagcaccagtatggaggtgg 

ctcgagctcagaacaagttggggacaggaaagcagagcagcaccccagagcagcccQtcagcgacacctcttcagtaaag 

caaggctgaacacagaggggctggcttcagtgtggatgtcaggtacagaaggcagctcgaggagctactctggcgttctt 

gcttactggtattcttacctcgaactggccaactcctacttaaactgcaggccatggctttaatgtcctgtcattcagag 

gctgtcccttacccaaagccaggttagcatcccctgactgacacttctccctgcaacacgtttcagaaggccctgtagtc 

gtccacttccctgtctctctccccaagctcctgagctccatgtggtctgggaatatgtgtgttgctcacttcctagcaca 

gtcagtgctaataactgactgtagaggggacacagtcgaaaagccacatggggatcagagtcatccttacacagttgaca 

cctcccaaacccagatgagctgtgtccaagtgcaggtcagaggaattttctgccgaagtctctgagaaagggtttattta 

cattttgaggttgcaggggaggagatgaggccaccaaaccaaagctgaggaagagggatcctaggatgcaccgagcagct 

ccgggggcgcctgacagcacctgggaaagatggcttctccactggcttgttggcgtcaccctccagaggggcatcaggaa 

atgtcctgggaaccaggcaaaccagtgagcattaacccttagaagtgcttggcatgggtgacacccaccacctgtaaaca 

cgacttctcccaaggagtgacgcagaacaggatgtctgagggaggcactccgactccagccttcagagatcgccagggtg 

gcacctggtgacgacaggctgatgcttgggtgccccagaaaaggtcatgtgtgtgaatgggggccccaaagccaacgctt 

catccctgacagcctggtgcatttagaggggaactttttgtcccttggcaaggtgggtggaatttcaggttcatagggca 

agggtattttagctttaatagatattgtcaaacagttttccaaagtcattgtacacactctgtgattctacttatgtaaa 

gtttaaaaacaggcaaatcaaatctatggtgtecgaagtcaagacagtagttacccttgtgggggctgcaactggtacag 

agtgtaaggggggactgtaggatggtctatttcttgatctgggtgtgttcgcttttggaaaagtccttgagttgcattta 

taatgtgtgaacttttctgtatgttacactttaattgaatgtacaaaaagtctcaggaggcctcagaccactggaagcgg 

acacaactaacccccctgagagcctccaatccaagatggacatatgtccccttggaagtatgcagaagcaggtgaagact 

cctaagccggatattcccaaatccccccagtagccgcagcttcagcagctgcttatggtcctccctacaccctctcttcc 

ccagacagcccccaaacatctggctgcatt tgacttgctctctccctgtcccacctctggatttagtccatgttctccac 

cctccccactgtcagcaatgtagacaagacaaacgctcagttcacgtgcccacctactgcgtgccatgcacggggctggt 

cattgtgggtggcaaatgtgagcaacacacgaagcctcaaggagcagaaagggacacaaatcacttcagcgtaaggtaat 

ttgtgataaatgtcatgtaacttgcagcccctggccccctcctacagatggtgtctaagaataaaccccactaacatgtg 

actcctctgttctagcccagctgtttgggttgcaagaaagagactcactccagttgcgtcataggatggagttttattgg 

gaggacattctggacgggctcccagcaagagtctggcaatggagcatgaaaatgaatgagcctggaatgagggagggtgc 

ggcctcggctacacaaagttcacccggcccccaactgcctcccagcgtgttagctcctgtggcaactccccacttctctc 

tctgtttttcaacccaaatcctagagcagagggctcttcgctcctcacaccacctccaccaggctgcagggggagtcact 

agcccactcaagagctctctcctgttggtccctgattcgcaggggcacgtcattccttcttggtaactcattctctttaa 

cagcccgacacagggtctccaggaaaaggacaggagctcacacagtcatgaacagagatgcatctggagcaggaataagg 

atgctgacaacatctgtgtctgccctccactcttgtaagtgccagtaaggtagtacccagtccctgggggtagggggggt 

ggtggtgggaatgagggcatctcttctttcaggggccaaatctggcacaaggatgtctgcttaggcaactccactgcctg 

gcatgctctctccctaggaaaacaccaaacctttcttatttcctcagtcttagtgtgagagtgatcacctcttccaggaa 

gcccacaaccagaatggccaggggcacaccctggccctcagtagatgggcactgagacaaaacatggcctgctggtggca 

ttcccaacaatgtcaaaagtctcagggatgagctcacagttgggagtcctttggaagtccaattccaaatgtcccctgga 

ctcaaaataaaaactacatttcccagtctcccttgtagcttgcagtagccatgtgacaatactccagccaatgggaagtg 

agtggaaaagtcctgtgcagcttctgagttgcgtccaccatgggcatgggtgtgcttctacctccactcttccttcttct 

ggctggagggcatgtggacaaagcggggccatggtgagcctcacaatcaaaggcatcattttagggatagagaaaaggaa 

gatagaaggatcctgagccccaacatcataaaccatcgtaacagccagactagcgtgggagagaaaaataaaattctatc 

atgttgaagtcactgtatttggtctcttgttagagcaggctgactgatactctgttgaatacagaatgccttggtgagcc 

tctgaggatgcaggatgctgcaatccaaccaaagtccagggtgactttcgaggagaaggatgtgaaagggcagggcttcc 

tttgagggagaaatggagagaagggaggagatccagagaagagagagcgagatcagctcagtgctcttcagtgtcctcca 

gggtcaacaccctcctgggcagacctctcaccacccatttttgggggggcaagaggcttggggccaggccataaaaagtc 

agatgtcacagagctgttttcgtagaggcgggcaggactcaacactgcctcattcacattcataggctggagtcatcaca 

gagtctggccactttctcctccggagggcaggcaacaccactggtcagcccaggggtggggcacaggttgaggtctcaca 

tgtggctgcatcaggatcaatgagcttcccacagagaaaaccctggtcacaggaggcctctgggagccagggcaggcccc 

caggcccccagcctcaggtggggtcccaagaggattggtgggagtgtgcatggctggaagtgtgacttggagccctggcc 

taccaggtgaatcagctgcagaagggacacaccacaggggaggagaaactgactttcacttctgcccaggtgacgggcag 

gcttgggagtggggaagaggcctcagaacaaaggtgggggagtcagaacatacgtgtcttggaaatggaggggatgctgc 

caagtccagaaagtctcccctagtcctctaagccagccgccagctgacacagggattccctctgccccgagcagaacagg 

gctctcctatctccc.gaggggctcctggttccctgcacactctccccaacctccccttggtcccaggcaccatcctctaa 

atgacagcagtctccagatgccagttggcactgtgggttagctgggtattatatctgcccgtgtgtgttgaatggatacc 

tgcgtggtctctttctgaaaagccatctccaccgaattctcgcccatgctctgcttacaaacacgcctccttctgcaagg 
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cccggagtggactcagagagcctctcccctccccacctcctgcccccattcctctcccgctccccacaggctggctccag 

accacaagagcgtgggaacctcatggagagtgggtgcttcctctgtcttcctccctagcccttgctttagcacagccagg 

gcagagaggaggaaagaaggaaactacagcaaggaggatttagggacaattctagaagggatttccaatttggaggggca 

gtctgggaagtaggcagcctgtgaacttaataggaggtcctaaaggacctgggggaatttggggagaagacgaggtgggc 

tgccttaccccaatcttctggcccactcccagctccagacccacctccaggtgtagcaggcccccaggcccaacagcagg 

agcaggaggcccaccagcagtcccaggacccagagcagctgctggaactgatgcaggcggtgcagggctggaacacagag 

cgggactcagagcagccacagctgcaggcccccagtgcttccttccaaggggacccacccaagatgacaccttctaactt 

ttgctttatcgctgaagctctccacacaagggtgccccgagcaatcagttcagagtcgacaggcaaatcatgctgcagca 

ggagggatacagggaaggaagcccctgggcctaagagctgggcttttgtgcagagtggcctgggagatggagccccctcc 

ccccacctctgaccccaaagtaggtggaggtagaggcagagcccaggacatttgaggcagaacagatgtattccccttgg 

tcgctgggcctcagcgcctcgatgtccaccctcagggcactgggggaagccaggacatggatgtgtggctctgcaggagc 

accctggggctgactggaggccaccagtcgactgccaaggtggagagtcaggctggcgagcggctcgctgtccactcggc 

aatccaggatgccccggaggccaccctcaggctccacgaagaccatcatggtgggcgtcttgggagggtctgtggggagg 

aagggagtgtggtgattgcagaaaatgaccacaaatttctccctccctgtacccatgcacagtgactttgctgctcctcc 

cattaagaggcagaacctatttcctcactccttgaatctgtgactcagtttgaccaattgaagaaagaagtgatgttgtg 

caacttcagagccagacctcaagaggccttgtagcttctgctctacttgttggaacacaacaatgtgggaagcccaggcc 

agcctgctgggcacatgtggcccaactggcagccaggagcgtgcagtcatctgagaccactgggcaactgcagccacatg 

aggtgtgaacaacagaagaactccccagctgagcccagcccacagaattgagcagataacatgacttttgtttgaagcag 

taagttttgttgtggtcaatagcaatagctgactgatacagtgtaggccttggtgaaggctggagtgggagaccaagact 

gatgagggcttatcgtgtgccagacacactcagcacgctttatttcttttggttttttctcttcttttcttttctttttt 

ttttttttttttttgagacagagtctcagtctgtcacccaggctggagtgcgttgtggtgatcttggctcactgcaacct 

ccacctcccaggctcaagtgagtctcctgcctcagcctcctgaatagctgggattacaggtgtgtgccttcatgccgtgc 

taattcttgtatttttagtagagacggagtttcgccatattgaccaggctagtcttgaactcctgacctcaagtgatccg 

cccacctcagcctctcaaagtgctgggattacaggcataagcccctgcaccagccatttatttcttatacaacttgcagg 

atgaggcgaatgatgtattccccattttacaaggtcacccaaccagacagcgggagagccaggattccaacccaggactc 

aatatccgtgaagccttcactcttggcagacagaagaggaggaagaactgaggtgggcgtgtgagatgtggggagactaa 

ggcggaggagggggttcacactcacagagcacacggagcatgactggagcagaggcagcagccccagtggggagctcagc 

caggcagtggtacatcccagcttgagcacgagccacgtgggtgaaggcgagagtgggcacaggctccgcgtgcagccgcc 

ggtcattccagaaccatgcaaaggtggagttgcccacaggcccagggccacccaggaggcggcagctgaggttcagggca 

gcgccctcaggcacgtccaggccaggctctgccaccacgcgtgcacctgcgggcggaggatagagagatgattggggatc 

tgtaggccttgggggctggagcctctgggtgggacctctgaggggcctctgcacattgtgggaaggtctcagtctgactt 

ggtatagggctttccaaggtggaaccatgccccctccagtgggagctgtgccccctccaccagattaggcttctcaaggc 

agagctttctcagaccagacagcccgctctagtgggagctccagcccctctcgtggacttggacctgagagcatcagagc 

ccctcctcctcccctctcttccactcaccttctacctgcaaccgcccgatggtgctgattgagcccagcaagttttgggc 

tgtgcaaacataggtgtcatcacctgcaggcacatcttgcacctgcagccgtagggcgtttcgggccacctggacatggc 

ctgtccctgatgccaagctgtggaccccgctgctcgtggccagcaccttgccatcatgagatagggccagctcagcaggt 

ggctcactgtccacagtgcactgtatcacagccatggatctggccctggagtcccggaaggaggacaggacagcgtcctg 

aggggcatctgtgggcaggcagggcacagatggggaccttgcttaggcaccctgtactgctcattgcctagctgcctggg 

gttggtcaggctgaggcctctggaccaatggccacttcctagaagtgacaccttcccaggagtcctgtgggcagagtgct 

ccaggcagggcttataggactgtctctttctctagcacctatgtcctctttccccaactgcccattcctgggcccactgc 

agtcctttgcccactcccaccagagcccagaagcaccctccaccaccctcctcctgggcagccctggccaaggaccctct 

gctcacagaggacttgcagggcagcaggacgggagctgcgggtgccccgggcatcctgggcctggcaagagtaggcgcct 

gcatgagcccgcgcggccaccaggaatgagagtgaggcagctggaccctcctgcagccaacgaccgttgtggtaccaagt 

atagagtgtgggtgcgtgggcagcagggtccgcacaggtcactgtgatgggggcaccttcaggcacggcagcctccggag 

ccaggatcacccgcacacctgtgccaaggaggccaggatcagccgggcccagccggacaccacagtgggcttccatccca 

gtgggcttggcctaggccctgccttaccctccagccgcagctccagggacgtgttggcctggcccagagggctgcgggca 

gagcagctgtagaaaccctcatccctgggctgtggccctcgcagctccaggcgcagggtgttggggacagaggctgctgt 

cgaggaggccaagaggcgaccggcgtggctgagggccagctgggcgggcgggcggctgtccacagtgcacagtaccaggg 

ccagctgcccgccatggctctccaggaggtaggtcaggcgcaggttgcggggcgcgtctgcagggcatgagaggcttaca 

cggagtcacaggcggcagcctgcaggaggccagtttgcaggtggcattcaaactcaggagggccctggctccccacccct 

atggctcccaaccacccagtctgagg'cactgctcctctgcccagacaggactcaccccagcgccccctactcttaatgct 

ccttgcccagtgctccactagctactgggtaccgagttccccactggacacctgtcgggttccggccatgccgtgatccc 

tgtgggcttctgtcattcctgtgagtcccaccctgcatccagccagactcacagaggacgtccaaggtgataggtctgga 
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gaggcggggtgcccgaccaggggggcccacaccgcagcggtaggaggtggcatccctgactgtgacgttgggcaggggga 

tggagtgggcatccaggcgctgctgcccatcctggtaccatgtgtaggtgagctigggccgggtgagtggtccacacaagg 

caggtcaggttcaccagctgcccctcccgcacggtagccccgggccacacctgcacattcacagctggggagagggaggg 

cacaggacaccagtgagggtcttgaggctgtgtacagcaggccacctgtctccatgtgggtgagctactcctactgctgg 

ggagccccttggcctttgggagccttgggtggcccacctataaatgtggcttagaactcaagcttgggaatagcccactg 

gctcctgagtggtcccaagtagagttccacagagtcctggaaagccctgagcccctccctgaccctggctgggggcaaat 

ggggagaccacagacctcccctcccctgctgcactttctcctgaaattccctgtttagttttatccactgggcatacttg 

taaaatttggtgtttataaaaggttctcctaagaaactcaagtctggagtccaccatcgttaggaagggtgggaatgggg 

actattcccgtgcccccaggcttctagaaccctgtcccacctcctctcctcttttaaaggacacacacacacacacacac 

acacacacacacacacacacacactgaccttgagcgtcgaagtcagctgaggccgaggcctggcccagggtgttggaggc 

ctcacagatatagacaccctcatcctccagcatagccccgtggatctccagacgcagtgtgttgggggccacagccacat 

gcagcctgggagagctgccttcgggtccccccacaccttgtagggtggaggccacaaggcgatccccgtggagcagccgc 

agctgggccggagggtcactgtccacacggcacaggaggaggcccagtcgtccaggacctgtgtccatcagggtagtgag 

tgtgacgtggcgtggggcatctacacagggtggggagtggtcaggacccagccaggggggtccctcatccagggctctgt 

catgtgacacagcccaggactgggggactgcattccttcccccacacctggatgggacaaaagtccttacaggacacgtg 

gaggctgatgggtgcagctaggctcgtggtggctgagcctggggcctgggcttggcaatgataggccccagcttgtgtca 

aagttatggctgcaaagcggagcgtggccgaggtcgactcctggaggggctggccatcccgataccaacgatatgaggtc 

ccctctgggactcctgtgggtacctggcagctcaggaccacagcctggccctcttggagctcaggtgat-.ggtgacacctg 

gacccaggctcctgcaggggaaaaccaagagcaggtgagggctctccaccacacctctcacagtctgggaccatgtgcgt 

gtccacctaggactacacaacccaaccccatgtgctggagccgcagccccttccagaccaccgccagggcccactcagcc 

ctatgccttggcccctcctgctccccactctcctgcacaggctccatcctggcatttacctcccaccacctggtgggatg 

gttttactgtgtcagtcatatctcgcctaccaaagcgtaagccttatgggcactggacttgtgtgacctgtaaccccagt 

tccaggcactgtgcctgcatgtgttagctcctgacaaatgttggttcaacccataaagtactgaaaagggagggatttag 

atcatcccagggccattgtcctaactaggatctggactcagcatgggtgaccgagggcttggaagggaatgttagcagct 

atgttatcttgagcgctctcgccaggccagccctgcatccagactccaggccacaagagcacaggacctggggaccaggc 

aaagggcagctcctcagtggcccctgggtgtgaagcaggagaaccccaggttgcagggagtgagatggagggacagctgg 

aatgctaagcaagaacacagaccatgcctgagaccacagctagcagggtcacgcaactggggaaaagctcctaaaactca 

aaccttcaatttcttcctgcgaaataggtatgctaagtaagaaaagatacacagaaccaaggctcaaagtaaatgtccca 

aaattggtgctgtcggtcaccgtggtggaagaaacactcatgttcctttgaccttcgcattatgggcattatctttcctt 

tttcctggtttatttgtttagggagaattttttttttttaagacagagtctcactctgtcatccaggctagagtgaagta 

caattctggctcactgcaacctccatctcccaggttcaaacgattcgcctgccttagcctcccgagtaactgggattaca 

aatgcccaccaccacacctagctaatttttgtatttttagtagagtgttggccaggctggtcttgaactcctgacctcaa 

gtgatctgcccgcctcagcctcccaaagtgctgggattacaggcatgagccatggtgcccggcccgagggaggattttta 

tagagactatatatatatacacacacacacacacacacacacacacacacacacacacacacgtatgtgtatataaataa 

agaatatgaatatattatgcataagaatacacataatgtatatttgtatgtgtatatactcatacataaacatatatgca 

taatataaatatacatatgtatgagaatgtttactatgtaacaaaattatatgatcgcaaaacaagttatttcagaaagt 

gctcccaccaagcaagccactttgtgaattgccattgcctcctagtggctgctgcagttattacaggtgaaaatatctag 

ctggaggcaaagggaggccttctgctggctggctggaaactgcccttaccaggactgaaaaagacatatttctgcacaaa 

attcacagcacccagcccaaagtctgaccagagtagtccccagggcatcagagtctaccatatcttcagaagactgacac 

tcacctcggacctggaagaagagtgaggtgttggatgatccaagaacatttgtggcctcacagcggtagctgccagagtc 

cccaaggcccagttctcggacctctaacttcagggagttggcctcagccttagcctggaaccgaccatgggatgggacct 

ggggacccaggctggtggccaggaggtgctccccatggaacaaggccagcaaggccagggggcggctgtccacagtgcag 

atgaacagagccatgtggccctggcccatgtctaggagggctgacagctttggacggtccgggggatctgcaggaacaga 

gggagctgaggccacccagcctagctcaccacattacaactgccacactatgtcccccacctcctgccacacacacagcc 

taagccatgctctttcctccccaaaccccagcccggcccctgcttctccccagaccacccctccaccctccaagtcccca 

ggctgcccatgccagtcacccacagaggacactcaggagcacgggagtggagagctgggcaccagcctcagtcaggatgc 

ggcaggcgtaaagggcagcatcagttctggccacgggcagcagtgtcacggtccccaggggaccctgggcccacagcacc 

ccatttcggaaccaggagaagttagcagggctgccagcagcttcccggctcacgttgcaagtcaagttggcttctgtgcc 

ctcctgaagtgtgtgtgatggtgcaatggccaggacagtggctggagagcaggcggcacagcttactgaccacccccggc 

ccccaggagccaggggtccagccgcccagcctggaaggagcaggaaggaggccccctggggtgcccacagggttggggag 

aaaagcaagctagctcacagggcagcctaggagaagtgggccctggggactgtgggggccctgggcagagagggttgcag 

tacagggaaggaggacaaggcagcccaactgtcccagctggggagtccttcctcagagaccagctgtgttgcctatcccc 

ggcctgaacagagatcatgggaaggagcagctccgctcttcctgaatgtggaggagggcagcgaagggcccccactgctg 
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cacagagtggttttgcctgattagatcctcctcggagcagaacagtccagagctccagcccctgcccccaggccacccat 

tccctgcctgactcaccctggccattgaaggtggctgaggtggaggcgttgcccagggcattgctggcctcacagaggta 

caagccctcctcttccagcaaagggttgtgaatctccacacgcagcaagttgggggctttggtgaccttcatgcgtgggg 

aacagcccccacaggtgctgcagccaccccctgatggcagggaagtggccacaacacggtccttgtggagcagctgcagc 

ctggcgggggggtcgctgtccacacggcacaaaaggaggcctcgccgtccagccccggccccagcggcatcaaggtccag 

cctggtggtgaatgttggttgtcgaggggggtctgcagggaggaagaacatgggcactcatcccacggatgctccagggc 

cccacaagcctggctaggctcccagaatacactggacaaaggcagatcccgaaggctgccccaagcagctgtgcagactg 

tgcactgcacaagggcagccagaccagggtgggtggaactcaagctaggctcatgctccccaagctaagctgtgccctga 

tgcctactgagtcttcccagaagaaggagcaccattttctagctcagacaaaggtgctgtatgggctggctgccaccttg 

cataagtcccatctgctgctgagcagggtgcccacctagcaatccctgggggccatgctgtctgctcccaccctgcaccc 

aatgccttaccacggagcctcccctcaagatctgccctgcagcttatctgtgatctccaagccagctcctgtgatactag 

cgaatgcatagaacggcctcttcctgaaggcttctggagacagttggccaaaagagagcaatcagccaaggcccccatgt 

gatctcactcagacaccagaaaactttgcagaggcagctgccctctgacagttccgggacgtggctccggcacacatgtg 

tcaggggaagcctctggaatcaggcccttggtcttacacggagcctccccgcttggcggatgcagaggggctgtctgtcc 

ctgtgctgaggggctttgccttggtgtctatgggagcccagcacagcctgtggggcatgtgcacaaatagaggagtgtgg 

tgtcccagggctcacccatgaggacaaagcatggatgtggtgcccatgctgagcttgaggctgggacagaggagcccaca 

gggctggcattggagggtaaagacatggggaggtagagaaggccccagcctcatttcctaaacttcagccccacgtggaa 

agacccactgtggccaggtctgtgacccagccctcccgatatagaccttggggcagggcaggacagggctggggggccta 

gaggaggtgg93tggctggtgggtcccggcaggaggctgcaacaggtggtggctgctcacagagcacagtgagaacagct 

ggcgaagaggggccactggcactgtggccgtcccgggcccggcagtggtatgagccggcgtcagtgctggaggccgcggg 

gagcaggaggctgctgccgggaccctcgtgaagcagggctccattcaggtaccaggagaagcgggcatcaggtgtggggc 

ttaggccgcttctgcagctcagtgccactgcctgtccttccaccacctcggctgccgggccgatgaggagacgggcggct 

gcggggagaggaagaggctgggaagggcccctcctctcaaccccacatgctgccccatatagaaagccttccagggttct 

cctttccccacactactgtaggtagctctgggctttgtggtgcatcaggagggtttgccctttaatgccaaatcagatct 

attggtagtagatggctgcagcatggttgaagattcagagccagagcccaggcttacgtccaacacctggcttggctggg 

catggtagctcatgcctgtaatcttagcaccttgggagactgaggcagaaggatcgtttgaggccaggagttccagacca 

gcctaggcaacatagtgagactccatctcaaacaatttttttttttttgagactgagtctcactctgtctcccaggctgg 

agtgcagtggtgtgatcctggctcactgcaacctctgcctcccaggttcaagcgattctcctgcctcagcctcccaagta 

gctgggactacaggtgtgccaccacacccagctaatgttatacatgtagtagagatagggtctcaccacgttagccaggc 

agatctcgaactcccgacctctggtgatccacccgcctcagcctcccaaagtgctgggattacaggtgtgagccactgtg 

gccagcttttttttttttttttttttttttttttgagacggagtcttgctccgtcagccaggctggagtgcagtggcgca 

atctgagctcgctacaacctctgcctcccaggttcaagcaattatcctgccccagcctccctagtagctgggaccacagg 

tgtgcgccaccacacccggctaatttttgcatttttagtggagatggggtttcaccacgttggccaggctgatctcaagc 

tcctgacctcaggtgatctgcctgcctcggcctcccaaagtgctggaattacaggcatgagccaccatgcctggccacaa 

ttttttttttttaattagctgggtgtggtgacatgggccgtagtctcagctatttgggaggctgagatgggaggatggct 

tgagcccaggagtttgaggctgcagtgagccatgaacataccattgcactccggcctgggcaacagagcaacaccctatc 

tcaaaaaacaaaaagaaaaacctggcttgatcaattagctaccatgccctcaggaggagggaaggacagtgcacataccg 

aaagttggaagaccgtacttttctttCttctdtttctttCttttttttttttttgagacagagtctcactcttgtcaccc 

aggctggagtgcagtggcgctatcttggcccactacaacttccacctcctgggttcaagcgattctcctgccttagcctc 

ccaggtagctgggactacaggaactcaccaccatgcccagctaatttttgtatttttagtagagatgggattccaccatg 

ttggccaggatggtcccgatcccttgacctcgtgatccacccacctcggcctcccaaagtgctgggattacaggagtgag 

ccaccgcgcccagccagaagaccctacttttctatttggcttcccacatctgactgctagcatagagcctgcccccagag 

tttcataattaaaaaacaatgaatgcttctgagggactctccaagtttagggtcagggtaggtgcaaaaggaatgatgtg 

acctgttgtgtttccctttttcccttgacttccaggaagctctgccgttgggtcactgcacagcccctgtcttttatgtg 

gcgtagccagttagctcagtcctgcggttgagtccactagacttctagaaggaacagactggagcaggctcctcctcagg 

ctccctccactctccctggctgccagtgcccatcttaccactggcatggaagtccagggtggaggttgcatttccaaggg 

agttggtggctgagcacttgtactccccactgtcagtttcctccaggtctcggatctccaggcgcagggagttgggacca 

gaggtaccactgaagcgtgggctgtgatcactgtccccggaggtggaggccaggatatgacccccatgtgacagcaccag 

tgtggccaggggctcactgaccacagagcagtgaaggatgcccacaagtcccgcctgggtctccaggaaggctgtcagga 

ctggagtgagaggcgggtctgcgtggagacgagaggtgggcctgtcaccctcagacaagggcattccctggataccctga 

tacaacccgtgacctctgcaccgctttgtcccacactgccctgcgagaagggggtgatcccaaagtgcctgagtgcctgc 

tgaatacacttttgtccttggctgggctgggtacgtcactctgttgtcccagtcagtgttcaaggccaccctgcaagtgg 

ggataaaagccccacttcaaagatgagaaaactaatagagagacctggtgagggacagcacctcagccaggtgaccaaag 
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tgagcatcatcagcaatgggacaagctgacacagagtgcctcctgacagggcagagcagcatgtccgcggccttcccacc 

cagagggcatggtctcaggccattcaagtcaggcaaatgtggagtgagggacctcotcacaacagcaggcctgcactctg 

caagtttcaacgtcaagaatgacaaggaaagactgaggaacagtctcagactaacggagaatgaagagacgcaacgaccc 

aatgcaatatgtgaactgtgattgtattctggaccagaaaaaaaatggctacaaaagacagtattaggtccactggtaaa 

atgtgaatatagattatagcttagataacagtcttctatcagccaggtgtagtggctcatacctgtaatcccagtacttt 

gggaggccgaggtaggtggatcacgtgaggtcaggagttcaagaccatcctggccaacatggtgaaaccctgtctctaca 

aaaacacaaaaattagccaggcaggatggtgggtgcctgtaatcccagctactcaggaggctgaggcaggaggagaatcg 

cttgagccccagaggcggacattgcagtgagccaagatagcaccattgcactccagcctgggcaacacagtgagactcca 

tctcaaaaaaaaaaaaaagtcttctatcaatgctaattttcctgatcatgatcattgcattgtggatatgtaagagaatg 

atcttaggaatttagcggtaaagaggcctcatgtctgcaacttcaggaatataaatatgtaggtagatacataaataaca 

tatgcatatgtatatagagtgtccatatatgtataagtgcacatgtccacatagagtggatgtgcatacacacaagtgca 

cctgtatatatgcaagtctatatccatacatttatatgtatgtatgtgcgtgtgtgtgtctgtgtagacagcgagaaaga 

ttaagcaaatgtgccaaaatgttaataaatgggaaatctaggttaaaggtatataatcattatttgtctttctctgcaac 

ttttcataagtttaaacttccaaatataaaataggagggaactagggaggtcaaagcattgcccaggtgtgcagagctgg 

gacatgagctcaaggccacctccaggtaagtggccttgaagttcccatgcccaggaccccaacccttccttggggctcca 

ccatctggtcctgctctgactgtgtccagtgccaccccaccctggcctcttacggttgactaccacgctgacagggcccg 

agcgctcgctgccatggacgctctgcacctcacagaagtagaagccagtaccagccctagtggccaagtgcagccggagg 

gtatgggagtgggcatcctccagcaggacatggttcttgtaccagctgtagcggagatcactgggtgcctcgttgggtgt 

gttgcagactagtgtcactgtctggttctccaggatgggacctgctgggctcacctggacctcagccactgcaagggcag 

catagggagtgctggggggtcccagccaacttccagccccagccccatacagtccccgggtctcagccaatcagcctcaa 

tctctgcacatcctacaccacccgcctgctgctacaggggagcctcctggagtgctctgcatctctttgtcctgctcacc 

aggatccccctgacaccccacccttcgtgggggtatcgtcacctgtctcatggtaaggcagggctgggactccacacctg 

catccaggaagcactgggttatgccagttgctggccctgccctgccctgtctcccctccgtccctgaggcctgagctcct 

ccctattctctggccaatacgcaaccttcccaagaactcactgaagatgtggaggctgatggggggtgagaccaaagagc 

ccacgccgttctcagcttggcaggtgtagacgccagcatcgctccaggctgcctggggcaggtgcagcacaccagtcttg 

gtttggaggcgtaccccatccttgagccacttaatggaactgactgcagggtagctgctgttcacctggcaggtgagtgt 

gaccagctcacctggaaggatgttcctccccgaggggctgaggaggatcttcacacccctgggggcatctgcaagtcaca 

gtagggggtattgggtaaggtgcttggggagggcagaggatggcacacttcttcttgcccccctttaaaagctcagtcct 

aaggaagtatgcccagataaaacagcagtccccttcaatcctcacccagggcatgctctgtctgcccatgtccgtccctt 

tcccgccccctgcgcctgatgcattcctatgcccattgagagctgatcatgtgacgcttggccagcgtccagcctacccc 

accagctgtagttttctgcttcccaatgctgtccattgctcctgctctatttgggatgagctccacacacagggtcatgg 

gtaccccactgctagctttagtggcctgtggaagctattggtaagggaccaactacctagtgggagggggccaaaggcag 

catcaaactagctctgaaaatagttacccagtttgtaagcaagaggccaacaacacaaagaactgcatttccttaaatct 

tgttccaaagcctctctctgtgtattcaagtgttttattcttatttttttagaaagaggatctggctcagtcagccaagc 

tggagtgcagaggcacaatcatagctcactgcagcctgaaactcctgggctcaagtgatcctcctgcctcagcctcccga 

ggaaccgggactacaggtgcaagccaccacatccagccaatttccgttttttaatttttttgtagagacagggtctcact 

atgttgcccaggctggtctcgaactccttgcctcaagcaattctcctgccttggcctcccaaagcactggggttacaggt 

gtgagccactgtatccggcctcaagtgtttaatttgtgccaggcactcttctaaatccttgacctgggtcatctccttta 

tttgtttttgttgttgttgttgttttttgtttgagacacagtctcactctgtcacccaggctggagtgcagtggcacaat 

cacaactcaatacaactccacccccggggttcaagcaaccctagtgccccagcctccttagtaactgggatcacaggcat 

atgccaccacacccggctaatttttgtatttttagtagagacgaggtttcaccatgttggccaagctggtcttgaatccc 

tggcctcaagtgatccacccgcctcagcctcccaaagtgatgggattacaggcataagccaccgcgcccggcccatctcc 

tttaatttttatcataaatctgtgagacaggaaccatctattgttatcttcgtcatagactgagaaaacagagccaggca 

ggaaagggataaatcccacctccccagcatcctccagcacggggtaaatcccggggcacttgctgccaaccctgaagctg 

catgggagctcctgttccccaaccctttgtgtctccctttctctgcccagcctggcctcaaggacaagccccttcgaagc 

agtaggaggttgagggaaccatttaacgaagtccttggggctcagcagtgtctctccacttgccctaccctggaggccca 

ggagcagcctggttttgcatcaggagcaagggttccgtttctgcgggctggagaggggctggcttctgtcaggagcaaca 

gatgcgctcagccacaagggtgtgtccccagacaactcacacttcacttggaggtgaatctcgctctgagccctgtgatt 

ggccacggagagctggcagcgcaggatccggccgtggtcctgccaggacatggccatgtggagggtctccaggtggccga 

cgccggtgggctcaaacttctggctgttgaaggtgacagagcgagcagggtcctggccttgccactgcagtctgacctgc 

tcctgcaggcatacgtagggagtggagcagttgaagtccacctctgtgccctcgagaagctccaccggggaggcaatggt 

gggcaccctgggctcctctgaggacagagacagcagtgctcaggacccgcttt tgccacccctgagatccctcgccctgg 

aaaccccagctgaggagagagccctggggagggggcttttaggggaaggaaagacgctttctactccagccccacttggg 
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tgaatacaagggagaaccaggcctgggcctacgccggggctggaacagaggctgagactggctggggttagattcaggac 

aagggctggggctgagagccaaggggtccagaagcagcttgggaatccctcccggggggcagccaggccaccccacttat 

cacctgttactgtgaccaaggtgcctttcacatctgaccagcggttgacctcactgatctcgaagcggaagttgtaggaa 

ccagagtcctcgggctgcaggtccttcagcagcaggttgcacaccctgtgctcggggttccccatgaactcggtgcggcc 

gcggaagcgggcctccaccagcttggggtccgccgagtggctcaccacctgccgctggcccgagtagtcgtagtaccaga 

tggccgtgatgccgtcgggcacctccacgtcggcagggaagctgaagatgcaggggataagcaggcaagaccccttcaca 

ccctgcacgtcctggggactggagacgccccatgaggcctggcctgggggaagaacggcagggggacagaggggagggtg 

atacaggcctcagggtgccacagagccctgcgacctgccccagagaaggtgccccagctgggctcccaaattctgccctg 

ccccgagaaatgcacacttagagcagcccttctcagtgccccaggggtcagtccactgcccgaggctgctcagaggtttg 

gtagggtggctcgaagacagatggcagcttcctgcccagctcctggccactgcccgattgggccctcccttgacctcagc 

aaccaaacatgtgacccaggcatagtctaatctgccaagggctggacttatgaaggctggaccctggaagacaggcacta 

ggcccgcaaagcttacctgctgggaagaatgaggccaggaggagaagcttgggcaagaagcccatagcaggttcttgtgc 

tgctcctgttgcctaagagggtggcgcgcactgcgctggctgggctcacaggggcctccagggacacctctgggcacttt 

agccccagcacctgctagaagtccgagcctgtgtccccacctcctctgctggccaacccaacaagagggcagggctctta 

aagacctctgagtcagacaccagcagagagccaggaggccacgttcccagctcaggctgtgcccaggaatgccctcactt 

ggtggcctgcctcagaaaagcctgtgtgtcccttggccctatcccaagttctgctttcccagcccctcaaggataccacc 

ctaaggcagatgaacagctgtttctccctcttccccacttccctgccccctccccaccacccaaaccaacaggaactgga 

gcccagagatgcccagttactcactccaagacacccagctagaatgatggtttcttcctgaggcttgtctcctaccacct 

gccttactaactatagaccataatggggctctactgaacttgccgaagtgctgcctttaacagtcactcccctgctcaaa 

aaccttctgtggctccccattgcccgtgagatgtgaaaagtcctcatttcctgcccccagctctgtccccattccctgct 

ctccgcagacccactctgggggcagttctgtctgctcaagggctccctagctgcccagctctatctccaccacagataat 

ctttgcctgctgaaaccttattcaaccttcaaggagcagcatgaatttggcttccagctggaacttctcctttgaggttc 

ctgtagctaccccagagctatctctattttccttgccttgttttttacagcttgtgagagcccatttctcaggacctaga 

actgaagatatgtgccccatagcagtgcggagcctaccaggcattcagcaaaccccttagtgactaagagaggggtgagg 

tctttaggggttcagagctgaggttcagagttggagtggggaggtggcaaggcaagtctaggtttgaaaggtagcatgag 

agcgctgtggaacacataccccacaaatatgagctcaatgtgtgcggagtgtaccatatccaaaaaggcaggccctcaac 

catggagtgcccctggtcagggagtgtctaaggggtaccatagacctgagcccaaaaggaagagatgccagaaacacata 

taagtgaaactataaaactcttagaagaaaacaggtgaaaatcttcatgaccttggattaggcaatgctttcttaagtat 

gaaattaaaagcacgaaaaaaaaaaataggtaagttggacttcatcaaaatctaaaacttggccagacacagtggctcat 

gcctgtgaacccaacactttgggaggctgaggcaggaggatcactttagcccaggagttcaagacgagcctgggcaatac 

tgcaagactctgtctctacgaaaaattaaaaaacaggcctgtggtcccagctactctggaggctgaggtgggaagatcgc 

ttgagcccaggggaggggtcgaggctgcagtgagccatgattgcaccactgcactccagcctgggtgacagagcaagacc 

ctgtctgaaaagaacaaacaacagctgggtgcggtggctcacgcctgtaatcccagcactttgggaggctgaggcatgca 

gatcacaaggtcaagagattgagaccatcctggccaacatggtgaaaccccgtctctactaaaaatacaaaattagctgg 

gtgtggtggtgtgcacctctagtcctagctactcgggaggctgaggcaggagaatcacctgaacccaggaggcggaggtc 

acggtgagccaggatcacgccactgtactccagcctggtgacagagtgagactcttctgtctcaaaaaaaaaaaaaaaaa 

aaaaggatatgaatcaaaggacactatccagagagtgaataaaaggaggacaacccacagaatgggagaaaatatttgta 

aatcatttatctgataaaggactaatatccagaatatataaagaattcctacaatgaacaacaacaaccaccaaaaaaac 

catgaaatccaactccaaaaatgggcaaaacacttgaataaacatttcttcaaagaagatatataactggctgataagga 

catgaaaagatgctcaacatcactaggcattaggaaatgcaaatcaaaaccaaaccacagtgagataccacttcacatcc 

gttagaatggctattcacaaacaaacaaagcaacacagaaaacaataaatattggtgaggatgcgaagttgaaattcttg 

tgtattgctggtgggaatacaaaatggttccgtcactgtggaaaacaattgggtcattcctcaaaaagtcaacataggat 

taccatatgatccagcaattccactcctaggtatatacccaaaagtaccgaaaacagggaccctaacagagtacaccaat 

gttcacggcagcactattccactaaaaggtggaaacaggtcaagtgtccatcagtgaatggatgtggataaacaaactgt 

ggtatgtacatacaatggaatatcaatcggccataaggaggaatgaattctaacatatgtgaaccttgaaaacattatgc 

tcagtgaaaccagccagacacaaaagggcaaatattgtagggttccaattacatgaaatacctagactacgtatattcag 

agactgaaagtagaataggatagaggtaaccaggggctgcagggagggggagctaatgtttaatgattgctgagtctctc 

agataatgaaaaagttctggaaacagtggtgatggttacacaatattgcaaatgtacctaatgtcatggggtgtacactt 

aaagacagttaaaacagtaaattttacattatgtatatttcaccacatacacacccatgttgccaactttgcaatcctcc 

ctggtcctaaatgctgacttggccaagtgaacgaggaggctggaagtggggacaggaaactcatgacctcccagctccca 

gcccatccgcctcaggggctgggctcagcagattccaatgactaccaggggtcacacctgggaagggggtgagccgaggc 

ccagggccagtcaggctgaccaggtgggacttagcctgctgcagaaggcagaaggtgccccagcagggggcacagtacag 

ggcgggattgggacaggaaggacaccgctccccaggggacccagccctctcgcaggctgctggagtggactgatctggcc 
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atttatggaggcccaagggctcatctccagttctctaggaagccctaggcctcctcctcttctgggaagatgcaccccca 

gcctccacaccaggttcttggccactggagaatgatatagctggggccctgggacQtggacacctcaccgtgaagaaaag 

cagcctgctgggcacactgggggtcagatgtgtccctggccacaggggatgtcagggtcagctctgctatggccagggca 

gctatcttgtcccagctcccctgttcctcccatttggggtcctgaaaagggcaatcgtgaacctgatggaagaatggtgg 

ggttctggacacagcgaccctggaacagggcgcgggggaggaccctttccaggaccactcccatcacataatgtagaggc 

cacctatgcttagcccggccctaaccccaaaggggtcagccccaccggaatccagcctattggctcagcctgtcaccaca 

aaggccagcttcagcccagataactgttctggaaacagaaagagcagggaccgctcagaaaggagatctctgtccctgtt 

tgaaagcctggagttgaggggacagtgccccgccccccgccccccgcaacttgggttgcagctgtggcctagtgagcacg 

cagcgccccctggtggtcgagggggaattgcgggtcccgggaaagggggcggtgtgccagcaacagggagcaggcagctc 

tgcagccctgaaccatccctcccttgggtgactcttttggaaatcattgttccccagacaggaggttcctgaggttcata 

ctcgggcccccaagtcttgggtgctctgaagacaggattttaaatccccactcctactattggtatgtgttgcatcaggg 

tggcttgagctggcctggtacacagtgggcactcacgtttgctgcctgtttgagaccaagtgcctcaggaggtctcggag 

ggttggctggggccccaagtccctgacctctgattccagaggccaagtttagctggggaagaaagggcagaggcagtttc 

cctatggacagctaggcccgggtgtaggattcagtttctgtttcctgacaccaaggcttctccccaacttccccattggg 

ctagagaaggaagaacacagggtgacatggccagctggaggttactggcccacagatagggagtcagggtacggatgggc 

aattcctggagcagattatggtcaaaatagtggaaatccccaatcacaggccaaatgtttaattctcagagccatagaat 

ccataactaatgcttattggctttgacatgggcagtagaaatttcacacttcattccttaatctggctctaaatgcttct 

ggctggagtgctcactctccaaactgtgctggacagcaccagaagccctcctgtggacggacgaagtggtgatggatgaa 

gtggaattgtgctggggttagagtaaggagagattatcgttgggctgggctggggctgaggtcagggtgatgatcaggtt 

ggaattctggaagcaaattaaggctgggattggggtggacattgaggttgagtgatgggtgggggtaagggtgaaggttg 

gagttgggtgcaggtgatggttaagatacggtggaggctgggttgagatgaggatgatagagtcggagttgtggttaggg 

ttgggatgatggatggttgggattagatgagatgagtaaatggttagggttggggtgcaagtgaggtgagggtgagataa 

ggttggaataggggctggggctggggctgaggtagggtcagagcagggtgatggtcgggactgggattgggatgcaggtt 

ggaaggatttgggggtggtggaagggtttcagttgagtccatctttgattagtgtcttggacttgggttggtttggggtc 

caccactcgcacccagatggagcccccccccgacccctgcccctatcccgctcagccagtttcagcccagccccgctcct 

aatgctccactcaccctctgggcccagggaccagggacaggggtacctgctccacagaaggaagtggctgcggcggtgct 

ggacctgcggagaggagaacaggaaggacggccaagagctcctggtgcagctggctccccagggctctgccggctcaaga 

gagaaggatcccgtatcaggggctgcttcctctttcccaaagcctcagctctactgtccaacccagaggctggtcaggga 

ggcagctgcaggccttgcgcaatgccaaaacgggaaagacctccataggggaaggccctcggcaaggccagggacctagg 

gactccagcaagcagaagtgggaccgctgcaaqgctggagcctccccaggcaaagtgagaaatggagtgggggactccca 

ttcaccagccaaatccacaccccactctctctgagccccttagggaggctgggggaggtggaagggggcttcctgcacac 

agctctcctccctgacacctgagagggaggcgcgcccagcctggggtggggatgcactacacgatgcccagaccaaggca 

gttattctccaagccattcaggagcctccctgtaccactgagtactcttaagaacccccaagaggcagtcccgtgttgtg 

ggggacataaggccccaatgtccaggacactcctcaggttttctctttcccctctcactgtcctgtacgttttttggttt 

ttggcttcttgttgttggtttttttctttttgttttgtttttgctcttgagatggagtctcactctgtcgtccaggctgg 

agtgcggtggcacaatctcagctcacggcaacctccacctcccgagttcaagtgatcatcttgcctcagcctctcgagta 

gctgggattacaggcatgcaccaccacatccggctaatttttgtatttttggtagagacggggtttcaccatgttgtcca 

ggttggtcttgaactcctgaccccaagtaatctgcctgcctcggcctcccaaagttctgggattacagaggtgagccgct 

gcacccggccactatcctataccttcacccccaccttgggacaggaaaggaaagcccccaccacagctagttactgttac 

attactgcagcccatcttattgagggcctagtttgtgcaggcacttgacatgtgaagcagatgttaattgctccaagcaa 

tcccagatacaagcaacaagactttttttggtttttgcttgtttgtttgtttttgttttttgagacagtgtcttgctctg 

tcgcccaggctggggtgtcctggtgcaatcttggctcactttggctcactgcagcttcgaaattctaggcttaagcaatc 

ctcctgcttcagcctcctaagaagccgggactacaggcgcttgccactatgcacggctaattttttaagttattttgtag 

agacggagtctccctatgttcccaggctggtctcaaacacctgggctcaagtgactcccccacctcgacctcccaaagtg 

ctggaactacaggcgtaagccaccactcctggccatttttttttttaatttcacaactcaaacttaattcaactcagtcc 

ctgcctacctgtactggtggggagctggccacagcaaattaggcccctgtaccctgagggcaaggccacgtgtgcaggtg 

taaaaggatgtgaaaccctaaatcagggtggatccagaatcgcaggccatggtgccccaaagcagatgtctggtgacatt 

ccaccctgaaatgctcaggctacagagatattaggtctctatcactctgttcctctttatagctcctgtgtccactccta 

tgcttgggccatttcctctttcggccaaaacacaaagggttcatcccattacttcctccctcaacagctggtccggagac 

acccagctctaggcctgtggggttgtgacacatgggtaccaatccttcagtccactgggactctatacatccaccccttg 

gcttcatggtgggaaagcatcccttgactttgaccttagtcatatgacttgctctggccaatggatgtgcacggacatga 

cacaaactaagtcttgaaacgtacttaagcagtttgcctttccccctgaatttctgtcattgccatgcaaggggcaggct 

ccaactaacctgctggtccaaagaggatgacgaacacatgcagatgacctgaatcagacccatggattgaaacaaaactc 
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agctgagcccagcctacgtccaccagaccagtcaacctgtggatgcgtgaatttattgctggatgctgctgagaattttg 

tggctacattagcaagatgatacaaggcctaagtcccagaacaacacacccagaacttgcttacctttccttagcatgag 

gagagcaaagacttgtctaccttgattagtcagggagcactgcttcctgtcatttccttgagtatacagcaaactaggta 

aataaataaaaataactaggtaggctgggcacggtggctcacgcctgtaatctcagcactttaggaggccgaggtgggca 

gatcgcttgaggccaggagttcaagaccagcctggtcaacgtggcgaaaccctgtctctacgaaaaatacaaaaattagc 

tgggcctggtggcaggcgcctgtagtctcagctactcaggaggctgaggcacgagaatcgattgaacccgggaggtggag 

attgcagtgagccgagatcacactactgcactccagcctggatgacagagcgagactctgcctcaaaatattttaaaaaa 

tgtaatttctcagtaggtcacactgttacacattcatctaataacaattattctttttctttttttttttttttttttgg 

agatagggtctcactgtcacccaggctggacaggctggagtgcaatggcacaatctcggctcactgcaactttgacctcc 

taggctcaagtgatcctcctgcctcagcctcccaagttgctgggactacaggtgagtaccaccagacgcagccaattttt 

gtatttttttgtagagatggggcttcaccatgttgcccaggctggtctcacactcctgagagttcccagtcaagtgatcc 

acccaccttggcctcccaaagtgctgggattacaggtatgagccaccatacccagtgaattattcccgttccagatagga 

aaaccaagtcatagggaggttagagaatttgccaaagacaaaactttttggttgaaaaaaaataagttttgctacaagta 

tagaaaacaccaaataacggtgttttaaataaaatagaagtgtttcaccctctccctcaagtaagtgttggcatccaaga 

tgatatgacaactccacaatcatgaaactagatccctttttattttttggctcagtcattgtcaatgggctgcttccagc 

cattgtccaaagtggctgcttgcgctccagccactgtatctgcattcgggaaagcaggacggaggaaaggacaaagtagg 

ccatgccctctactttaagacaactcctcaaattgctgatgtggtgggtcactcctgcaataccagccactcaggaggct 

gaagcaggaggctcacttgaacccaggagcttgaggctgcagtgaactataattgtgtcactgcattccagcctgagcga 

cagagtgagatcttgtctcttaacaacaacaaaaaaggtaattctttaaattgtacactttattgctgcttatattccac 

tagaagctagaagctagtcacatggtcttaagtctttatcctaacaggtgatgtgcccgagtgaatcctgggggcccagt 

gctaagaagtagagggagatgaagccctgccctcctgtggagcaaccaggaccttaattcagtcattcattccttggacc 

tacccagattccagcctcaggcccccgcccacctccttgccagtatctctgcagagcctcctgcctcccccagagggtgc 

cagagccctctgcttcctcagccttcagacccacttgctgtgtctcaggtggggacaactcagtctatagaggcccaagg 

gagatctttgagacccatactgtagtcagcttggggcagatgggagtcatgtagctcactggctaaaagcctaggtcctg 

catagagctgggttcaaatccaggcatatccatcacctgctctacgatcccaggagtcactcaatgactcagaagctaga 

gtcctctggaaagcttgtatgaagagtccccagggcctggaacatcaaaagagctctgagaacgttggctctgctgttgc 

ttctcttattgggtatctgaaggccgtactctcctctctcctccacccgaggcgacctccaccctcatagccacaccctg 

gaccctcacccatccagtcaccccacccagcctgtcatctgtccctctggctctcctctcctgttcatgcatcttgtact 

cacccacttctgtccctgagtgtttcaaggctctggggctttcagagacactgaagggacctccctcctcaacacaacca 

caaggtctaggtggaatgacccactaagggaccggctctgaagccagagacttccagggaaagtcaacaagcccaaggat 

gcccgttacaagaaagttaaaagacccatgtacatgtcctcccgttttattccctgctcagggtctgggcatacagtgga 

acacatgcagtccccaaaggacaccgtctgtacagagtcagatggagttaagaacatttagccggctgggcgcggtggct 

cacgcctgtagt cccagctactcgggaggctgaggcaggagaatggcgtgaacccgggaggcggagcttgcagtgagccg 

agatcgcgccgctgcactccagcctgggccacagagacagactctgtctcaaaaaaaaaaaaaaagaatatttagtctgg 

gcacggaagcttatgcctgtaatcccagcactttgggaggctgaggtgggcggataacctgaggtcagaagtttgagacc 

agcctggccaacatggtgaaaccccatctctactaaaaatacaaaaattagccaggcatctgtaatcccagctactcagg 

aggctgaggcaggaaaatcacttgaacccaggaggtggcggtcacagtgagccaagattgtgccagtgcactccagcctg 

ggtgacagagcaagactccatctaaacacacacacgcacacgcacacatatttaaaccaccacaccaacatctagttcaa 

gatggtggactgagaacttgtctctgccattcctggcccatccaataccactgagagcacagtaagcaaagggaaaagga 

gacagaagggctgggaacaggatggctgggggatgggaagtatccactgcacaggattttgatttaattctagaagatag 

aaagagggaggatcacgttcaggaacagatgcgggcaaaggaaaccagagccaaagcacgctgagagaaagctgccccag 

aggccggagcagaagtggactctctacagggactcaatacaGCccaaagggttggtagctggcacacgtacctctccacc 

cccacatgtaacactgcagggcagaggaaataccctaggtgagttccagtatcaatcaacctgccctttgttcataaaga 

tgaattggttaccaggtattaccagacaggtgaggaagaccaacacaaagagaaagatccagaaacaaacaggccaggcg 

cattggctcacgcctgtaatcccagcactctgggaggccaagatgggtggaccacctgaggtcaggagttcaagaccagc 

cttgccaacatggtgaaaccctgtctctactaaaaatacaaaaattagctgggtgtggtggggttctcctgcctcagcct 

cccgagtagctgggaggctgaagtaggaggttcacttgaacccaggagctcgaggctgcattgaactatgattatgtcac 

tacattccagcctgagtgacagagtgagatcttgtctcttaacaacaacaaaaaaggtaattctttaaattgcacactac 

actgctgcttatattccactggctagaggttagtcacatggtctcatgtctttattctaacaggtgatgtgcccgagtaa 

atcctggggccccagcgctaagaagtagggggagatgaagccctgccctcctgcttgaaccctagaggtagaggttgcag 

agagcggagatcatgccactgcactccagcctgggtgacagagtgagactccatctcaaaaaataaaaaataaataaata 

aataaataaataaataaataaataaaaacctacagcaaagaacaaagagctattgcaccgttactgtgggcctggcagta 

ttatgacaacttttttttttttttttttttttggagatggagtatttttctgtcacccaggctggagtgcagtggctcaa 
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tctcagctcactgcaacctctgcctcctgggttcaagcgattctcctgcttcagactcccgagtacctggtattataggc 

acatcccaccacacccggctaatttttgtatttttagtagagaccgggttacaacatggtttcaccatgttggccagtct 

agtatcaaactcctgacctcaggtgatccacccgcctcagcctcccaaagtgccaggattacaggcatgagccaccacac 

ccatccagtgttctgacaattttatacttattaactcatttagcaaccctctgagctagat.gctattgt:tatctccactt 

acaggaaactgagtcacagaatggtacagtaaaataaccttgaccgaggtttacatggctggtaagtggcagagaaatga 

aacttgaacccaggcaacctgtttcctgagttggactctgaactactcagccatactgcctatttatttatttatttatt 

tacttacttacttatttattgagatggggtcagccgggcatggtggcttacgcctgtaatcccaccactttgggaggctg 

aggggaacagatcacttgaggccagcagttcgagaccagcctggacaacaaggagacagggtcttgctctgtcacccagg 

ctggagtgcaatggtgcaatcatggttcactgcagcttcgacctcccaggctcaagcgatcctcccacctcagcctccca 

agtagctgggaccataggcacccactatcatgtccagctaatcgaaaaaaaaaaattatatagagatgggggtctcacta 

tattgcctagactggtctctaactcctgggcttaagcaatccacccaccttgccctcccaaagtactagaattacaggtg 

tgagccaccgtgcctgaccaatacttcctctttaataaactaaacaaaaagtgaacaaaccaatcccggagggaacagat 

aattcaaggaatacaagaaaacttatgaaaagaaagattctagtgccaatatgttcataaaacaaaaaaaagcagctatg 

aaaagaaagcaaaaaataattcttggaaatttaaaatagaattgctgaaataaaaagaaattcaatagaaaagggtgtga 

acatctgtggttttggtttccaaaattagttcactcttctttgggtaaaaataccctgattttcctgctgtattctcaag 

ccctgtgggttggtggaattgacacctcttccatttactgcacgagagaagcatgttgctgctcacagttcattatgagg 

agccagcaggtaaagacaacactgagggcagatgaaagatatatactgaccctgggtccttgtggacatcattgagtccc 

tggatcaagccttacctgaagctaaagggatgagtgcttctcactgttaatagccactgatcaaattccatagaaggact 

ccaattatccttgcctcagttgtgtgtccagccctcggatgagctaccatcaagggaaactgccagaccttggtgacaag 

cccacccactgaaaatggggaaaaaacccaagtcacatcattgaactgaaagagctgcagttccccgaaagacaggagag 

aaggaatgctggataaacacaacaactacttcatctcactgcacctgcaaagaatccaggcacaggaaggtgggcacagc 

aagtagtctttctggttgacataacaaagacaacatctgtttcttctctcagaaagaagtaagaaaggcaat.t:ggagaac 

cagaaacgacaaaatctattccaggaccaaatagaaagaaaacacatctgaagcatgattttgaacaattagtggggtgt 

aagaaaagagaatccatttgaccttgacatctgaaacagcaaatattctccatcaaggcactttagggtagaggaaatga 

tactatgtttccttctcaaaggagctatctggttacgtggttctgcagtaaatgacgtttatacaatcataacatccgtt 

ggttttcaattttcagaatcaacctgaagataaagcatggaagatttcattataattaccgaacagcatgcaaatgttac 

aaactttgaccatgaaaatgtaaaaaagagcccaggcgcagtggctcacgcctgtaatcctagcactttcggaggccgag 

gcgggtggatcacaaggtcaggagtttgagaccagcttggccaacatagtgaaaccccatctctactaaaaatacaaaca 

ttagctaggcatggtggcacgcacctgtagtcccagctacatgggaggctgaggcaggagaatcacttgaacccaggagg 

cggagtttgtggtgagccaagancacgccgcggcactacagcctggtcaacagagcaagactccatictcaaaaaaaagaa 

aagaaaagaaaagaaaagtagacagcagaaattagagggagatggcaggtgacctggggcegggggaggcacaatttttc 

ttacatagtgatagggctcaagaaattctttctaaaattgatgtatgaggaacgaaattttaaatatagattgtttagaa 

ctataagtagcaccaacacaggactccagtaatcacacagcaaacaaaactgagaaaacagacatgggcggggagatggg 

aatgggtgatttccattccctgctttaatgatgaggtgtcagtagatgctctcttgagttactaaattgagaaacaaaga 

taaagtatattgtttagaggagtgatgttcaaactccagaaaaggctgggcgcggtggctcatacctgtaatcctggcac 

tttgggaggccgaggtgggcagatcacttaagcccaggagtttgagaccagcctaggcaacatgatgaagccctatctct 

actaaaaaaaaaaaaaatacaaaaaattagcaaagcatggtggtgcacacctgtagtcccagttattcaggatctcacta 

ctgtactccagcctgggtaatgagagtgagaccctctctcaaaaaaaaaaaaaaaaaaaaaaaagaagaagaagaagaaa 

actccagaatatataaaaatgaaaagagggcggaagagtggaactgcggggtagaaaattttgcattttgtttcatctct 

ctgtatatgttgaactttttttttaacctacatgtactacgtcatatccacagcccccaaccatccaccaggcatcaact 

gctgtgtttatatggagggttgagcaatcattcctgcctccttccagttcgtcttcctgtactgcagaggctagaaaact 

aaattcatatcacccagattcccttccagctacctcaacgctaccacctcattcagccatcactgattctccacttcacc 

aaactggtcaataattgtctcttctaaaaatattagaattgggactaagagacactacagctgacccttaaacaacatgg 

gtttgaactgtgctagtctgcttatacacagatttttaaaaaaataaacatattgaaaaaaattttggagagatgtgaca 

atttgaaaaaactcacaaaccacatagctagaaatatcaaaaaaaaaaaaactgaaacagagttaggtaggtcaggaatg 

cataaatatatatgttaattggctgtttatattattagtagggcttccagtcaacagaaggctattagtaattaagtatt 

tgagtcaaaagttatacacagatttttgatgatgaggaggattagtgcccccaacccccatgttgttcaagggttagcta 

tatttgaaatctgctggtagctggaattgggcaactaaattcacgagatgtggaacggtcgtatacatggagaataaata 

aacaaataaataaataaataagtatttgttggcctggtgcagtggctcatacctgtaatcccaacactttgggaggccaa 

ggcaggcagatgacttgaggtcaggagttcgagaccagcctggccaacatggtgaaataccgtctctactaaaaatacaa 

aaatcagctgggcgtggcggtgtgcacctgtaatcccagctgcttgggaggctgagggatgagaattgcttgaacctggg 

agtcagaggttgcagtgagccgagattgcaccactgcactccagcctgggcgaaagagtgggactctgtctcaacaacaa 

caacaaaaaagcaattgttaaaagaataagaaaacaagccacagattgggagaatatatttgcaaaacaaatatctgatg 
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acccaaaatacacaaagaactcttaaaacccaacactaagaaatcaaacaaccccattaaaaaatggacaaacgggccag 

gcatggtggcggtggctcacatctgtaatgccagcactttgggaggccgagcagggcagatcaccttaggtcaggagttc 

tcaactagcctggccaacatagtgaaaccgtctctactgaaaatacaaaaactagcogaccgtggtgacgtgcatctgtg 

gtcccagctacttgggtgtctgcggcaggaaaatagcttgaaccagggaggttgcaatgagctaagatcgcgccactgca 

ctccagcctgggcaacacagtgagactccgtctcaaaaaaaaaaaatgggcaaaagatgtgaacagacacctcatcaaag 

aagatatacagatggaagataagcatatgaaaatatgcttaacctgatgtcattaaggaattacaaattgaagcaacaag 

gtaccactaaacccctattaaaatggtcaaaatccagaatgctaaaaacaccaaatgctggtgaggatgtggagtgacag 

gactctcattcgttgctggtggaaatgtaaaatggtatggccatttttaagacagtttggcagtttcttacaaaactaaa 

catagtcttaccatacaatctaacaattacactcctagatagttacccaattgaggtgaaaacttatatccagggccgag 

cacagtggctcacacctgtaatcccatcactttgggaggccaaagaggaaggattgcttgaggccaggagttctttcttt 

ttatttatttatttatttattttcttttgagacagagtttcgctctgtcacccaggctggagtgcagtggagtgatctca 

tctcactgcaatctctgcctcccggcttcaagcgattctcctgtctcggcctctgagtagctgctgggattacaggtgca 

cgccaccatgcccagctaatttttatatttttagtagagacagggtttcaccatgtcagctagactggtcttgaactcct 

gacctcaagtgacctgcctgcctcggtctcccaaagtgctgggattacaggtgtgagctactgtgctcagccaaggccag 

gagttccagaccagtcctggcaacacagtgataccctgtctctacaaaaaaaaatttttttaattagtcacatgtggtgg 

cacacacttgtagtgtcagctactggggaggctgaggctgaggatcacttgagccccaggagtttgaggttgcagtgagc 

cacgattatgcctctgcactccagcgtgggcaacagagtaagagaaagaaagagagagagagagagagagagaggaagga 

aggaaggaaggagaaaagaaacgaaaagaaaagaaggaaggaagggaaggaagggagggaaggaagggaggaaggaagga 

agaaagaaagaaaggaaagaaataaaaaaggaaaagaaaagagagagagaaaccttatatccatacaaaaccctgcgcaa 

gaatgtttatagctgctttaCtcataattgccaaaaactggaagcaactaagaagct ctccaatgggtgaaaggataaac 

aaactgtggtatatctgtgcaatggagtatcattaagtaataaaaagaagacttagcaaaccacaaaaagtaatatgcat 

actaaatatgcatttagtaatattaatatgcattaagcaataaaaaaagacgtgtcaagccacaaaaagtaatatccata 

ttactaaatgaaagaagccagtctgaaaaggctacgtgctgtgtgatttcaactatttgactttctggaaaaggcaaaac 

tacagacagtaaaaagatcaagggttgccgggggttctggggacaggaaaggatgaataggtggagcacgggaattttag 

gtcagtgatattattctatatgatactataatggtggatccatgatatgcttttgtcaaaacccatagaaagtacaacac 

aaagagtgattcttaatgtaaactatgtgctttagtcaacaatgtattgatattagttcattaattttaacaatgtacca 

cactaatccaagatgttaataatggggaaactgcgtgtgagggaggggatacatgggaactctgtactacagctcaataa 

ttttataaatctaaaactttttttttaaataaagtctcttagaaaacaataaaaataaaataaaaagtcaattttttttt 

ttttttttgagagggaatcttgctctgtcacccaggctggagtgcagtggcacaatctctgctcactgcaacctctgcct 

ccttagttcaagcgattctcctgcctctacctcccaggttcaatcaattctcccacctcagcctcccaagtagctgggac 

tacaggcatgcgccaccacgcctggctaatgttcgtatttctagcagagatggggtttcgccatgttggccaggctggtc 

aagaactcctgacctcaggtgatccgcccgcctcagcctttcaaagtgctgggattacagggttgaaccacatgcctggt 

ctaaaatgtccatttttaaaaggcagtctggctggggcagtggctcacgcctgtaatcccagcaccatgggagaaggctg 

agcagggtgtatctcttgagcccaggagttcgaggctgtagtgtgctatgacggtgccactgcactccagcctgggtgac 

agagtgagactctgtctctaaaataaataaataaaaatacaataaaattttaaaaggcaatctgcagcaagagaagatga 

agttagcaggagagcaggacagaggagagtccatgaacaacagggagaggaaagtggtcatggtggcagctcccaggctc 

ctggcatatgttcaattccctgttcccagccctcaggaagcccagatgtccccacctgcccctacatacaaaccctggat 

ccttgacatcaacttctcctttccttcatgtgctctaatgagtttttgttacttgtggaacatttaactaacatcattaa 

ccaagtggacctgtctcagagtggtgttatgagaagacagcagccaaaagacagctgcagccaagcacagtggctcatgc 

ctgtaatcctagcattttgggaggccgaggtgggtggatcacctgaggtcaggagttcgagaccagcatggccaacatgg 

cgaaaccccctctccactaaaaatataaaaattatccgggtgcggtggcgagcgcctataatcccagctacttgggaggt 

tgaggcaggagaattgcctgagcccagggggcagaggcggcagtgagccgggatcgtgccacttcactccagcctgggtg 

aaagagcaaaactctgtctcaaaaaaaaaaaaaaaaaagacagctgcaacaaatgtcaagctctgtgtgttttcttttct 

tttcttttttttctatttaattaatttattttagagtcagagcctccctatgtcacccaggctggagtgcagtggcacag 

tcacagctcactgtagcctcaacctcctgggctcaggcgatccttccacctcagcctccttcctagctgggactacaggc 

gtgtgccacgacatctggcttgtgtgttttcttttctttttttttgagacggagtcttgctctgccacccaggctggagt 

gcagcggcgcgatcttggctcactgcaacctctgcctcctgggctcaagcaattctcctgcctccgccuoctgagtagct 

gggaatacaggcgcacaccaccatgcccagctaatttttgtatttttagtagagacggggtttcaccatgttggccagga 

tggtctcgatctcccgaccttgtgatccacctgactcggcatcccaaagtgctgggattacaggcgtgagccactgcacc 

ctgcctggcttgcatgttttctgacatactgtcaaaaggatactcatactaaatggcaacacattctcaagccccttcct 

tttcttctcctatctgctttaccacacaccatagctgcttttaccgttttctccttaaaaactcaaaaaaaccttccccc 

aacctatctatccacttcttgtcctcccctcaaacccactccattttgattctgcatcataaaaaccaagcagagttctg 

gggaggcagaagcctcctcttatgatattggaagggggtggatgaaattgagcacacaaagccagaatcctcttttgtgg 
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aaagatgggggcagtgggcagagggaagcaggctcatctctttccctctcttcccttcccttctctttcacaccacacgc 

tccgcctgggtgagctcatcgtcctcgtgtcttcaatttccacctccaggagatcgactgccaaatttctacccgcaatc 

ccaaactccactctgagctccagacccatctactaattgcctagtcaacattttctctgtgggatcgaatcagataggat 

tatattctgctgtgactaacagagactgaaaattcagagtcctaaacaaaagactttctttttcttgtgctgaaaagtct 

ggaagtacacagccctaggctggaatagggatactgctggggggctccgcctccttgcaggctgcccctctgctatctcc 

agggtgtgtccctcaaccacactgtccaaagtggtgactgaaggaccagtcctcccactgacattccaggcaacagaatg 

gagaagaacaggctgaggaaggaggaaccacaggcgccctccagctgtttcaaggaaggttcctggaagatgctaaataa 

caattccgtttatatctcattagccaggcttagtcacgtggcaacacctagaagcaagaaaggctgggaaatatactgtt 

tctgctgggttaccgtatgcatgactaaaaagagagcgctctgttcttaggaaaatgagaatagactgggttgggggagg 

ataaccagcattggctacactgcagccccacaacctcctcatgcttgatgtggctgaaataaaaagagcatctcccccta 

acatttcctttcccttagtcttgttccttttctgtaccccataccctagcagaggggcaattctgtggggtgggctcact 

ggcaaggggaggtctcacccaagctcagtcaagggtgagcagagagaattgtacaaggaagctgaatttccagctggtct 

ggaggaaacactgaaaaccccaaacagaagatgtccaggtggagcaaatgcagctgagagttcttgtccaaggggacaga 

atgtcagcagagtaggacacaagggcaggtctctactgaaagcaccaggggcaaagtcacccagcccttcaaccggctgg 

ccaccaagcacatcggctccactctgcctcctgcccgtgccaccctttgcacttttgtctgtaccatctcctccaccagg 

agtgcccttcccgctcctccatgtggtatactctcagccacccctcaggctcagatccagagccacctcctccaagaagt 

ctccacctgtctgctgggaaccctcacatcacactgggagacgaagccaaaagtgggggggtcagtacagaggtgtggga 

ggtaggtccaatcctggctggacctggctttgggtgccatttgggacagggtattggggaggttcagcaggaaggtggga 

cctcaaggct ttctttttatgaaggaggaggaaccagctgt tgacagagcagcaacagcagggagccaccaaggctgagc 

cttgaaccctgggccaccaggctcctgcaggcaactgggagcagctcagcctctcacagtcccatttccagggaggaaga 

tgatgcagcctgaggcaattggccacctggaggaagttcagctgggtggggcaggcagaggaattcctggaaggaagctc 

acccctcagcactggggtagagggacactagagacaggggtggcccaggagctgcccacctcctatccccatgaataagc 

aggtgggcccgggtgaccagctgggagccacccctgggtgctcaggtaccatgcttctggcccagcctcctgagctgtgg 

cacaggcaagagcaccggcccacggggtctgcatggggcagtgacctgttctctctaagcttcagctttctcatctgtag 

aaagagggtgaagagtccttccccaccatggttgtggtgagggtaactgaagtgtcagggtaggggacccagtggggcac 

cagcacacctgaggttcagtcaacaaggcccactggtgaagaaaacctcgcccaccctcccgtccctcctgaggcaggtg 

gtggaggcacatgccctctccattgtgtgtgcacgagtgtgtgggcgcatgcacgtgtgtgcatgcatgtgtgtgtgcat 

gtgtgtgcatgcatgtgtgtgcgaatgtgtgtgtgtgcgcacatgcatgtgtgctaagctctggagaacaagcagccagg 

cttgggcaggagtccgggggcaagggggggacatgcaaagttctctccacagcccctcagtgctaacaggctgaggtggg 

ggatgacatgtcttagaaacaaacttcaccctctgccctaaaactgcctgctggtctgccacatcgggccaagctgtggg 

tacttggaaagagggccctgctcctgccttccaaacccaggaagtcttgaaatccccatggaggggacctggggccaggg 

gactccccaagcagacacagcacccacacaagggtctaagggacccctttctccccaaccgcctgaggtagggatggatg 

tggacaagcagctcagggctggggctggcctaggtggaaggttcagaagcacagaaggcaggacgtgcccaaggggccgg 

acatggggttggaggctaggggctggaaaaagagtttgggaggtcagccaagggcagtccacgccttggctcttctctgg 

gtcatggggagccacggagagtataggagtggtgatgggatggccccagagttgtgcctttgaaagtcacctctggttgc 

agatagagatggactgggggcagggccgagaggaagggcatctcatgagaggggaaatccctgaccacagtccagaggaa 

aacagcgggcctggggagaggctgctcctggggaggacctactcagaggctgattcttcctcgtgctaccccacccaggg 

agatgcaggaagctggtgaggtcagtgggtgggcccacaaccatgctggagaggagcaggagggagacgggagggcacag 

agaggtagagggcagggactctgaggaccctcatccatgcccacctggacaattgaatcgagcttcctctagtgtggaat 

gtgggctcactggggctatttgctatccttctctcatgtcagactggatgcggcccaagggtggggtctgtgactccctc 

atcagactgaggggggccgaaatgtgggggtgggggaaggagggagcttccaaacaggaaggtttggagggcgctggcac 

tgaggacaggggatacgccccccatgagtcttcacccctgctcagagcctgctgccgtggccctcagagacgagcccatc 

tttggagctggagttggggacagggaaaccaggtggaagggaaaaagaaaaaaggggcaggcatgagtgtgaggggccag 

accctgcagaccccaggacatgcacagcagacacacactccaatccctcctggctctgctcagcctcaaacgaggtgcag 

gcctgcaggccccaaccaaggacccctgttacacacagacacaaagacacaagatgcagcaacacacacacgtaatacag 

aaacacagtcacacactagcgcacacacagacacagtcagacaatccaagaaacacactcacacatgaacagacatataa 

gcaaagtcttacacatacaaagacacacttaagcacagacacaaacacaccaacagatgcacagacggacacacatagac 

gcacactgggaggcggcctgtggagactgctgattggataccatgttggcttggtatccagaattgttggtgcagcgcac 

tccccatgcctgacctcattaaggaggccctctggggcagtggctctgagaagaaaggcagcaagcctagggaggtgaca 

ctgtagctggggagcattttggctgggcagagaggaggaaggagagcatttcagggagggaaaatagcccttgcaaaggt 

ttggagtggaagagcaaaagggatgtcccaagaactgtgcctcatttgatggggctgtggggcccttctctgacgggcag 

ccttcccaggcacaaagggaggtggactgagaagcatctggaacttctgtgggggcacatggggcagaggggaaggggaa 

caggcagaacgagtcagctgcctttgaaagctccacgagggacctagcagaaattcattcattcatccgttcattcattt 
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atttatgacaggttctctctctgtcacccaagctggagtgcggtggtgtgatcacacttactgcagccttaaactcctgg 
gctcaagcaatcctccaccccagcctcctgagtacctaggactacagacgtgagctactacacctggctaacttttaaaa 
ctttttagagacagggtctccctttgttgcccaggctggcctttgttgcccagccagcttcaagccatcctcccacctca 
gactcccaaagcgctggattacaggcatgagccacgggttcccagcccagaaagagaccttcaccccacaaaccccctgt 
ctggaggccaagtcccagcatccccagaagggccccggttaggacaagaaaaaggctgtgtgcctcggaggagaggctgg 
gtgccccctggtgcggagtggtccccttgccgggcatggggcctcagtctgtgagactccgcgcctccgtcgtctgaact 
tgagcaaagacagagctaggcctagcctggggacccgagggcctgccctcatccgcaattctgcctcttttctggccact 
gaggaccgggctagggaagctgtaggggcagtaggggtgggaggagagagggttccgggtcaagggactggaggggtctg 
gggcagggaagtggaggggtctagggcgggcccttccccctccgctgttgggccctcgctgaaggggccgaggctgagcc 
ccaaggtgggcctgtgggggcggggcgggggagcccctcagcagctggaccgcaggaggccgaccagggtctggaactcc 
tcttcccctccctccgccgccccctctcccttctcccaccccctgggctgctaggggagcgggggaggcgcaggaggggc 
tcagggaggggccctggacgcgggaccaggctgggcccctcggcggaggcccgcgcaggcaggccctgccgctggcctcg 
cacatctgggccgggagcgcgggccgacccggcggcgcaggcggcgcggccatccggcctgggggagggggcggcggccg 
gggtgggccggccccaagaaggcggaaggcctcagggccgggcgcacgttgagggctgcgaccgccgcagcgggcacggc 
ccaccgagctgggggcccgggatcgcggcggctggacggggctggagctgtcgggagggcggaggtgagttctggggcgg 
gggctgccgggcgccccgagtggagaaaggcgaggaggtttgccgtcccgggctgtcggctgagaccccaccaaaaacct 
ccagactttgtctggtggggacagatctgcaagcccctctctgcagcgtggctgcgggctcgggaaggcacttggggcag 
cgaccttggtctcccacctgggcttcgggggaccacctcccctgtccctcgcacagctgctcagaggcttaggctggaga 
aactcctgtgtcttaggggctggcaggactggtgtgggctggggcctgctccagggagccaaactgagggacccggtgcc 
taggtctgaggtggaaacctgggagttgagtgtatgtgtgtgtgttggggggtgcctcaggggtcccaggccctgtggtc 
ccgaagcgcaacagagtactgagtggcccaggtgccagggcctgaggctgggacctggtcttccgggccagcttgaggtc 
caggtggggtggggtccaggttcccgctagggcagtggagcctatctgtgaggttagcgtgtgttgatgagagagactgg 
aagggagtgatcacgaaactagggaccctccggagagcagacgcagcgcaggaatcccaggccagaggggtaatgggggg 
ctgtgggcagagcatgggaggggctctgttgtttgtgaactggctgctcccatgatgaggacccaagggggtgcggcagg 
atgtatggaaagcaaagatgaggacagggaggggccatgggaggccatttctgtcactccctgccccagcgcaggatggg 
gcagccacctcactcagccacacatggtcccatcttcttaagggctcctctccagaacagagtggggcagccctttgggt 
gccgtttctgatctgtgctttgggccctggttggcccagtccgtggcttggaagaaacgcctgtggcctcacttagcccg 
ctgacggagtcctgttccattacagggctggctgggtgctgagcagagagcagcaggcggaagaggcagggtcctagtga 
ggaggggagctggaatgacctgggtgggctctttgtctacacaggcatcgtgtggagccctgaaacccagaacactgcat 
tcccatagtgctcctggggacatgagcgcccagctgaaacaccccacacaggaagagatcctccgaggatcctgtcaggc 
aaagggttgggcccctgcaaggtggctcacgcctgtaatcccagcactttgggaggatgaagtgggaggatcgcttgagg 
ccaggtggtcgaggctgcagtgagccgtgatcgtgccactgcactccagcctaggcaacagagcaagaccctgtctcaaa 
aaaaggagcgggagtgggggcagggttcgtcctaggctattgtaagaccaggggatactctgggatgggagtctctggtt 
gtgagttcctgctgggccacaccctatcctggtccccagtgactaatcggccctgcctattgggtagaagtcagggccag 
gggtcagggtgaagctgatgactagaggtcaatatgaagccaagagttaggggtcagtccatgggcagggttagggatca 
ggctggggccagagcttggcatgtgcttgtagccaggaaatggagttcctttctgtgtaaatcaaactgtcccactcaat 
tactctcaaacccataggcaccaccagtctgaaattagtaggcccctgaagcggggctcagcctcccctcattttcagct 
gtggactctgcccagacctagctttcgggagctgggcttgtcccgtgttctgtggtcagcactctccttaagtgtccctg 
ggtgtgtgtgctcgcttggcagtcagccacctcttggcttacctccacctctcggggaatccatgggatcctccttcttt 
aacacaattctttttttttttttttttggagacagggtctcactctgtcacccaggctggagtgcagtggtgtgatcatg 
acttactgcagcctcaaccttccaggctcaagcaattctcccacctcagcgtccaaagtggctgggactctaggcatgca 
ctaccacgctgggctaatatttgtatttttcgtagagatggggttttgctgttgcccaggctggtctcgaactcctgagc 
Ccaagcaatccacccactttgacctcccaaagtgctagaattacaggcatgagccaccacacccagcccttaacacaaat 
tgatttagttcctacctccttccttaaagaactcatgatgactagaatgcgattctcagtaagaccaaaatatctaacaa 
gaaaattcagaatctgcatcatgaagggaaagaacactgctcaaacctgcccaactccctgcttttatcaataccttgca 
tatctggacttgttaacataagccctctgtgtcacccattctacaggaaagggcactgaggcacagagatgtggcatgat 
gcactcacgctaggtgccataataggctcttgccatccctgaaaccccagcataggggcactcagcatatcgcacaatga 
gagacggtagataactggacaagcttccgatagctgggaagagtataggcccatggtttggatgtggaatccagtgtaga 
gcgagga'ctcagtctttttttttttttttcttgagatggggtctcactcttctcgcccaggctggagtgcaatggcgcga 
tcttggctcactgcaacctctgcttcccaggttcaaacgattctcctgcctcagcctcctgagtagctggaattacaggc 
gtctgccactacgcttggctaatttttgtatttttagtagagacaaggtttcaccatgttggccaggctggtctcgaatt 
cctgacctcaggtgatccacctgcctcagcctctgaaagtgctgggattacaggagtgagccacggcgcccggcccagag 
tctttttttttctggtggttctaggttgtttctcctccaggtacaatagcctcggtccagaaaatccattcattcattca 
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acaaagtggttccccagtgcttagcacagagcctggcatgaagaatctcgataagtgtttgctgaagaatcagaacaaca 

tacaaataaggctgccagctcctccctcgagaccttgatccagtagtatgctggttaatggtttccaatcagctctggaa 

gaaggataaaatcctgatttatggtgtttgccaatttctgtcttgtaaatactcccagccatgactgatttgaagctacc 

aatgtgatgtcattgaatgcagaattgggaagagatagaaagaatcagctcgcatgagctggtgggacccagctcagcat 

cccactccatcatcctgagctgggctctgggctcctgcccctggctcctaagaaccctcccctccccctctctgtgcttt 

cccttttctcctcaacagccagctgctatatgcatgcccacaaccagcacccatccttcttctgattccgactcctgagc 

atcca'actccaatggtacagaggacagcaagagtgtctggaaatctccatggtgctggccatgcaggagaatcctggaca 

tacaagagtgtgcgggagtccaggtgtctgtccacaatggggctggtttgtgtgaactgtgtgagcaagggtgtgtgcat 

gtgtctatgggcatgccagagccagctctgtgtctgcctgcgttctatagcaaaactttcagattgggccgctggtgaca 

gagaaggaatgcattggccctgcgtagcccccccctttgcctagaggagcagccaggcccaccaatgagaacatagtgtc 

agtggtctgcagactggctgagagacactgttgagcaacacttctcaaacttcaatgtgcacctgaatcacctgggaatc 

ttgttaaaataggccagacgcggtggcttacgcctgtaatcccagcactttgggaggctgagacgggcggatcacctgag 

gccaggagttcgagaccagcctggccaatatggtgaaaccccgactctactaaaaatacaaaaattagccaggcattttt 

aggtggcattttaggcaaattaggtggcgcatgcccgtggccccagctacttgggaggctgaggcaggataatcacttga 

acccaggagatggagattgcagtgagctgagaactgagatcacgccactgcactccagcctgggggacagagtgagactc 

tgcctcaaaaataaataaataaataaataaataaataaataaataaataaatatgaaatacatgttctgattcagcaggt 

ctgggctggggcctgaggttctgcatttctaacaagcttccaggtgacaccaatgctgctgatcctcagcccacactttg 

agtaacaaagctgttgatggtttgaaatatcctgcctatagcccggccccagaacttcatagacaggatcgagcccctcc 

catcttagaaaataaatattcttcctcacctcccctcagctctagcgccctccaactactgccctcacatgtattcctga 

tttcagccaagcttttcagaaactctagctcaatatgattgcgtcactacactccagcccgggcaacagagtgaaactcc 

atcttaagaaaagaagttctagctcatgtcacctcttgctcactcctcacctaagcttacctcccttgggccactaaaac 

agtgtttccagtctcatctggggaccttttacagggatctagctttacccatctcttggtacaatcttcatgagaccatg 

aggcatctcagtctcctgggccggtatatattcctctgttcagcctttaaatgtaggttctcctcgtgacagagtgacac 

cctatctcaaaacaaacaaacaacgaaggccaggcacggCggctcatgcctgtaatcccagcacttCgagaggccgaggt 

gggcggatcacctgaggtcaggagttcgagaacagcctggccaacatggcgaaaccccgtctctactaaaaatacaaaaa 

ttagctgggcatggaggcactcacctgtaatcccagctactgaggaggctgaggtaggagaatggcttgaacctgggagg 

cagaggttgcagtgagccaagatcgtgccactgcattccagcccgggcaacagagtgagaccctgtctcagaaaaaagaa 

aagacaaattgagtcatgcccccaccccaccctgccaccccaaacctttcaagatttgctgttgcccttcatataaaacc 

aaatctgaaatagggtgcacaggccctgcccgcagggtaagcccagcccagccctctgtccagacttaccatgtcctctt 

cccaccccgcattatccaactttgacctccctagcactcagcaacgcttgtcatttcaaaatcacttgcctgattatctc 

ctccatcagaccgttggtttctagggcagggacagtgtctctctgggtcactatctcccagcacctggcacagagcaggc 

accagaaaacatttagaaggatcctgattccaagaaagtaaagccattttttgacacaagctgggaactgtgggacttta 

gatgagatttaaggaattactgttaatttcattagacatgagaatggtgtcgcgatgacattgtttaaaacgtctttatc 

tgttggagagatgtattaaagagttcacgggtgaaacaacacgatggctgggatttgcttcaaaggactccagccaaaag 

taaacaaataaaataaataagcattcagatgagggaacaaagggacaaataaatcccactctgcttgtctgttcctgttg 

acagctacagggcctgcaaggabgttggccgtcccggagatgggcctgcaggggctgtacatcggtaagaacccccacca 

ttctgccctgacccatcacccactcacccgagcaggcactgagacccacctacaggttgtgcaatgtacctctcccacag 

agcttcccaacagctcttctgtgagtgagtgtgtgtgtgtgtgtgcatgtgtgcagggcaggcctgtgtttactggaccc 

ccccgtgtgtgcagaaagggtctgtgtgcaccggtctgtgtacgtgagtatgtgtgcacacaggactggtctattcacag 

gaccctcatgtgtatatgagtgcgggctgagtgcgtgtgtttgtgtgtgccagaagggtctgtgcacacaggagtgtatg 

catgtgcgtgtgttttgttcaggctctgtccacaggaccctgtgtgagtgtgtcagtgagtgtgtgtctctgcgggcagg 

tctctgtgcacgggatctgtgtacgggagtatagtctgcaggtgggcatgcatacagagagttctgcgtgtgcacggagg 

gtctgtgtgcacaggagtgtttgaatgtccatgtgttttgttcaggccatgtgcacaggaccccgtgtgagtgcatcagt 

gagtgtgtgtctctgcgggcaggtctctgtgcacaggatctgtgtacgggagtatagtctgcaggtgggcacgcatgcag 

agagttctgcgtgtgcatgtgcgtctggccatagcctggtctgggagcaggcttgggttctccgcaggggtgggaggccg 

gaggcaagagggacactcctcaggaagtccattcacaaagacctggtgggctgggcccttccttcctgcctcccttccca 

agctcgagccacagcaacttggcctctcgggctgcagctggtggcagggctggagcctgccctggctgcctgaggagagg 

gagcccttgaaatgtggccactctttccctgcccctccagcacctcagactctggccctgtccaggcgcttcccagagtc 

cccctcatgcctggcccttcgcctgtgttcccaggttacccccaaggccagggagagtggagacctgcctggcagcacgt 

gacaggactgtggctgagggagcacagggcctgcctggctcagcctaaggcgagccagacctgcctgatttgtccatgga 

gcttcctaccagatctcactctccctccctagccagcctccctggcctgtccatggatatcaaggctgccccagtactcc 

tcacatgccccttagcaatgcccagtgtggtggtggcagctaccagccaggcaggccctcctgccccataagccgcctga 

tcctcggggcgaggcctggggctgtcctccagggctactctactccctcccaccaaaggtccagctcagtcccaggcgtg 
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caagagggtcttgatggggcgtcctgccctgaaatgctggctcagggtgtgtgtatgtgtgtgtgtggggggtgggaatc 

agacttcctaatgttgtggacacccccagttcaggactgggctatctctatgggtgtgggaggaggtgctccccctcctc 

gatcctcctcaagcagccagctgggcctgtctgtggcccttgtgggcatgggagctgcagagttccctcaagtcctgatt 

agaaccacgtggcccaactttagggtggaggggagcaaccccacctcctagccagttagctctgtagctccacct tttga 

aggaggagggcttctctctcctctactccctcccaggagctcacctgccctatctcccctgcccttccatctctggactg 

agtctccctgccacacaatgcttttctgtctctctcccctcccacactgtgtgtctgaggggccccagtggggagacagt 

gagctcctggggaaagcatgggtcatctgctagggaaacgttgcctctgacagcagaccctcagctcctgcccctgcgtc 

atgtgcgtctttcctacaccccgcagtcctcctccccagcagagagccctttgccttagcccagcaaggactgatggaga 

acctttgggtgcctggcttggggccccgctgccatacctaccctgcttgtgccaggatgaactgccgtctcttctctgca 

ggctccagcccggagcggtccccagtgcctagcccacccggccccccgaggacccaggaaagccgcggcattgcccccct 

cacaccctcgcagtctccagtaagcccagagcagggaccaggtggtgggtgacctggctggtgtggacagggtcgtgcgt 

ggcaaagtcatgacagggcctcagctaggaaggaggaggggatgggggtagcaccatgcctcttgtccccgtacactcca 

gtcatgtgccccccagaaggattggccttgggtgaccctggactataaagggtaacaattctactgagaaggcgagcccc 

atgtaactaactcccaacccccccacacaggaggcacccacggccattgaagctggcctggctgaggcaagtgttcccct 

gagcatgtggcttttctaggtccccagagcactgggcctcttgtggaccacagcccaccttaatccaagtcatatctgaa 

ggagaggagatgtgttgccaaaaaaaaaaaaaaatagtctggaaaccattagcttgggctgggcatggtggctcatacct 

gtaatcccagcactttgggaggccgaggtgggcagattgctttgagctcaggagttcaagaacagcctgagcaacacagc 

gaaaccccatctctacaaaaaatacaacaattagctgggcgtggtggtgcacacctgtaatcccagctactcgggaggct 

gagggtggagaattgcttgaacctgggaagcggaggttgcagttagccaagatcgcaccactgcatttcagcctaggtga 

cagagtgagaccccgtctcaaataaaaaggaaactactagcttgaagtatccaggacaccacctccccacccccaagctc 

cctgggtcaccccacctggatgaccttgccctggtgatccagcttagggacccccttattcttaggggagtccagccctg 

gcataggagaaggcacacacttctctggggacatgcacgtgccctctcctctctgccacagaatcagacagtctggtttg 

tgtcactatggccacaaagagcaagaggataaaatatatacactggtccatgtgctgtaaaactgcctggaggagcaaaa 

ttgctcccagctgggcccagactaatggctgcctatgaagatggtgtgaggtgggggtggggctaaggcagtccagaggg 

agggtgggcagaggctggaagcaggaaagcaggagcttgagccaaaccctgcctggggctacctgagagacacgtccaag 

gctcagcctggagcctgggagggcaagggagcccgagcaggtgggcaggtggggtgggccaggtccagggctggtctgga 

ccacagtcagtggaggggagtgtcttccccatgcagaggaagcattggggctgtgggggatgggggtgtccctgctggct 

gcttgctaattctaagtcttgcacttgcagaaacccgaggtccgagccccccagcaggcctccttctctgtggtggtggc 

cattgacttcggcaccacgtctagtggctatgctttcagctttgccagtgaccctgaggccatccacatgatgaggtgag 

gtcggctgggctgagagagtgaggtggggagggtggggagttcctcataccttggtcccaaaagcactgtcaccgagaca 

tggggtctccttggaggccctgccacccccagtctggggctccccactggggtaaaagttggaggatgggccccaggcct 

gggtccttggcctcaactggagcagctcccaaccctctgtaggaaaagtcagactttgtttttaattttgaatttttttg 

agatataacaaacataacaataaaatatgcaaatgtcttttattttcctttttttttttttgaggcggagtctcactctg 

tcgcctaggctggagtgcaatggcacgatcttagcccattgcaacctccacctctcgggttcaagcgattctcctgcctc 

agcctcccaagtagctgggaatacaggtgtgcgccaccatgcctggctaacttttgtggggttctctgttttcttttgag 

acggagcctcactctgttgtccaggctggagtgcagtgacgcaatctcagctcactgcaagctctgcctcctgggttcat 

gccattctcctgcctcagcctcttgagtagctgggactacaggcgaccgccaccacacccagctaatttttttttttttt 

tgtatttttagtagagacggggtttcaccgtgttagccaggctggtctcaaattcctgacttcgggtgatccgccagcct 

cggcctcccagagtgctaggattacagatgtgagccaccacacctggcctttttttttttttctttgagacagggtcccg 

ctctgtcacacaagctggagtgcagtggtgtgatcatagctcactgcagcctcaaactcctgggctcaagcaatcctccc 

acttcagcctcctgagtagctaggactacagacagcaccaccacacctggctaatttaaaaaaaaaaaaaattttttttt 

ggagagacagggtctcactgtgttgccagggccggtcttgaattcctggcctcaagtgattatcccactttagcatccca 

aaaatgctggaattgcaggcgtgagccaccatgcccagcccacaaatatttttctaagcttgtagacaaacacacacata 

tacaaacatatatgtgtttttttaagtaaaagtgagatatacaacttgctttgttttaaagcttcaacactattttgtag 

atatttcatgtcagcacatacgaagctaccaccctctccgaacggcctagtattccatagtacagatgcgtcataattta 

tccattcccctattgacagttttaggggggttttttcctcctctcatttttcactattacaaaaagtgcagcagtaatat 

ccttctttgaaaacttgtgccgcaatctccatggggt'agattcctagaggtgattcctaggtccaacatctcaaatcttt 

tttcttttccttttttagtaaaaaggaaaagaaaaaataataaaacaagaaataaaaataaaaacaagaaaatgaaggtt 

ctaagggctgagttgacaagtcacacagttgttttgaaagacatgtttcaaaaatccattcaaatgttgggttttatgtg 

ggacctcaaggaccttgggacctcaagaccccagtcctctgaatatgtcctagagggtcggcaggcagtaggtgaagcta 

caggagacctcaaagctctcaacccagactgggatgggaggtaagaggggcttcccaaggcctggagactggatggaagg 

gtgaatggagacatagacagacaggtaactgctcccaaaatacatgggacttgagcttcatgagactggactgtccttcc 

ttccacatcacaggaaaccacaaaggctctgagcacttacagccctctaggagctgcagggcatcccagatgtcttgctg 
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atcaacacacatggctgaggccatgctggggctaaggacacatccaggaacactgtctctgtcctgtggaggtgacagtc 

acaggaaataactgggggacattccagcctcctcccaagctctctcttcccaccagacaccaagcccctaacatctctag 

aatacccatctacttctctccatggccactgatcgtgcccaagttcagccccgaccttxrtctcacctggatggctgcaat 

atcctcctccttggtttcctttctccatgctcccttccctgcattctgttgctccatgggagccagcgacctttctttct 

aaaggcaaatgtggtcatacctcttggctactttaaatccacgaagacagaattctgaatggccttctgtgctcttcatg 

atctcaccttagcattcttgtgttttctcctgctgcctatggagccgacccatagctagttctccctgctctttcacatc 

tgtaagtttttgcacatgctgttccctctgcctggattcccacccaagaagggagggttgcactgactgccctgtgcctc 

cgcaggaaatgggagggcggagacccgggcgtggcccaccagaagaccccgacctgcctgctgctgactccggagggcgc 

cttccacagccttggctacaccgcccgcgattactaccatgacctggaccccgaagaggcgcgggactggctc tact teg 

agaagttcaagatgaagatccacagcgccacggtgagtcacagggctccagacagggaggcggggccagcatggaaaagg 

gcagggctaatgggggtgggtgggacaaaaccaaaacgtgtgaggaccggcccgatggagtcgtggctgagagggggcgg 

ggctaaagggagacgtcggactccggtgtgggcggagctcagaaatgaggtggaggcggggctaatgtgggtggggctaa 

tagtgaagctggggttgcaggaggggtggggctaaggagaggggtcggggcagagctaatgtcacatggggcaagagtgg 

gacggtggtaaagaggaggggaagctccaggaaacggggtgattttaagagcgaggtcgtcaggaaatgagtgccaagct 

gaggcctcctgcagagccgccctgtgtccctgccaggatctcaccttgaagacccagctagaggcagtaaatggaaagac 

gatgcccgccctggaggtgttcgcccatgccctgcgcttcttcagggagcacgcccttcaggtgcgctgcggccccacct 

ctgccgactgtggcagggaccccctattttcccctcatccgaaaccgctcccccatcccgtccccgacattggatgggta 

gccaccgccggagctcagaggtcatcttctccagtaccctcctccctttttgtctggtagagcctgcaccaagccatact 

gatgggaggggggccgattcttccagctctgctgggaagtccttcctgtgatttgattagtacctccagttccgcagagg 

gctgaagaccaccctccctccaagccagctttcctctcactgccccctcctgtaccaggagctgagggagcagagcccat 

cgctgccagagaaggacactgtgcgctgggtgttgacggtgcctgccatctggaaacagccagccaagcagttcatgcgg 

gaggctgcctacctggtgaggacgtgcaggcgggcccgagaacactgctcaggaagggccaggcctgtccccatgcttgc 

atgcaccccaccacccttgagaccacagagtcattgtggaaagaacttcagcctgctcctgatgggagtttgtagagttg 

ctcccaccagaagagggagtgggcctgcaggaacaggggacagagggacaaaagacacagcccaggccagtgtagacagt 

gccttaagtcaggtgtcctaaaagcagagcccaagatggagagtcttttttttttttttttttttttttgagacagtctt 

gctctgtcgcccaggctggagcgcagtggcgtgatctcggctcactgcaagctctgcctcccaggttcacgccattctcc 

tgcctcagcctcccgagtagctgggactacaggtgcccgccaccacgcccggctaattttttgtatttttagtagagacg 

gggtttcatcgtgttagccaggatggtttcaatctcctgaccttgtgatccgcccgcctcggcctcccaaagtgctggga 

ttacaggcgtgagccaccgcgcccggccaagatggagagtcttatatgtgaggtctattgaagaaggtttctcaggagaa 

aggcgagggagggaggcgggacaggagaaggagctatgaaggatgtggtctttgctggagtctagcctcaggtttagcct 

cagctagatcccatggggagctgcagagggagaactgtagcacagagttggggccggcttcttgcacatccgtatcggtc 

tgtcactggttctggggaaatgggagggaaacctctccaagtgaggccattcccattcagctgagggcagttatccagag 

gaggtagcagctgtgagccaatagcagccaacactcacagtggcaggcagggcacccagaacattcacttcacttggcat 

cagtattttgagggacacccccaccaccgcccacctttgttggttcttgagcaggggaaataggactatgaaataaaggg 

aggcgattctagccaatgcatacaggtcttcctaccactgagatgctcaaaggtctagagcaaggagttggggggggctc 

ccgtggagacagagagggcatagcctcaagactcattgcttggtcgtaaccctagagccttggatacagaccctaagctg 

tccaaaagggatctcaagcctgcctgtggacaccctggagggacccccaatcctcccatacctatgggccaggcatgacc 

cagggccaggtctgtaaatgtctacacttttcactggaaggagctaaccctgagcacagtggacatccctgtgcaaagat 

caggatggaagcaaggctgtccccagccagaacaaacacctgctcccccagcgtccccttgcctattttcccaccctcct 

tctcacgctgttccctggctgctcaggaaagtctggccctgggtaaaagtctccattcacacccttatccctctccccta 

tctggtcagaatctggggaaccctacaaaaccataatatacttcccacagggatcccctaccctgaaggaaataactcca 

gatctcaagtgtttttctccacccaactggcctgtcttggtgcctggaatttcagcttcctccctgctgtatcagattcc 

ctgggaacaaagttctcctggaagtggggtggcctatggctaccgtctgacaccagcctagtggagaaagagggttcaat 

gagctaaaagggctccccaccacctcctctgagccatcacacacccccacattgtgctagagtctctgccaataccccca 

gccagcgctcagctggccgcttgtacctgtgaaggggaaccttgctgcagccccgctatcttgggaaatttgaaggggca 

gatcccaggggttctaggtctgcattctgtctagagtcttttcctgctgggcaacttcttggcatatttggccagggcca 

ttccctccccagcctggtcacagcgtggctatgaggcagctccaaatttgtgcagcacagaggggcctgagaggcctaac 

atgggtggggtgtgatggaggaggaggtgggagccccacaggccgcggttccccacctcacagtgccatcttaggtgtga 

cagcccacagtgctgcctgaccctgcccaccacccatccccaggctggactagtgtcccgagagaatgcagagcagctac 

tcatcgccctggagcccgaggccgcctcggtatactgccgcaagctgcgcctgcaccagctcctggacctgagtggccgg 

gccccaggtggtgggcgcctgggtgagcgccgctccatcgactccagcttccgtcagggtgagctgcccccggggacacc 

acccacccctggagggtcagagggtcactgaagccagaagctcagccatgtctagtatgaaggggagagggtacccaccc 

tggaggagcccaatttgagcaggcagaaacatggctggtggagcctgtctgaggaagggaggcacagccctgcccaaggg 
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caacctcgtctcagggtgggacatgccctacctagggcagcccaatctcagggtggagatttcaccttgccctaggggag 
caaagtctgaggagggcaggactccgcccagccctgagtctgagggttgggggagacacagactaccccaggaggaaact 
gctggggtactgacaaaggaggaagtcagcccactgcatcagctgtccccttcactcccctcatcttccccgacacacat 
cagccccaagctcctgctgccaggaccaggcacccaggccaaggacagctatggtcaccctcccaaatgccatctcccag 
ggcagggagaagccctaagtcctgagtcccctccgagactccaaagacctacctgcctccgctcctccaaacccctctaa 
ccctgattttgccatgacctgagacctcctgcgttaaaggaaggccctgtgtccataaatatttccccacagctgttgga 
tacagggtgggagtttggggttcaggattgccctctcccagtcaggagcaggttggagtttcaggagcactggctgctcc 
cagcgcccatggaggtcctgggcaggaggatgggagttgaacgccatagctggagcacctccttctaatctcactccctg 
ctgtctcctgacccccagctcgggagcagctgcgaaggtcccgccacagccgcacgttcctggtggagtcaggcgtagga 
gagctgtgggcagagatgcaagcaggtagggggaaagggggacggagtgttatccttggcccctaccgggcaccatatac 
tgatggggggaagggcatgtttgcaaagcccgtctcttcctcctccattcgctgtacccaacctggccgtcccctcacag 
ccacccgcacccccaccccactcacagcggcgcccctaactcccactcctccaggggattctccgcggacgctcgggtgg 
agttgcagagcctctggaaccatttcccccccacaccctgcgcccatatgtggtggtctgaggttcaagcacctgaagcc 
cctcacgtccctcccccgaccctgcagacaggccttgggacccggggcagggctggaggccgggcgaggctggagggggc 
gcagggctgagggtgcgaggccgcccacgagtgtgtgcccgcgctcgccgccgcaggagaccgctacgtggtggccgact 
gcggcggaggcaccgtggacctgacggtgcaccagctggagcagccccatggcaccctcaaggagctctacaaggcatct 
ggcgagtagccaggcggcgccccggtacccagcgcgacccgggctccggccccgccactgccccctggcggcccggcgag 
cgctgacgccctcttcgccccctgctccaccccagggggcccttatggcgcggtgggcgtggacctggccttcgagcagc 
tgctgtgccgcatcttcggcgaggacttcatcgccaccttcaaaaggcaacggccggcagcctgggCagatctgaccatc 
gccttcgaggctcgcaagcgcactgctggcccacaccgtgcaggggcgctcaacatctcgctgcccttctccttcattga 
cttctaccgcaagcagcggggccacaacgtggagaccgctctgcgcaggagcaggtgggtcctgagcccgcgggctcagg 
cagggtttgccgacccgggaatgaccgtgcactggagggtcccgggccccaaggaacggcgggggtctgcctgattcatc 
ccacatatacactaagccagcagggcgtcggggtggggcggcggggagcggcgagtgagtgccccagcccagcaggctcc 
acccacggaatccgcagcccgaactggggcaagacagagaatcatagcggggaggcggcaatgcctatctcctcccagcc 
ttctctacacccccaccccgggccctgcgggcccatgctcctcggcttccctgcaccaaagcaaggggaggcccctccca 
ggacctcgtacctggaacctggagcaggccggcaactaaatcctctgagtgagtagggtggagacaagggactaacatcc 
cgcaggtccagtcctccagacaccacgtgcagtcggtgcccaggcacttctgcctggaggcagaggtagagaataaggac 
cacggaccccaaactggggcaagcagctgggccctgaccgatggatatttgcccctttcaccaccaacagcgtgaacttc 
gtgaagtggtcctcacaggggatgctccgaatgtcttgtgaagccatgaacgagctctttcagcccaccgtcagcgggat 
catccagcacataggtgagcacctgagcttggtcccccacccgcccctacatgaacaaacagatgcagaataattccccc 
ctatcagtgcctagatacctccacacatccatacactgtgatgagacctagaatcatctagaacacctgcgggatgaagt 
gcagtggtgattaagagctagagggttgatatgtagtcttgccaaggcaaaaaacttctggtgcctcagtttccccattt 
ataaaatggggtgatagtattgggttctcaaaacgttatcacagggataaaatgagctgaagtacctagagtgagcacaa 
tgtcttgcacacaatgtctaggtgtttaatacgtgtaaaatgcatatccttatctctcgtcctccacgtcgtggtgggag 
agaagtggggagcgtgagtgttggggaggcgaagccctcgaggactcccgtgagctctcaaagaaagtgctcaaatggct 
actttctagtcgccaggtaggcacaggctagggaggggaggcgccggtggccgcctagtggtggcctcagtggctctctc 
tcccccgccccttctcctctgcccccttcacccgcgtccccccgtcctgtcccgcagaggccctgctggcacggccggag 
gtgcagggtgtgaagctgctgttcctagtgggcggcttcgccgagtcagcggtgctgcagcacgcggtgcaggcggcgct 
gggcgcccgcggtctgcgtgtcgtggtcccgcacgacgtgggcctcaccatcctcaaaggcgcggtgctgttcggccagg 
cgccgggcgtggtgcgggtccgccgctcgccgctcacctatggcgtgggcgtgctcaaccgctttgtgcctgggcgccac 
ccgcccgaaaagctgctggttcgcgacggccgccgctggtgcaccgacgtcttcgagcgcttcgtggccgccgagcagtc 
ggtggccctgggcgaggaggtgcggcgcagctactgcccggcgcgtcccggccagcggcgcgtactcatcaacctgtact 
gctgcgcggcagaggatgcgcgcttcatcaccgaccccggcgtgcgcaaatgtggcgcgctcagcctcgagcttgagccc 
gccgactgcggccaggacaccgccggcgcgcctcccggccgccgcgagatccgcgccgccatgcagtttggcgacaccga 
aattaaggtcaccgccgtcgacgtcagcaccaatcgctccgtgcgcgcgtccatcgacttcctttccaactgagggcgcg 
ccggcgcggtgccagcgccgtctgcccggccccgccctctttcggttcaggggcctgcggagcgggttggggcgggggaa 
acgatagttctgcagtctgcgcctttccacgccctccagccccgggggagataaggtcatgggagagtgggtggggacac 
acccagagactggctttgggattgggcactggtccgctgactgccaggctgaagggacccgccaaggactgaacgggtaa 
gagaagaggtttgcaagacagagcgcgcagcccggcaaggggcatgtgaccccgaaggaagaacgcaacagaagagtcct 
ggtctgaacttggccgagtaggggtgggggtgggatggcaggaggagccgcaggaggaaggaggttgtgcagggtctgga 
cctgcagggctgaagttcactcatcgaccgactcagccccaaccgggagccaggcagaaaaaccctgtgccgtaggaaag 
tgactggaagtggactccagagggacaggtgtggtggcacagtcctggtgtggtgctgaccacccaaatatgactgtgaa 
ttgtggaaagggcagtagatctctaatgtggaggtgggaacattattgtggtggaggcaattatgagggtagcatttctt 
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tcgagacaaaacacccgtctgggaaggccccaaggtcagcttatgaaggaccccacttgcaccccaccccagccatggaa 
gagcagctggagggtggatggggaggccagagggagcaatgaggggtggtcccagctctgctattgactcggtatgcctt 
taggacattctcttaccgctcatgggcctcagtttcctaaagtgtgaaatgtcaggcacttccctctaactggcacgcaa 
cagccccacctgcctgagagccctgaggtgacaataaaacatttatgctcaaggggaagccacagcctgctgatatggcg 
tggagaccctaatagtgggaggaatgcaagggttcccggtgctagagagagaagggagaaagctttcagctgtgcatagg 
gaactgaccagaagggggcgctgctgtctcccatcaagcatcccaaacaactccactgcttaagacctctctggcctaca 
catgaggtccctctctcctcattcaaattaactgtcttggaagccagcttctggcctaaaatgccaccacctgtgcatac 
ctcttgtggggctaggtgctataataccacgcggtgcccctgcctcctgagtgagtctiacccaagtctttccctggccca 
tctgcaaaggagtaggcattaccccaaccccagagaacaaaaatccacctggcctccggtatccactggaagtttatttc 
tttagggttctatcccaaccagtcgctcaaaaaccaagtaacacagacctgaggggtgggggctggggactgcacctccc 
tcctactcatggtggacagcagtggggactagggaggggcaggagaggtggctgaagcaaggcagcagtaatggggccac 
gacgccacagagccagctccgtcctctcccagaccctggtgggagtccctgtggcttggggtggggagtgggggacccac 
cccaggccctccctctcccttcctcagacagcctccttccgggctcaacccatttcttccggcaggagactgaggcacac 
agagaggaggaagtgggagaggaggacgagggaggggcagggtggcagcacaaatgaaggcagaggtgagaggcgtgggc 
aaggccactccacccccacacccaccccagagaggggcgaggaagccacaccatcacgcagcatgtcggggggacaaggc 
ggggtttaaggctgaggggcccggggcaggcggggcctcgggcctcagtcaaagccgtgccagtcgctgtgctctgagtc 
gtattccagctcggcgcccacacacttgacaccatccagcagcatgggcgtgccgtggcgccggtctgcaaggcagggtg 
caagtcagtgccatgctggcccccggccccacccatgcggcccactaaggggacccctccccttccctcagggatcagct 
ggaggtagggacctgccaaggaggttgagaacccccgagccgggcaaggatcccttgttcagccttggttccctgaggag 
gacagaaaccctcgcagtcgagctcgtgcatccctcctccaaccaggagcctcacgctctcacccatgacgcgggcctgc 
accgtcacgcgcacacaggcgccagcgccaccctcgcaggctagcagtatctcctctttgaggacgccctctttgtgcgc 
cgagcactcacacttgacactataacctgcccggggacaggggcaattggtcagcacctcccccagcctccccgatcctg 
cccctggcactcacgggcccctacccaccaccatggggggactgcatgctaagcccccccaagaaaggggcagggatggc 
ccttctggagcccagagacccattccccctagcaggggtgcacaggtctcaaaaacccattcttcagtgagctggacatg 
cctccagcctgtgggccaccccaatgtggctccaagagtgaatgaagtagggtccaattcaggctcccaaaagaaggtct 
ggctgttttctcccaaaaggaaggcagggagaggcggtgatgaggagtgaggggggcagggcagggcaggctttgagcag 
atccgatggcaagaggtaaaggcctgaggggtgtcaacccatcgaagggcaaggtcatgaagaggctggtgacggggaca 
tgaaactgaagctggagctccacgaaactgacaccctagcctctgccaggctgggagtggggcatggggcagggcctgaa 
gtgagcctgataggaacagcagcccgaaacccaaggtctgggactggtgggcgctaggagttatccacacggtgtgtgta 
gctccagcaggtttttctagagggttagggaggcaggagggaaggctggaggcttcaaaccagttcctcagcagctccca 
tcttggttactgccccacggaggtaaccatcacaccatgggcaggcacagggaagtatgagctcatggacttctgttcag 
gacagggagcaaggcctggagtgtgggacctgccatgctgccacagtgcaagctcacaaagaggtgccacatccccgact 
tgctgagctgcccatccaccctagttggaagtaagggagcaccccatgttctcactcccacacccatccaggccctgctg 
gaggaggggacgcaccttcagggacgggcaccacgctgaggagcttgaggtgcaggctggggacaggtgcctcgcggaca 
tccttgctcagcctgtgcactgggggcagagtgaaggtaatctcatacctgtgcaggatcttcaggaagccaacctacag 
caggagaggtgagggagggaggcaccttcacttcctgctccccacaggcccacccagttgtgccacctatgatcccatgc 
gccccagcacctgcttccccagagagcccagaaagcccaactccctccatcactccagctgatcgacccccagccctttc 
acagaccccttgagggtcccagccctacagcttggccagcaagatcctcagcatctgtctcacgggccccaaagtcctca 
gtggctcatttcatagacaggaaaatggaaaccacaaaatgactggccaaaggctatgtagtgtggccagggtcaagtcc 
cccctagacttcttggccattcctgctatgcatggcacccagcaaggcaccacccactgcctacaagggaaagttcaaag 
tcccatgggtcattcacctgtccccatcactgccagtatcccagggtcagggatgctggtcctccattcgctcataggat 
gacaccaggccacagatagcgtgggatgagggatgggatcaaattagagatatgacagtagaccagctgtgtcagcccag 
agctggcccacagacctggcaccagccactgaactggggctcctggctgccctgccggcactggactggggatgcctgcc 
tgcccactcaaaggctgtgtccaacggcccatactcacgacccaagcaatttctgttgttcattcacagtgcttggtgac 
aaccaccctctgggacaaatgccattcttggaaactccagtagtatgaagggtcacaagccagggtggttgctgagcagg 
ggggctggtgggggtgccacgccagcaggcaaagggtgcctgctgtactcccagcttgcatgggcacacagagtcctgct 
cacagttactggggctgggcagccccatccctgggggccaactgggactggctgcagagagttttagccattcaatggga 
ccaggttgatattgctacatgacaaacttcaatccatgcgcccccctcatgctctgctggcagggggcacatccctcaag 
aaggctttctggctgtacctgctaactgtgaccctgtgacccaggggtactacttatagaattttccaggggaaatagag 
atgtgcagaaaaatactcttcgctgccctatttataacaatgaaaacaaaaacagccgtaacacccaggaacagggagcc 
agctaagtcaatgatgacacatatcatggaccagtaggcagttgtaaaatctcgtggaaaattatttactgacacaggta 
ggagacagttcacaacagaagcaagcaaaaaccagtgtggacacggtgaggccagcttttgtttttggaaacaaagcttt 
ttaaaataagctggatccatcaccacagtgttttcagaaaaacaaataaataaataaataacattttaaaaagctggaat 
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aagccaggtgtggtggctcacgcctgtaatcccagcactttgggaggccgacacggacggatcacgaggtcaggagatcg 

agaccatcctggccaacagagtgaaaccccgtctctactaaaaatataaaaaattagctgagagtggtggcacgggcctg 

taatcccagctactctggaggctgaggcaggagaattgcttgaacctgggagctggaggttgcagtgagctgagatggcg 

ccactgtactccagcctggtgacagcaagactccgtctcaaaaagaaaaaaaaaagctgaaataacacacatcagaatat 

taaccatggttattttgggatagtatgactgaccgtacgctgaagaggatgcgtaccgactgcgccttccctctttgctg 

gtggatatgaaaactggaatggacattgctatgacgggatggagctttcttacaaagctgagcatgagtcttaccataca 

atctagcaatttcactcctagatgactacccaagtgaagtgaaaacttatatccagggccgagcacagtggctcacgcct 

gtaatcccagcactttgggaggccgacacggacggatcacgaggtcaggagatcgagaccatcctggccaacagagtgaa 

accccgtctctactaaaaatataaaaaattagctgagagtggtggcacgcgcctgtaatcccagctactctggaggctga 

ggcaggagaattgcttgaacctgggagctggaggttgcagtgagctgagatggcgccactgtactccagcctggtgacag 

caagactccgtctcaaaaagaaaaaaaaaagctgaaataacacacatcagaatattaaccatggttattttgggatagta 

tgactgtgggtacttttaaatttcttcatattttctctgtcttccaaattttctcatgtctattaagacatgagtggatt 

ttgcaatgaggggagagagctacttttaagtgttggtttgttccatttactctcttggggaagctgtcagctgtaggaca 

aagagtgggaacttcagagttggacagaaatggatacaaacctgggccccccaggttcttttttgtttgtttgtttgaga 

cagagtctttctgttgcccaggctggagtgcagtggcacaatctcagctcactgcaacctcagcctcctgggttcaagca 

attcttctgcctcagcctcccaagtagctgggactacaggcgtgcgccaccacgctcagctaacttttgtatttttagta 

gagacgggtttcgacccattggccaggctaaacctgggccccttttgagcagcgcagcagaccccccactcagggccatc 

tcaatgggccagacctcccccgacccaagggcactcctgttcaactctggacccctgatttattgaccaacctgaaggcc 

tcagtctccattttccccaatccagctccaccaagcagaaaacagggggtgataatactacttcaaagagcctaactgag 

ccagagatttggcaaagaagggttaaaaaaaaagttgcaccttgttattgccatagttagcttcacacctgtatcacata 

catacattcttccattctcacaccaactctaggggatgtgattgtctccacttigagagaaaagaaactcaggtgaccttc 

ccaaggtcacagagccagaatggctggcctagacctgaactcaggccact.ggcaccacagccacgacact:gccttccatt 

gccattgtctgggtgaagcagatgacagcagtggctctgctcatcttgatctggatgacattictaaatgctctcatctca 

ttgaaatctcactgcctcccatctgacagatggggcaaccaaggcacagggaaaggcacagacatgcctataagcccggc 

tgagaggaacggggataggcacccagaagccaggcagagccggggaggggaagaagctgctgatggggtaggtgtgcatg 

taccttgaccagaaagctgctgtcactctcctgggtgaccatgaccaccgagtcatgcagcttctcatcaaagtggacgt 

ggctgtgggatccttctgcatcgtggcctgccgcaaagcggatactccggactctgggcttgttgcctaggaagagtggg 

agagggctctgagggccttccctgtccccaggaaactccccccccttgtccccccgtcacccccaagactgcccttgaca 

tcatccagctcccactggctgggctcttttcagggctagatggacactaggatcatgaggcttgaggcctccccgccggc 

cccgacctgcccctcccacaaaaatccatcccctgagagtgagcgtggggggactcagggactggctaatatgacccttg 

cttggagggggtggggctggtgccagagccaggaagggacagtcttccagactcccaccaagccagggcagcgtgggact: 

cagcccagtcctctggataccctgcccagtgctctccttcacagtgcaaagctccccatccctggggcctacccctaagc 

tgtgtcagtgcatgaggcccctgcctgcccatcctcatgacccaaacctgaaagggcagggacaggaacaggctctgggg 

gacctgcctggggtggcagaaacaggggggatcaaaaaacacactcaggccacctgggcgaggctgcagctgccactaaa 

cctcactggcccctggcagcaagaagagagaatccaaggaagcttccagacccacctcaagacccccttccttcctctgt 

ctagcccaccatcaaggccctcagctgtctctgagctgggtcgggttctccaaggccagaaagggcagacggatgagaca 

ggcagggtaggcgggagccgagctgaaggcctggactggatgaatgctcaggctgtagaggccgagagcacggccctagt 

cctctctgacccctggcccctaggccctcccaccaagagccctgcccagggtcctcttggcgggaggacctgatccagct 

ttgagccggcctcttggatactcggaggcccaggggaccagcctggcacatccacaatccctggtgagccccacagaacc 

ctcaaatctcaccagatcaccctggcctaaaaccctcccttatccctcccggactctgaaagaaacccaaactccccccc 

acagcctgctgtggccctcccacatcaccagtctcagtgctcaccataagcccctcggctcccccaacccccacagtgga 

acggcagggacacaagagcacacacctccacagaccagcctaaggagtcagcagggctctgcaggggtcctgctagggct 

tgcgtgagccccagaacctggcacaaacactgacatagtcccaatcacattcgcacaccagcacacacgcactcacacat 

gcacacacacacacaggtggtgcgggccctacctgcagcaatggccctcctaccagggcccaccggaagagcttgtgcaa 

gtcctaccacgccaggaaggcacttacccttgttggctgcagccatggacgccctccctgccacgcagctcctgccagac 

accgccactcacgctcagcagcctcccatgctccagggacaccagcgggagcctgaaggatagggagtggggagggcagg 

ggtcagggccacccatggacccatggctcagggacaccagaggagctccctttaggaaggactcaaacccctaccgctct 

cagcagcccaagtggtgcctgtactctctagcacgggggcttctgcccaccctctaccgactgcccacattcaccaagag 

ccctcaccccttcctgataccccagtgactcagggtcctgccagcatgtgctgagctgcagcttctaggtgccaaagcaa 

cagcataggactcctatccccagcacccccagagactggcaggaggggcagcctggaaaggaggactttattggtatttt 

ccagggaccagttctggtgctgctgaccccaaaggctgggcctggagctaccttattcaggtcacacaagcaatgcagct 

gggtgcggtcagaggcaacagggtgctcgatggtgggcaagaacaggggaccacataaagtaagcgtgcccttagagctt 

tccctcctggtgatcggtcagggccatatgcaaaacagatgcctgtggagggggtgtgcccccacctgaaccccattcag 
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accctgccctgagtctcaggcctgcctccgcctcagctccccaacggcagcctcagcacaggggcagtgagaggcacccc 

aggaagcctccaatgggccgagctgggaccatctgagcatcaaaaagaataatgagtgcaactgatggaaacagatggaa 

cacataaaagcagtgcattcacagagattattagaaaagaagagagaaagcaaaacaaacaaacccaagccctcaaaagc 

cctcatttgtcaccactgacggtagcagtgccctctttactctgaaagctgaggattaaagaggaagaattcagcctgtc 

tcttggcctttcttgtaaccaaattgccctggtggttgatggaaagcccttccttcgggaagaattcttgctaataaata 

acaaagaaatgaccaaatgctaagccactctgcaacccctaaaggaacaaatgccggaggcaacaagaaccagtggatgc 

taaaccagaggggaaggttgacaggaagcagatactcacaggcgcccaagcgtcactgacagaggacctggtatggtttt 

tgttttttttttttttctgagacggagtttcactcttgtcacccaggctgggatgcaatggcacgatctcggctaactgc 

aacaCcCgcctcctgggttcgagcgactctcccgcctcagcatcccgagtagctgggactacaggcgcccgccaccacgc 

ctagctaatttttgtatttttagtagagacagggtttcaccatgttggccaggctggtctcgaactcctgaccccaagca 

atctgctcacttcggcctcccaaagtgttgggattacaggtgtgagccaccatgcctggccgaggacctggtgtttaaga 

ggaatggcacaagtttgcaatggaggtatgtgacttctcccagtcacacactggtccacctctccactccctgcaggaca 

gcctgtcacactgacgcctcctgaagtaaggcagcccaaagtcacagcatcactgacaagggattcctgccaaaaaggtt 

taacctggaatctaaaccagcctctaacttcaacttctcatttgctaaaaacacagaggagaggggaacaaatttaatga 

ctccatgagaaagcaatgagacgagcccagaaggcaggatattctacaggatgactgacctgttttttgtttttgttttt 

gttttcaaatgtcaagaagaaaaagaaagcaggctgggaacggtggctcacgcctgtaatcccagcactttcacggatca 

caaggtaaggagtccaagaccagcctggccaacatagtgaaacccagtctctaccaaaagtacaaagattagccgggtgt 

ggtggcgggcagctgtagtcccagctagttgggaggttgaggcaggagaatcacttgaatccagaaggcggaggttgcag 

tgagccaagatcgcgccacggcacttcagcctgggtgacacagcgagactctgtctcaagaaaaaaaaaaaagaaagaag 

aaaaataaaacgagggagattattctagataggaaagcagcaaactacagctgattacacagctgatggtaaggttgctg 

cctgttgtgtttttgtttttgttttcgagacagggtctcactctgcccagcccaggctggagtccagctcactgccaccg 

taatctcccaggctcaagcaatcctcccaccccagcctgccaagtagctgagaccacagccacgcaccaccatgcccggc 

taatttttgtagagacagagttgcccaggctggtctcaaactccggagcttaagcgatccacccacctcagcctcccaaa 

gtgctgggattacagacgtgagccgccaggcctggcaaataaatttttgcaaataaagttttactaatagaacactacac 

ttcattcacttacatgttgtctgtttttgcactaaaacaaagttacctccaagtagttatgatagagaccatatggcctg 

caaagtctaaaccatctaccatttggcccttcagagcaaaagtttgccaacccttgttctagatcaacacacacttaaca 

gaccaaaaaaggataatcttcaggacaactgacctgtttttttcaccacatcagctgcaagaagaaaaataaaggagagg 

gtgtttcgaactacctatcaaagaagacattttggggggtgccgggcgctgcggctcacacctgtaatcccagcactttg 

ggaagccaaggcgggtagatcacctgaggtcaggagtttgggaccagcctggccaacatggcgaaaccccatctctacta 

aaaatacaaaaatcagctaggcatggtggtgtgcacctgtaatcccagctactcaggaggctgaggcaggagaatcgctt 

gaacctgggaggcggaggttgcagtgagccaagatcccgccactgcactccagcctgggtgacagagcgagactccgtct 

caaaaaaaaaaaaaaaaaaaaggccgggtgaggtggctcacgcctgtaatcacagcactgtgggaggctgaggcgggcgg 

atcacctgaggtcgggagttcaagaccagcctggccaacatggagaaaccctgtctctactaaaaaatacaaaattagcc 

aggcgtggtggcgcatgcctgtaatccagctactcgggaggctgaggcaggagaatcgcttgaacccaggaggcggaggc 

tgcggtgagccgagattgcgccactgcactctagcctgggcaaaaggagcaaaactccgtctcaaaaaaaaaaaaaaaaa 

agacatttgggggatctggaaaatgtaaatggcctggatatgaggtaacactaagaaatgagttgatcatggaattgtat 

tatgtaaagaattatgtccttattttgtcagggaaacatactgaacaacttagagatgagtgacactggctcatcagtcc 

agggacagaagaggaactacagggtggcctcatagtttagggctgggcctgtggagggcatgatctcaaggtccctctgg 

tgccaagctgctctgggagcagatgtggcctcacctcgccccccacttctcaatgtgggcaaagtccacccaggcccaag 

ccctgcctcatgggcagggtgttcacagtttgggtaccaccaggagccccctttggccaatggaccaggccagcaacccc 

ctcttggaggcgagcttccaggccccagcccagggtgcaggaagtggagctacacccattcacctctggccagggccact 

gtggggtcagttgcctcagccctggaaagctcagcttgtccccagggaacttggtcctggagagcagcctgccgtaaccg 

ccccccgactacagctgcccccaaagctgtgaactcagagtatatgacaaatggccaagcacaagagtgtgaggccctcc 

tatcctgcagaggccgccccagccacacctctgcagccaggaggcagtgacagcccagtgtcctcaaggtccccccatct 

cccaggatgcttcctgaaggctcaggttaaactgcaccagcggctggtgagcaaaggctgcaggcccttttcacaaaaca 

ctctgaactggtccatgcagggcaccctccctgtcttcctcccaacctactgcctgaccgtccccccgccctgcctttgc 

ccaggtcatcccttcagccaggaacaatcccgtccctcccagccacaccttcgacagggtctcagccccaacatttcttt 

tttaaggatgcacaccctgcaaatcccaggacaggactacagctctttttggaatcccctttctcatgtctgggatcatt 

tgatatccctcttcctcaccttacaagggtagtgactgtgttttcgtcaaaccccgcgtccccggcactcagcgcagcac 

cagacagggaggagctgtgtttggcttatgtttgttgaatgaatgaccacatcacatttgcttggagggcctggccaagg 

cccgtacttcagccctaaatgatatggtcagggccagtgactgcactgggggctctgaggaagcagaactccccacccca 

ctcctttgcctcacccctgccctggggggcactcccctctccctggcccccaccaagtggcctcccactgggactcctca 

ggccttgctgcagtcagctggttacctgtccatgcctgcttcgagagggacatgccccatgcccactgagggtaggaccc 



FIG. 7.45 



WO 02/083077 



53/157 



PCT/US02/12063 



acatctcatgcagtgccagtactcagtaaagggcaaacactgagggcctgtaaccctctggatagtgacaacatagaggc 

aggaagcaagggacttcaggaacccaaaggaaactgggaaaaccaaacctcctttctcaatggagaacctgctggtcgct 

gtcctcggggagctagactcttgtgcacacacaattcctcatcagaaacaggcaaagagggcactgggcctcccctgctg 

cagctgtgtccacagagaacagctaggccaccacacaaccccaagcctggccctgcctccagcctcctcctgtgccaacc 

aaggcctcaccatggctcaatccacatggcccccagaggcagcagtcctggttcaaattcaggatctgcccttgctagct 

gcatgggtgagttctttcacttctctgtccctgtttcctaatgtgtaacacaggaataagcagtggcctcttccttccag 

ggtttgctgcaactgtgcctgaagctctgtgacaatgcaccccccatctctgcaaaagcaaacccccaaaggcctggttt 

ctaaagcaacagcagtttcagcagcaccgtcagagaggcacttcaggccaatcctggaggagccaggagtgacccttaga 

gtgggccccagggacgtcagccttcttggaaaacagctcaaggggtgagggggcctccctcctcctgcctcccccttctc 

ccactcccaaagcagccaggtccctagggagggtcagagaacagatgctgggagtttccagtccccctaaccagaggggg 

tcacaaggaagatgtgcagaatgaacatcctgggaaactgggaaatgactagggaggaacatggtgcctcccccccagca 

aaaaaaattatacccttccccatgagatggagtgtcagcaagcttccaggccccagcccagggtgcaggaagaggagcta 

cacccattcacctctggtcagcatgctcccaacgtctgagacctcatttctcattccttcttcagtccccattctcattg 

tggttcgaggtctttccctctgagccaggaactccgcaaccttccaccctccctacctcttcctctcctaccccagcctg 

gggctcagtcctggctcaagcactcagtccagcagaagcactgtgtagcctcccattaaagctcacgcctgtgaaaagaa 

cacccattgaggccttgagatggggccacactgacccgctgactctcaggactggacacagcagaggccacacatactca 

gaacaaagcctggaaaggcaaggctggaggtcagtagttgtggcagcttcacatcaactcagctttaatgtgatttaatt 

tccttctccctccagtgggccaaaggtgcaaagataagtatggctgttctctctccttctaacagtgaggtgctgggggt 

gaSSStgggsgaatatggagaagggaccctcaccacccacaccttcctgcct'Ccccaacaagtgctgccctcctctgccc 

agcattctccccactttgccctcagctagtgggtgcttagcctccagatagcatgccccacctaggccctgccctgggcc 

tgtgatccagaggtcccaagaagcagaggccaggctggatccagggggtcagccaaggtgagggtgggagcacacaggat 

tatctcccagggacagggctgctgcctcgtagctcaggatggatagaatgtggggggatatccagctacattttccctcc 

acaaaagaccagaatgggagggggatggggtgctgccccgactttcttcaactccccggagcagaaaaatgccctacctc 

cactttccagtgccaagattcaagaagaaaggcaagcggagacttccctttctcagtccctgcttactaatggaaacacg 

ggtccagaacctaaatccagtccctcctccttcataccaccgggagggaggtgcagcccaagcccccgaggccccaaggg 

tccaggtgtaggaccctttatcctccccggcagccatccctgtgggtgtggcacccccgccacaccccattcttgtcatc 

tcagggggaggggggaaatgtaatcggacatcccccccatccaatccatcctgagctgcgaggcggcggctctgtccctc 

ggagacaatcctgtgcactccccaccttcactcacctcggctgacgcaggagtctccggagcccgcactcccagacatca 

ctgccctccttcctgagggtgctgaggctcggaggcccagagatgctactggtccaaggtcatgcagcgaggcagcggca 

atgaacagggtcgtggggaggagggggcgctgaccccattacgcccccgccctcactaccgcactcatgcccccggacaa 

tcgcttcgcggaaaacaccccagctgccaccagttaacggtcactgcgccccgggaacctgacatcactctagttcaggt 

gctggggagccttccaggcccggcccccaacagcggagcccccccaccccagccctgccatcacggccgctggggtccca 

ggcactgactgctcaggacagggccgggaccggggagggggacctcggcgaacgggaaggaaccgggaggcagggagcag 

aggggtggcctcaccttgcggggtgcaccccggggccggggagggcagggacgaccactgcagcggcggcggctgcagga 

gctcaacgccgagcacgaggaagggagccccgcgccgcggccgccctcccgtcggcacgcccccgcctccgcccattggt 

tgatctgggagggtggggcgagggacgctccggaccaatgagcgggctccaaagaacggccaactggcgagggccgccta 

cgtcacgtgccagggtcgccgaggcagcgccctgctagtccgcgcctgccgggcgagctctcgcgaggaagacgggcagg 

cggcccaactaggccaggggccagaaccgaccactcgaagagggagaaggagggcctcggataggccccgcccccgctcc 

ttcttccgcctgggggatagcgcctctagccttgaaccttgcttaggacgcacctcccctgggcccttcgctctcgggag 

ggctgtcgggcgcgtctcggggctgggtggagctcccgaaggtggcctttctccctgggcttccacgccggcttcggcca 

tcgatacgggccgtgttggcctcgttcaggagctgaggaaccctccatcactcctgtttcgaccccagggtttggacctc 

ttccccttccaccccatcccctgtcttgaaagaagcaacccccgtgcgggcccgagacgcgtcccgggtgcctggccggg 

cctggagaagcatcagaacaaagaaggcacgcgggctgggggctgggagagcctgtgacgcgcccccggggaccgcagcc 

tctgctcccggtctccatggaggcggtcgccatggcagcgagatgcgcctcgctcagcaccgcggggtgggatgtgggcg 

cctgcaatgagccgaggagcgagaggcgtggccctccggtctgcgggggttcttgccggtgctctccgcccgccggttcg 

cgaacaccccacctatacttcgcccgtggggacggattccccaaagtgccctcagtaagccgtccggagcacgcagcgcc 

ttgcttccaacggaactagagagacggcctgggcggccgaaggccagcctcccttcagcagggccggggtcgctgcctta 

aaggagccctcaagtctgccaccctgtggcccataacctgc.ccgctgatctccagt,ctgcacactgttggcaaattaatc 

tttctgagctcttgttttcatcgcgtccctctcctgctccaaagccctctgggactgcctccagtagcgcttcacaaact 

tcagcagcactttgggtgactcatgtgccctcgcgtttgagagacagcgcttacattgagagcttttcacattctgatct 

cagcccatccactcctgcccactctacccactcccaccacactacctttagaccttgtcttgaaagtctaaagttggcag 

acggtgggcccaaagtggcccgtaggtgtattttgtttagctcacatggggccctttaaaagaagtccttccccctcctg 

cctcctccgcctctttctggaatcaggaacctggtttcgtggcttttgaaatatcagaagatctgacaacgtgggcccac 
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tttccagcctggcattacaagcccctttaagagcacataaatttaccatagtccccgctaatccctgtttctcacggtta 

gttaccttctgggtctgtgcagacatttgtgatcccctccccctcattatcccacgttccaaccCggaaggccctccccg 

ctctcttctcatagccaaaattcaagccctcttcaaccaagataagttttgcctatttacctatttatttatttttgaga 

cagagtctcactctgtcgcccaggcttgagttcagtggtgccatctcggctgactacagcctccaccccctcccccgccg 

ggttcaagccattctcctgcctcagcctcccaagtagctgggattctaggtgcgcgccaccacacccagggctaatcctt 

gtatttttagtggagacagggtttcaccatgttggtcaggtgggtcccaaactcctagcctcaagtgatctgcctgcctc 

agcctcccaaagtgctgggattacaggcatgagccaccgcgcctggcccaagcctgcctttttcaaaagtctttctagcc 

aacttctcaaaaccatctctggtgtgggctctctcaaacacaaactggctgctaactgcacagcccggggttatttccga 

attgtcaactcctttcagcaaacatttactgaatgtatctgatgagtaagttattgtgctaggctctgtggagtggaagg 

cactcacagtttagtgtaggagggacaccacacacaaagaactgtggtctcctgtccttgccaaatgttcctcacactcc 

caggcatcgggcagtctggcccactgggaggagctgaaatacagagctgcccatgcctagggatactacatgctctcagg 

gaaggccaaagatggttctggaaagcctccaggagaaaattagactctttgtgtcaggaggaagtcagggcaggagttac 

ctggctgccccatcactgcccaatgtgtctgtgttaccacacgagtagtgcccaggctctggggcttcctgtgcggaggg 

ccttaccaaagatagatggctctgcaagggaaaccacaggcccctaccaggccccctggagacagaggcaggacaagaag 

aatcaggaacaactccaggttaggagaacagatggagcacgtataaaacatccactcacggccgggcgtggtggctcacg 

cctgtaatcccagcactttgggaggctgagatgggcggatcatgaggtcaggagatcaagaccatcctggctaacacggt 

gaaaccccgtctctactaaaaatacaaaaaattagccgggcatggtggctgatgcctgtaatcccaactactccggaagc 

tgaggcaggagaatggcgtgaaccccgggaggcggagcttgcagtgagcccgagattgcgccactgcacttcagctttgg 

gcgactgagccagactccatctccaaaaaaaaaaaaaaaaaaatccactcaccatgagaacctggtgacttttaaaaaaa 

aacaaaaaaacacaacacttgctacaactccaaacacacggttttttgttttt ttttttatttttcagagacagggtctc 

actaCgttgcccaggccggtctcaaactctcggcctcaagcgatcccccgaccttgggctcctaatgtgctgggaccaga 

ccagaggcacaagccactgtgcccagccctctattctttttaattaaaaattaaacaactataacctcattggaaaactg 

cttggcagtacctaataaaaatgataaatgaaatgagtaattccactttttttttttttttgagacagagtcttactctg 

tcacccaggctggaatgcagtggtgcgatctcagctctctacaacctctgcctcctgggctcaagcactcctcccacctc 

agcctccccagtagccaggact.acaggcacatgccaccacacccagcc.aatttttgtattttttct:ttttttt:t.gagatt 

acaggcatgtgccaccacgccggctaattttgtatttttagtagagacgggttcctccatgttggtcacgctagtctcga 

actcccaacctcaggtaaactgcctgcctcagcctcccaaagtgctgggattacaggcatgagccaccgtgcctggccaa 

tttttttgtatttttaatagagacggggtttcaccatgttggccaggctggtctcgaactcctgacctcaggtgaccgac 

ccacctcagactcccaaagtgctgggattacaagcttgagccaccatgcccagccatggcatgtgtttaattttaaaaga 

aactggcaaattgttttccaaaatgactgtacttacttacactcccattagcagagtataagagtttctgttgcttcaca 

tcctcatcaacataaaaatttgcaattttgtaaaaaacgctacaccatgctaaccaaacaaagcagggcaacggactacc 

agtcttcagcctctcctctgagccatgcccttgtagaggacaatttcacaacagctatcaagattacaaatgcacacaca 

aaaaaagagaaaacacacacacacacacacacacacacacacacagccaacatggtgaaaccccatctctactaaaaata 

caaaaaattagctgggcgtggtggcaggtgcctgtaatgccagctacccaggaggttgaggcaggagaatcgcttgaacc 

cgggaggcagaggatgcagtgaggcaagatagtgtcattgcactccagcctgggccacagcgcgagactctgactcaaaa 

aagaaaaaagagagaggaaaaaaaacacaacaaagattacagatgcatatattctttgatctaggaaccccagttctgga 

atttatcatttagatatacttccccatgatactaaatcatgagtgtatgaggttatttattgttgcattcttgtgatagc 

aaaagattggaaataactcaaatgtccataaatagaagacttcttaaataaatttggtacagacaatataatgttatatg 

gctttaaaaagaaaaaaaaaaaagaggaagttttcaatgtcctcccaagtagccaggattacaggcgtgagccaccatgc 

cgggataatttttgtatttttagtagagatggggttttgccatgttatacaagctggtctcgaactcctggcctcaagcg 

atccacccacctcagcctcccaaaatgctaggattacaggcatgagctaccgtgcccagcctgctaaggaaagactttgt 

attagctaggattctccagagaaacagaactaggaaagagggcagataggaaagactgacatattataaggaattgactc 

acacaattatggagtctgaccagtcccaagagctgcaggttgggttggcaagttggagacccaggagagccaatggttta 

gttccagtctgagtcccaatgcctgagagccaggaaagtcaatgatgtatctccagtccaaagggcagcaggtttgagac 

caaggaagaatcaatgtttcagtttgatcccaaaggcgagaaaaaagctgatgttccagttcaaagaccaccaggcaata 

aaaattctctcttatttggggagggccagcctttttcttctatttaggtctcaactgactggatgtggcccactcacatt 

agagagggaaatctgtttcactcatctgctgatttctttcttttttccttttgtttttgaaatggattctcactctgttg 

cccaggctggagtgccgtggcatgatcttggctccctgcaacctccacctcccgggttcaagcgattctcctgcctcagc 

ttcctgagtagctggggttacaggcaatcaccaccacgcccggctaatttttgtatttttagtagagacggggtttcgcc 

atgttgaccaggctggtcttgaatgcctgacctcaggtgattagcccgcctcggcctcccaaagtgctgggattacaggc 

gtgagccaccgtgcccggcctcatctgctgatttaaatgttaaccttagccgggtgcggtggtttacccctgttttccca 

gcactttgggaggccgaggcgggcggctcactcgagctcaggtgttcgggtcaacatggtgaaacctgcctctactaaat 

acataaaaattagctgggcatggtggcatgagcctgtagtcccagctactcaggaggccgaggcacgagaatcacttgaa 
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cccaggaggtggagggtgcagttaactgaaattgtgccactacactccagccagggtgacaaaacaagactcttctctca 

aaaataagagtaagacaaatgtgcatgtttgtttatagtcacataaagtcattccggaaggatacataaaaactaaagat 

agaggttccctattagagatgtgctgggaataagaagacttttcactttttatgtgttttttttaaatcatatatatgtt 

ttatctattcaaaacattaagtaacaaagtttaaatacccttgcatgccttcctctcctgtagggttgtgaacactggag 

ggaacaaatcatgttttatactcacttttgtattcctggtatttagtacatagtaggtactcaataaatagaggaaagaa 

cgaatgaatgaaccagaagatagcagatttagtgtcggttagagagaggagaggggccaggcacagtggctcacacatgt 

aatcccagcattttgggaggctgaggcaagaggctcacttgaacctaggagttggagaccagcctgggtaacaaagcgaa 

atcctgtctctccaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaagcccaggagcagtggctcacacctgtaatcccagca 

ctttgggaggccgaggtggggtggatcacctgaggttagcagttcgagaccaactaggccaacatggtgaaacaccatct 

ctactgaagatacaaaaattagttgggcgtagtggcaggtgcctgtaattccagcaactcaggaggctgaggcaggagaa 

tagcttgaacctgggaggcagaggttgcagtgagccgagatcatgccgctgcactccagcctggatgacagagtgagaca 

tctcataaaaaaaaaaaattagctggatatggtggtatgcacctgtcctcccacctattctagaggctgaggtgtaaaga 

ttgcttgaacttgggaggctggggttgcagtgagccaagattgtgccactgcactccagcctgggcgacagagtaaaacc 

ctgtctctaaaaaagaagggagttagggctgggtgcggtggctcacggctgtaatcccagcactttaggaggccgaggct 

ggtggatcacctgaggtcaggagtttgagaccagcctgaccaatatggtggaaccccgtctctactaaaaatacaaaaag 

tagccgggtgtggtggcgtgtgcctgtaatcccagctacttgggaggctgagacagaggaatcacttgaaccagggaagt 

ggagagtgcagtgagccgagatcataccactgcactccagcctgggtgacagagcaatactctgtctcaaaacaaaaaaa 

aagaaagaaagaaaaattttttaaaaattaaattaaaaataagggagttataatcttatctaatgtaatcacaaaagttg 

acatcctatcacatttgctgtattctacttatcagaagcaagtcactaagtccagcctacattcaaggaaagggtacaca 

tgggggtgacgtccaggaggcagggattactgggagccacgttaggagctgtctactacaaggaggaaaaagcaatgaaa 

ggatgaaatctaggatggtggttatctctgagtggaagcaagggggagagatgcaagaagcacatggccggtgaaagtta 

tttgtaataattccattcttggggcaggtggtggatttataggtgttcataagtgagttgattatggacaaatggggaga 

gagtgtcatgaaccaaggattatatttagtccaattcagtaatatttaaaatattcaataataaatgaaataaaaattgc 

caactactcctgaccttgttctgagccccatccatgcactccacctggtcttagtttaacactggctccacctgaccctc 

cacttgccccaaactaaccgttagcccttcacacacactgacccttgctatagcaccacgatctctccatagtctgcaat 

aggcccccggagttctattccctgaaacttttatccatgtaacctcacctgtaggctgctatatgtaaagatagtaggaa 

tggctgatggagtggcccgaggaccaggagacagggcaggctaacagttctgccttagcagtagcactgtgggcccagaa 

gacaggtttgatggctggtatcacgacctcacctattggcctcctacatggcaaaggctgggccaggtcctgatggttct 

tcctctttgaggcttctgctcctggtatccagatatagctaagtctgtctctgagaagtgcacctctgccacagcaccag 

gcctgtaactcccttgttggtaatagactctatcaactctctcctttcctgtctttgatgaagtaagctaccatactgga 

gatggctatgtgtcaaggaaatgagagtggctattggccaacaatgagtaaaacactgaagctttcagtttgacagccca 

ctaggaagtaaattcagtcagcaaccaagtaagcctggaagcctggatccttcctcagttgtgccttcagatgagactgc 

agttctattggccctttgcttgcagccttgtaggaaacattgaagtggaggacccagccaagcctggcctggactcttga 

cccacagaaaatgtgagatataataataagtatttgttgttctaagctgctacattggtgataatttgctacacagtaag 

agataatgtatacaggttcttttcatttcttttgctattacaaagagtgtttctgtgggccaggcatggtggctcacgcc 

tgtaatcccagcacttttggaggccaaggcaggtggatcacttgaggtcaggagttcgaggccagcctagccaacatggt 

gaaacctgtctctaccaaaaatataaaaaattagccagatgtggtggcatgcacctgtaatctcacatacttgagaggct 

caggcaggagaatcgcttgaactcaggaggtggaggttgcagtgagccgagatcgtgcaattgcactccagcctggacga 

cagagcaagactccattttaaaaaaaacaaaaccaaacaaaaagagtgttgctatgaacgttgttgtaacatgtctccca 

gaacacagaacacaagagcctaggtttcccttgggtatgtgctgggtatatgcctaggaatggaattgttagatctcagg 

atgtttgttttgttttgagacagagtgtccttctgttgcctaggctggagtgcagtggtgcgatcttggctcactgcaac 

ctctgccccccagcttcaagcaattctcctgcctcagcttcccaagtagctgggactacaggtgcccgccaccacgcctg 

gctaattgtccgtatttttaatagagatgggttttcaccattggccaggctggtcttgaactcctggcctcaagtgaccc 

acctgccttggcctcccaaagtggtgggattacaggcctgagccaccatgcccagctggatctcaggatttacaaatgtt 

ccacattactggtagtcagaaaaccattaaagttagggtcatgccatggctgtagtgtgttggccagggttgggagtaaa 

gcctggcctcaatccatggaagttgggattgtctgtgggcagggttggagtcgtacagagggccgagttaggggtcagtc 

cttacgtagggttgagggtgagtgtgtggccacatttgggaacaatttgaagtctggtttggggttgggtttggacaagg 

ttggagaacactgtgagggtgggtttgggggtcaatctaaggccagggttagaagttaggctgtcatcctgtgcaaaacc 

tgacccctggtgatatcatcaacctaccaagctgtggccgcacaggacccagccacccacaagatgagatccactctggg 

tccagaaagctctttcactcaaggcgggggtggggagatggaggacaagtgaagaagaaactggctccccccagatgcaa 

aaagagaaggcaggcacaggaaatagaacacaacactgactttaatggggcagccctgagccgtaaccttcaaaccttcc 

gcctcagtaccccgtgcgcgaggagggaggggcgactgctacgggcacatcgtcgatgtcctccctactcctggcgatcc 

cacgcctcaccccttcctctccagctgctgcggtctccgtcgccgaggtgggtcccaggtaagctcccaggaggggcagg 
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ggagcccctcagcggcccgggacagattcccccatggctgagggcatggggaactagcctggatggagacgccgccgtcc 

tcggagctgggcggggacctgattctgggggtgtggacagagccaagggaccgccccccaaggcccagcgccgggagatg 

caaggccgggcccaaggtgttggacactgctttgggggacaggctaggtctctgcacgtggcttctgggctctggaaagc 

ggtccattctcctgacccggatctccggagtggtaggaggcggctcagtcccgggcctgcgctcctagagttcctgtccc 

atctcctcccacgctcacccatcccaaggaaggagggcactcgggccccagcaggctcgtgagagcagcgggctccgccc 

tcccaatggtctatccatcggtgggtgggtccggcgcggcgtcggggctctggcgggtacccgggcgtccccgcgcggcc 

cgggccgccgctcaccgctgcttacgctccgcctgctggagccgccgggagaggtcttcgatccgcacattcttgagctg 

gagcaggtgctccaggcgccttaccttcatctgcaggacctaagccgcaccgcgcaggcgtcaagcctggcggtctgctc 

cctcctgcccggcctctgctggccgcgagcccccacccgcagccttcatagacccgtcacttgcacggtctcttgagagg 

ccaacagctcctgctccttttcagcgatatggaggacgaagctggggtcgccctgcaacgcccggttatacgctggaggc 

cgaggcgccggcggccggtcccagctggggtgggaagcagcttagcggaggcttggacctcgggccctacactcaccctt 

ccagggccctagaccaggccttcccttctccccctctagcccctttcctgcctcccgtctcaggctcagcgtaccgtgcc 

ccccaaccccagctgtgtgcagagatcaccagcctggtgaagtctggaactcccagcttctttacctgagctgaccccct 

cccggggggtccgggacaccttcacctcgggccttctgggatacacctgagacaaggcaagtggaggaatgactgtgctg 

gagtctcattctcctgcttccccctccctacttccactctcacaagtgacaggaagaaggacagaaggaaagctccaacc 

gggagggagaaatggaaaaggccaccttacccacatccatgtagccactgccatcctggggagccagctcctgaaagaac 

cccgggggtccgctgtaagcccgggctcagtcatccccaaggagccccatgtcccttccacgggcccccaggccaggctc 

tattagccaagcacccctcccccacacttatcacccaggctggcctttctgtggtctggggagcgccccccacccagccc 

ccagagcagtgggagaagctggagcggggcggcacctgtaaggagccggcgccctgcttcctgcgcctctgcctctcctc 

caggcgctgcctcagcgggatgagcaccagctccaccacgcctggggcgcactgcgcgatcttgcgcatcacgtcatccg 

gtactgaaaagttcagcctcttcagtaccttcctgaggggtagacattgggatagcagattgggtcctggggaaatgaag 

cgaggggatgggggagagggaagggggcatggtggttgctgttctgagtgtacagccctgggggagtgaaagcgcatcgc 

atctgagcttccttgtgtgggtgttccgaggttccttgcctccttcctaatagaagagtctcttgattttgtaatcggac 

gaattctgtctgcgttgaccctccccaccctgccaccaaaacagtctcagcctgggcaaccaagaccataggagatgatg 

gggctgcacaggtattctgctacctgttcagatgaccccagctgctgagcttctgctggagagagttggcggggacataa 

ttgtgcatctccaccatcttggggaagtaaaacttgatgacctctgcaacaaggactgagggagaggggagcagacatgg 

aaagtgggggcatgggtaaggaaggaggaggtgagagaaggtgggggtgggagaatcaggcatggcagagtggtggaacc 

caatggaggggtggagatgcggaaaggggaaggataaggagggagacaaagacattaagattagtgataggtagaggaga 

ctaagtctccagcatacctcagaggagctgcaccccttccccactaatttaggaatgggaatactgagtctctggtcctc 

cagtagagaatgagtcagccagtgcaggggctcacacctgtaatcccaacactttgggaggctgaggcgggtagatcacc 

tgaggccaggagttcgagaccagcctggccaacatggtgaaaccccatctctactaaaaatacaaaagttagccaggtgt 

ggtggtgcatgtaaacccagctacttgggtggctgaggcagaagaatcgcctgaactcaggaggtggaggttgcagtgag 

ctgagatcatgccactgcactccagcctgggcgacaagagcgagactctgtctcaaaaaaaaaaaaaaaaaaaaaagaga 

atgagtctccaagagaagctttcaggagcccagggaat.tcacat:ggatgcatgatgcacacacacacacacacacacaca 

cacacacacactccaacaccaaagttccaggagagtaaaagactcacgtaagcagagacacatgtgataagatcccccta 

cacatagatacatagggcagaatgtgtagacacatagaacaaacacatgctagacacacataggcacacatgtagaacac 

acatagacagaagcacaaaaagccacatatgcacacagcctccccaaatatacagatacagagatgcatagaagtacttg 

acatgaaaagatgacagataagtatttctgtcattgagatcccaggagacattcctagaaacacaaacaccttcacaatc 

acaaagaaacacatgcacacaagacccccccaaatacaagactaaacatgtcaacacatatgaacacacacatgctcaga 

ctcccagaagacatagaacacatagcaatgagtgagcacatacagaaatgcataaatacatatgcagacacatatcttaa 

tcattaatgtccctggagggagttagaggcaccccccccaacatgcacacaggatagaacatgtagacatacatatatgg 

acacacatcgttcacacacagatgcaaatagaaaacaagcacacacatctaaaatttgaagtcccaggaggcagattcct 

ccccactcccacaagcacacaaacacaaagagtttgcacagagacccgttacacacgaacatacaggacagaacatgcag 

acacacaagggcacacacctccatcgctaaagtcccgggagaggtttcgcttgggccgggacagagggatgttgtctacc 

cacaggtacagctggtgcagcgcctcctcgtccacgctgctcgccattggcgtcctcacggcctggccgccccagcggtg 

ccgggtcccgccccagcctcacgacccttcaggcgcttcctttctcgccactgcagaagcccataactgcctctgcctgc 

ctaatccagagactcacgtcccagctgggagcggccatattgttttctgaaaccatggaaaccattcccgactctcctgg 

gtacaaagacttctgggagttgtagttttatcccctccccgctacctcctctcaggcccgagctttgaaaaggcggttag 

ctgcccttgtcttcttcccaagaaagcatcacttctgtgcccacgccacctagtgaccagccactcatccattttgggac 

cagccaaagccaaccaggaccccagaattccgcggatccttctatagtgtcacctaaatgtcgacggccaggc 
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Signal sequence Pro-domain ^ 



MGWfePRRARG TPLLLLLLLL LLWPVPGAGV LQ[ 3HIPGQPV TPHWVLDGQP 50 



WRTVSLEEPV SKPDMGLVAL EAEGQELLLE LEKNHRLL AP GY I ETHYGPD 100 

H 

Cysteine switch ^ 

GQ PWLAPNH TDHCHYQGRV RGFPDSWWXi CTCSGMSGLI TLSRNASYYL 150 



RPWPPRGSKD FSTHEIFRME QLLTWKGTCG HRDPGNKAGM TSLPGGPQSR 2 00 

Catalytic Domain 

GRREARRTRK YLELYIVADH TLFLTRHKNL NHTKQRLLEV ANYVDQLLRT 250 



LDIQVALTGL EVWTERDRSR VTQDANATLW AFLQWRRGLW AQRPHDSAQL 300 

Metalloprotease 

LTGRAFQGAT VGLAPVEGMC RAESSGGVST DHSELPIGAA ATMA BeIGHS" 350 

domain "Met-turn" ^ 

LGLSHDl PDGC CVEAAAESGG CVMftAATGHP FPRVFSACSR RQLRAFFRKG 400 



GGACLSNAPD PGLPVPPALC GNGFVEAGEE CDCGPGQECR DLCCFAHNCS 450 

Dtsintegrin Domain 

LRPGAQCAHG DCCVRCLLKP AGALCRQAMG DCDLPEFCTG TSSHCPPDVY 500 

Cysteine-rich Domain ^ 

LLDGS PC ARG SGYCWDGACP TLEQQCQQLW GPGSHPAPEA CFQWNSAGD 550 

► 

AHGNCGQDSE GHFLPCAGRD ALCGKLQCQG GKPSLLAPHM VPVDSTVHLD 600 



GQEVTCRGAL ALPSAQLDLL GLGLVEPGTQ CGPRMVCQSR RCRKNAFQEL 650 

EGF-like Domain ^ 

QRCLTACHSH GVCNSNHNCH CAPGWAPPFC DKPGFGGSMD SGPVQAENHD 700 

Transmembrane Domain Cytoplasmic 

[TFLLAMLLSV LLPLLPGAGL AWCCY Rlj PGA HLQRCSWGCR KDPACSGPKD 750 

Domain ^ 

GPHR jPHPLGG VHPM §LGPTA TGQPWPLDPE N |SHEP|5SHPE KPLPAVS| PDP 800 
putative SH2 " S Putative casein kinase H Putative SH3 

binding domain^ phosphorylation site" binding domain 

2VQMPRSC LW FIG 19 
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ggagaatagtggggttgcagtaggaggtagaaggttgggaatgaattggaaaaggagtga 
ctgtgtagccaaggctgtgccttgctcattctaggccaatttcttccttcttgtctctcc 
ctccctccctccctccctccctccctccctccctccctccctccctccctccctacctcc 
ctcctagctatccttccttccttccctttacttctatcttccttctttctttccttccct 
tcccttctcttccccattccccctagaccaggtcccatgtagcccaaactgacttcaatc 
tcattacagatgaccttgaacttcagatcctcagtccccatttctttttgtgcctgaata 
tcgtattttctctgggagcctgtccagtattttcctccaataaatttcattccaggatac 
aactggagtcaattttcttcttgcaaccaaaatgtctaggctagaatgtggagaattcaa 
atattaaacgctagtgtctcagacaccatctgtgggcatgtggccagggagcataaatcg 
cacagtgcttgcttcatactgtctgctgaagtcatccgggccccttcctgctcctcatag 
ctcctcacatgataactcacggggtgggctgctcgctcacagggactggcacaggcttgg 
cacatggcaggtggcacatggttcttgtacatggttgaatagaagcacagaaacccagct 
ctaccagaggcaagacagggagagaacaaaccaaagatggcaagatgttccaggatgacc 
gagagcaaaagccacacctaaagggtggtggctgctgagggtttggggacatgtgacttg 
aatctttgctctcaagtacttccaagagggatgttggagactacttggggtggggctcat 
tgggcccaatcttcagtagtctaaagcccctcagaaataattacttagctgtaattcaag 
cacaatatatacacatggtaagtgtattagcaaaatggctttctatgtaatataaataag 
ataaaaaaacaaacaaacaaaaaacaaaaccagaaataccagcacttcagagtaccccac 
tcccaacccctgcccagtgtagctgctaacttcactaatcaagttaatttaaaaatcaaa 
tataataccacagattttatttttcctgtgtttctgctttatacaaatggtatcatattt 
taaacatgagaacattgtgagatgactagccagtgtctgtctgtctgtctctctcagata 
tttatttatttatttcatgtatgtaggtacactgtcactgtcttcagacacaccagaaga 
ggacattggatcccattacagatggttgtgagccaccatgtggttgctgagaattgaact 
caggacctctggaagagtagtcagtgctcttaaccactgggccatctttccagtcctgac 
cagtgtctcttaatggtgaaatattcttcccttggccctagcattttatccttgggcatt 
ttctcaggaaaaatgaaggcgcgtgtcaccctgagatttgttactaggatgttcataaca 
gtgctacccatcagccaggaactgaaaacaataaatgctcggcagcaggggaatgaccga 
gtagacagagctacgcccacacagctatgagagctgtgcctacagctgagtgtggttctc 
agacggtcagtgggaggggctggcagcgatggctgcctgtggcagttacatgtcctaccg 
ctctattaaaggagctaacaaaggcaaccttaaggaaggaaacatttattttgggtcacg 
gttagtaggcacagtctcagagtggttggaaagatggcagcaagtgtctctagctgaggg 
ggcaggagtgggaggccctggtcactttgtgtcggcgaggaagcagacagagaggtgggg 
gtgccgtttgcttcttactttcctcttcctacccagtccaggatgacagaacatcaggta 
ttgccacccacatttggagtgggtcttctggtctctgttaaaactctctggaaacaccct 
cactgacctgatgtgcccggaacaaggtgattctcagcccaagcaagtcacagcgaagac 
caaggacaggtatagttaaggaatgggatagaagtcaatacacagccactggtgggcaaa 
agggatgggcgggagcacaatggagattcgagagccagggaagtgagcaggttcactttg 
tgaaaatctcacatttctgaatctatattttggaaagaaaacttaatttaaaaaaatctg 
ccacttttcctacgccttgtgctgtgtatcgacatgggaggataccttgaggccaggaga 
ctgagatcagcctgagcaacacaataaaacctcatttcttacaaaacagaacaaattggg 
tgtggaggtgcacgcctttaatcccagcacccagggagcagagccaatcacccttgtaac 
catctcaaattcagacccaggggctgtctagcggcagaagagtccatggaacactgtgac 
catagtcatacaattggaaactacttatcgacaagaataaatggccataggtaataggca 
ataggttgggtaaaactcacaaattgggaaaatacaacccatactccagacaagggccaa 
tcgcctttaaacatagaggtcctttaaactgttaaaagaaggaccaaccactcactagaa 
aaatgagctacacatgaatatgttactatatttaaaaaaaaaaaaaacctgtgtctaaaa 
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cagcagcttatgtaagaaattggtagtagtggc.tttctgtggggaaggagtacccaaagg 

atggagactaaaggaagactctcctaagcattttataccttctcccttgtttgctttgtt 

gtagtggtggtggttgtttttgattcttttttgtttggttggttttggtttttttcaaga 

cagtgtttctttgtggagccctggttgtcctggaactcactcaggctgcccttgaactca 

gagattctcctgcctctgggcttaaaggtgtacctcaccactgtcaggcttgggtttgtt 

tttgagacggggtcttgctgtgtagctccagctgcctctgaagtgctagaattacagaca 

tgaactacagatatctcccagtctcctctcattctccctccctgccccctttgtatacac 

attcacctgtgtgtgcatgtgtgactcactctgagttcttccagtgttactctccatcag 

attttttgagatagggtctcttactgaacctggacctcaaacatttgcctgtctctgtcc 

agtccaacacttggcttacaagtttgtgcaatggttctggctttctctgattgctggaga 

ctt gaactcaagaaccaacgcaggtcct tt act tact aagccttctccttgatgacaccc 

ctcaccccactgccacacaccaggtctccaggctgatctcacactttttccatataggac 

tttgaacttgtcttttgcctctctgctgggatacaggtgtgtgctgctacacccaattta 

tgggatgctggggatggaactcagggccccaatcttgtatcaactgaggtgcgctcgctc 

tctcgctctcattctctctctctctctctctctctctctctctctctctctctctctccc 

ccctccttcccttccctctctcttccatgagacttctttctatattgcacaggctagcta 

tcttgagtgtcctgtctcgatccaattgcctgacctatcatactggttttctgtttttgg 

ttttggttttttcgagactgggtttccctgtgtagccctggctgtcctggaactcacttt 

gtagaccaggctggcctcgaactcagaaatccgcctgcctctgcctcccgagtgctggga 

ttaaaggcgtgcaccaccacgcctttaatcccagcatttgatgttcgaatcgtataaatg 

ttttttcttctgccgttggaacttttacattccatattctcagaacgaaattattaatca 

tcttatttcaaagttaaatctcttgctatccgaaatcataccaagtgtacgaagtgatct 

tttcaacacgaaagcaatgtccttaaaatttgggaaagggtccctcatgcctgggggtct 

gtggtatagaacaattctgtcctggccatcgctgttcctctgccctgtgcccaccactgc 

attgcctttaccaggaaccacagacttcagcaacgcggccaccacaagacccctaggtac 

ccagatgcatctcagctgcagtgggacagcttcactgctgtgggacactaggtgcaggta 

gacagatcctgggtagaaaggagaatccagatgtcactagggagactgcgaccagagatg 

tgtgctaggcagacagagcagctgtggggatcacttagctagtgcctctattcagcacca 

gagaggttctctaagaaccttcccggactttccagcccatcagccatcccttggccagcg 

tggataggattgttgcagataacgccagggagggtcatcctggctcccgcagcgcgacat 

cctgttttgcgggccagggtagggccacactcctgcgctcctgtgaggtggcgcgaggcc 

cgctgagggcgggaggaggcggccccctctagccggggtggagccaacagcacagcaaga 

gtgagagggaaaggcactcccagccatgggctcgaggtgcgggagacccggggggtctcc 

ggtgctgctattgctgccgctgttgctgccctcgtgtccgctgcggagcgctcggatgtt 

tccaggtgaggatacaggcggcgggggggggggggggggcggctctagggcagggtgaag 

tggtggtggtcatgggttccagtgcaactacagaacatagagttcccgaccctcggctac 

ctggccaggacaggatgtgggaagggtctgtgctcctggtaagggtggccctggagcaga 

cttgcccacacttacttagcaaagcaaataggcaacactaccctttcctcagtgctgccc 

agctctgtccggaagcttctgcacttccaaactgccaggccagagtgtctccctgctcct 

ggaaggacctggggtccttggagtcctggtctgcccacaggcctagctgtctcattcggc 

tcgtcctcttctcatgctgggtctgggcagtgtgtatggtgtttatacagttgccttatg 

tggcagttcttcaaacaggggtgtctgcctagccgtgcctggggctaccactttctcttc 

ctactgtaacctctccttactccgggacgacagggctcttagcatctccaagcctgtagg 

gtgaatctccaaactccacatagcaagcactaggctggccagaaggcagaaggaccaccg 

gggaggaggcagtaagggctgagggctggatattgggcagtctggaaaccaccccatatc 

tgttgactgctgcttccgtaaggcttctgagggggctgactgacagctgtattgccttta 

gcctgcggatagttaatggtgaggagagagagcctctgtgtttcctaatgcagactggct 
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tcctcagt ttccactttagtccaccggactcctagcttgcctccttccttgtcttgcagg 
gctgcacagagaacactcacacctgaatccttatcctgcacccttggtaatccgaggcca 
gcaagacttggttctccatactcagttaagccttctcttggcggagagatctttaacctc 
tcaggaaggcttctctgtccttccttctcctcctctcccttctctctcttcagcctccat 
tccaaagccacacttgcacatccccacttgcacatggctcactaattacgaagcacatgt 
gactttaaggacgaggttcctggagctgctgctatttcagattttgaaaatgtatatgag 
cgttttgcctgcacatgcatctgtgtaccgcggaggtaccaggaaaacctgggggccgga 
agagagaggcggatcccctggagctggggttacagatagttatgaggcaccatgtgggtg 
ctgagaactgaacccggtcctcaggaagaacagcgagtgctctcaatccccgatccacct 
ctccagctcccattccagaatcttgatgtctgggatctagggaggcggtggaacaaggag 
ctgggatgacccaggaagggctgagccaacagagctctccactcaggcaaggccagagct 
ggagtggctggggtttctatgtttttttcctttttttttctgtaacctgtgccaactgga 
agatgcagcggcagagggataagtgggcttgctgctatctctgcaggagcttggctcctg 
caggataggtcctgtgagtgtttctgggaccagataggagatggggaggtaggaacattc 
ctgaagaacattccaacaccgtctgagtaatgcttggctccgattctcttcctggagccc 
cttggaaacgcctggcccaggatcactgagcctctgaaccctagctccttcaggttctgg 
gccaggtccgtcagagtcccaacaccagcatcaaccctaatgggagtcaggagctgtggg 
acagacatttggtggctgtcttactcagatggcaaaggaggcctccaaacacctccctca 
ccataaaggcatctactacccagtctaaaaatacccaaggctgcctccttcttcccagcc 
cctctgcgctggggtctgggcttcttcccgggacctctgacctcaaaccagctctctaag 
ctctggccgcttcaaattccccaatggggatggttcctggtgcctcggtttctcttttgc 
cacctccctggttgcttctccatttaggggatgccccatgaataccaaatcctggaagtg 
ctacacatggagtccagctttcaggacccctccttccccgatggcttcctgctcctaact 
gcaattttaacactattctctgctttcaccccaggaaatgcccatggagagctagtcact 
ccccactggatcctggagggcagactctggctcaaggtcaccctggaggagccggtaagt 
atccgttatcctgtccttattaggggccctaggtctcccgctctctgtcctttcccccaa 
atgcagcgtgacagtttttctggggttctcagaaccaaatcactgtaccgttaagccact 
ccctgttgggtgggtgggacaaagagacatctagatggcgtgaggctgcacacatggaag 
tcattttgggcttcctccctcccacccccttctattcctggccctaaccctcttctccat 
gggggaacattcaggccatgcccactcaactaagcaaagatgagttcatagtttggggcc 
taaagcctttcaggagagctctgtgtgtgtgtgtgtgtgtgtgtgtctgtgtgtgtgtgt 
gtgtgtgtgtattctgttctgttttgttttgattgagacagggtcttgctatgtagctct 
ggggggcttcaagtcatgagccatcatgccagccacaggagatttttaaaagaaggacac 
caaaatgtcctctttggccgtgtttatggggaacatgtgactccgtggacatgctgctgt 
cctgtaacctcagcttccatggctggaagccacctgggacaaggaagctgtgggccattc 
tcctcctgtccccagatgtattctcactctggtttgcttctccagagccctgctcagcct 
cctcctaacttctgctcattgtgttcccccagaagagccatgtgtgccttcactgcttcg 
ctgcccatggggaggccggcactgatccgcctggcttacagagaggcaagcgagccccac 
agagcttgtaaagttgctcatgcttgaatagttcattgggcctagaaacctgaggctccg 
tgggacctaagggcttcagatgattggctgcctccccttctttcctataactactcaggc 
tggagagagttcaattctgtacctcctgacagagggcagccccactgtccaaacctagag 
tgttgcaacagtgaggatgcattcccagaaacctaaaccttccaagtcagggccttcact 
ggggtcccccagacttctaggacttctctcaaccaagcaaagatggcttgtggacatgca 
cgggttcccctgatggtggcctctctccttgcctggctcggtctgtgagagaatcccagg 
gggactctgctctgcgttaggaagcctgtgaggccaggccaggaagagctttggcagggt 
gtgtgtatttcacaaacagggttgtactgcaggatggggatggtgcacaaagggggaggg 
agaccctggagcagaagtaaaggcagcaagggccgcagagggaagggaccttccactggg 
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gctactgcctttctcccagagcagacattttcccataaagcaagaggcactccaaccata 

taagctcatgtttgggcccttgtggagcctgtggctggggaagtgagggccatcttctct 

acacctgctgcagaagggccctgaaagattcttggccagggtcccagcccagtgcatttt 

gggataaaaagggaaagccatcgtgggtggggaaaacatttaaaaaaaaaatacagcagc 

ctcccctggaatctcttgggctagttccagttctggcttctagccaggctaagtggacta 

gcctgagagagaccaagtcagtgagagaggagaggtggctagagggccaaggccagccct 

tctgacatctagctaagagagtcaacactttttagggagccaaagttgggtttcatgttt 

acttcatgagttcagttatgaggccagctcagaggaatttcagaggatcgggcagtttgt 

cagaactgaaggtggaggaaaaagtttgggttctctcaggaatgagggaaaggcagcaag 

atggagagatgtgcaaccaggagctgtggatgcagctgggtgatttggtgcttgcttggc 

atgtgcaaggccctgggtttgatctatagcacccacagcaaactagcaagcaagcaaaca 

agcatacaaacaaaacagaagcaaggttgcaggtagtacaagagatggccaaagctgtag 

cccacccaatgacccacatcaccctcagtgcccactgcctcctgcttttcctgccttttc 

atggggctcctgactatggccatagcatatgtccatagcagacataacaacatatgccag 

gcaacgtagcaactgctatgtccgtatatggtggttatgcatctttaagggtccgttgtc 

acaaacacaagcactgagaacatctctaagttacaactcccacacatatccactccctgc 

aatgcgagagccagctgttcctatgagctttctcactaggcagctctacatcctaccggt 

tcctgggcagccaggtggccttggggcctagtgtgtcactgcgttccttctcggtcagat 

cttgaagcctgactcggtgctggtggctttagaggctgaaggccaggatctcctgcttga 

actggagaagaagcagtgagtaccagcgggggggttgctgaagtccagacaaagaccctc 

tctggagaggatcagtgctttctgggagggggttgggggctgggtgggaagcagcagtgg 

gagtgacagggagagtggcggatgtacttggggctacagtggactgaacccaactgtagg 

aagtacatacctggtcatctcataccctggagcagctggatgggcggctccccgctggag 

aagagtgagcaccagaacagacattgactgataccttaattcaaagggaattcttaggca 

aagggaacttccacagatggcagaagaaaaagctgcgaaggtcaaatccaatggtgacag 

ttcatctggtctacctggaatgcctccatctgcgtggaagagaacagagacatctggtct 

acctggaatgcctccatctgcgtggaagggaacagagaggagagggctggggtggggggc 

ttcccagtggggccgatcgaagagggcatcttagtgctaaagatcttggatggacctaaa 

gagtcaaagaacgggaggtggtaaaggagaaacacagtgtcgaaggagtgcagagggtaa 

gcaagatggaggcgtggaactgcagactccaccaccaaccacaccactcgctcatgtctg 

ctttcttacggacattgcagaaggcactcattcattttattctctccatggaatcagaca 

gactgggaacaaactgcaaacaccacgccaaaccacagttcctgacagtgccacatccct 

tcctactaaacggtcttgtacatgtgtgcacacattgaacgttagcatgtatgagtgcag 

ccagcaaagcagcagagttgtaatgtgtgtaaggatggctggccgtacaacagcctgggt 

ttttgatggaggacaatgtgaatttggagtcaggattttctgtaaggaagagcaattgaa 

gtcaggcatggaggtagaggcctgtaagtccggctcctctgggcaggagaacacagagtc 

aagtcagggactgcctgagttataaagaatgagttcaagaccagcctaggcaacttagca 

aaatccagtctcaaaatataaaggagtgaggaagagggaaagagaaaaagagggagaagg 

agaaggaggaaaagaaaagaaagaaggaaagaaagaaagagaaggaaggaagaaaggaag 

gaaggaagataaagaaagagagagagacagacagacagaaagaaagacagaagacagaaa 

gaaagacagaaagaaagaaagaaaggcagaaaagcagctatttatttatggttgcaatgt 

tcctggtggggacagggaagaagcctgaaaatggaaacattactgatagagtctagctcc 

aggtaaagggacccttccccgggtggctgaacagaagcagctacaggcaggtggaagtcg 

gaagtacgcatcactgagggacagatggagacatgagatctagcagtctgaggagaactg 

gagggcctggggtccgctcagcaaaggcaggagtaccctggccctacttggcctctctga 

ggtagtatctcctggcacacacacacacacacacacacacacacacacacacacacacac 

acctgcctgcagcaggctccacagcaccaaaggtccccagcctgtccttctgggtgcttc 
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ccacacttcccatccccttgactaacccacctcaggactcgctccagacctttgtcttct 
ttttgcctgaggttttctctttgaggcacagtgtctcctttctctctgacacactgaatg 
cctccgctagcactcagggaggagagctggaagcaagcggcaggatgcaggtctaaatcc 
agtgactcctaaccctccccctttccagcaagcttctggccccaggatacacagaaaccc 
actacaggccagatgggcatccggtagtgctgtcccccaaccacacggtgagacactttc 
atgggatctgagaacatggggagggtgcagcctcaatgcctaccctgtcctgttttgcat 
tcctaaggaggttgtagtctgccttcaggccccacttcctccttgtcccaccaacttgag 
ctcctactcagatgtgggtatgagcctcttcctctctaggatcattgccaatatcacggg 
cgtgtgaggggcttccgggaatcctgggtggttctcagcacctgctctgggatgaggtaa 
ggcgttgggggagagggcctggggctgcggcagaggtgagcgct.tcctgttcacacatcc 
ttgctatccactgcactctagtggccttattgtgctcagcagcaaagtcagctattatct 
gcaacctcggactcctggggataccaaagacttcccaacccacgagatcttccggatgga 
gcagttgttcacctggagaggggtccagagagacaagaactcccaatacaaagcaggaat 
ggccagtcttcctcatgtcccccagagccgggtaaggggtacagactagagtgggactgg 
tccttcaaagcagaggacacaggaatgcgggtcctggacggggtctccctgttttaattc 
agaggctgaagaatcaagtttgagcagaagtttaactatcccctcccctagcccgggggg 
atcacggtgtgtggccaaggattcggtcaagcagcctgcccctcaccatttgctcctccc 
aggtgaggcgagaggcgcgcaggagtcccaggtacctggaactgtacatagtggctgacc 
acaccctggtgaggagagacctaagaggtctgctgggtggggggactgctcagcaccgtt 
atgaccagggacctctctgccttctcagttcttgcttcagcatcagaacttgaaccacac 
gagacagcgcctcctggaggttgccaattgcgtggaccaggttggatgattagggcgctg 
aaggttgggagtgaggcagttgaaaagctctggtgagtgcctggttcctggcttcagcag 
gtctctccctcagattctcaggactctggatatacagttggtgttgaccgggctggaagt 
gtggaccgagcaggatctcagtcgcatcactcaggacgcaaacgaaacgctctgggcttt 
cctacagtggcgccgcggggtgtgggccaggagaccacacgactccacacaaGtgctcac 
gtgggtgccatttacacacacacacacacacacacacacacacacacacacacacacaca 
caccccggttgcgggatgtccttatttccaactccacccagtcacgccctgctccatcat 
ccttaggggccgcaccttccagggtaccacggtgggcctggcacctgtggagggcatatg 
ccgcgcggagagctccggaggtgtgagcacagtaagctctgctggggacctaggaggaga 
gaggtttgggtagggtttgtggtcctcagtgaaacccggctcctcaggaccactcggaac 
tccccatcggcacagcagccaccatggcccacgagataggccacagcctgggcctccacc 
atgatcccgagggctgctgcgtgcaggccgatgcagagcaaggaggctgcgtcatggagg 
cagccacagggtacgcaaagaaggtcgttgttaccggacctagacaaagcgactgcgtat 
ctttggttacccctcagttccctctttagtaaagataacagcctgctaatagggcctcgg 
tgggaaagatgctgataaggtggagtctcagaatgactgcgggactagggtactccactc 
tgatccttggtcctccttggggcccggactgcccgattttagcgtcacagcctcctgtct 
ccagttcctaggctcctctctctctcgattgctcgcttagcttagaaagaaacagcacat 
cttgagggctgtgtggctaacctcagggcccagagtgcagaagtagcctgagcttaggag 
gtggctccagcacccgagatggtccccacaccaccaatgacatccatcttccatctgcag 
gcaccctttcccgcgcgtcttcagcgcctgcagccgccgccagctgcgcaccttcttccg 
caaagggggcggtccttgcctctccaacacctcggcgccggggctcctggtgctgcccag 
ccgctgcggaaacggcttcttggaagcaggagaagagtgcgactgcggttctggccaggt 
cagatcgtcatcttcgtccctggggttcaaggctaacccatgctcccagcctttcccagg 
tctagcttgctccaccatcacgtgttctgttctctcctttctgacttcagaagtgcccag 
acccctgctgctttgcccacaattgctccctgcgtgcgggggctcaatgtgcccacggtg 
attgctgtgcgaggtgcctggtgaggatactggggagcgccctgtacatcttagctgggc 
tgggacttctagtcccttttctgagtgtgagtttgacccagtgtatgggttcacgtagca 
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ctccctcgggctttcagcaaagatctctctgctttcccttaatgggtctctggtgtagtt 

aaagtccgcgggcacgccttgtcgtcctgctgcgactgactgcgatctccccgagttctg 

caccggcacctccccgtattgccccgcagatgtttacctactggatggctcaccctgcgc 

tgagggtcgcggctattgcctagacggctggtgtcccacgctggagcagcagtgccagca 

gctatgggggcctggtgagaccgcacgctggtcctggtgccctgaccaatactaaaacct 

gcggttttctactgagggcaagctccacccgtggaactgaggcccgagctgcccgcttct 

tactccccctccccccagggtccaagccggccccagagccatgtttccagcagatgaact 

ccatggggaattcgcaagggaactgtggccaggaccacaagggtagcttcctgccttgtg 

ctcagaggtgaggtgtgatgctgagggtctgcagctgtaaagtagggcggagcatgcgga 

gggaacactccaagttgttgaccaccttccacttcctccccagggacgctctgtgtggga 

aactgctgtgccagggaggggagccgaacccactagtgccgcacatagtgactatggact 

ccacaattctcctagagggccgcgaagtggtttgccgaggggcctttgtgctcccagata 

gtcacctggaccagcttgacttgggtctggtagagccaggcaccggctgtggacctagaa 

tggtgagccctgcccacccaacccctcctggttattgagtcctccatgccaaagtgttct 

cctcactgcccagtgggcacaatgcccataggtgtgccaggacaggcactgtcagaatgc 

tacctcccaggagctggaacgttgcttgactgcctgccataacggtggggtgagtagcct 

aaggggtcagggtgaccttggaggtccttgctacctggtgacttttctatcctcatctta 

ggtttgcaatagcaatcgtaactgtcactgtgctgctggctgggctccacccttctgtga 

caagcctggcttgggtggtagcgtggatagtggccctgcacagtctgcaagtatcccaat 

ggggtgggggcaggcaggaaccacctgggcagtagcctgcttgagactcagcacccctgc 

cctccacagaccgagatgccttccccttggccatgctcctcagcttcctgctgcctctgc 

tcccttgggcttgcctagcctggtgctactaccagctcccaacattcttgtcatcgaagg 

gactgtgctgcaggagggaccccctatggaataggtaggttcggtgctcaggtctctctt 

cctgagcctgcccccatggctcctgcttctcagaactcttcagggctttgtagagtgaga 

ggctacagggagctggggctttaggaagctagatgggatccttattctctagatgtagtg 

agagctcccaggctgtgggaagaagtccgtggtgtgtatcactgccctgaccagcacttg 

gtggtgtggtcctttccatgggcttcccttgatttttttgttttatgttttttgttgttg 

ttggtttttttctgctttaaagaaattcaacacagcctgtgctccgctttgtcctgaaga 

cgactctagcttccttgtctcaatatcagccatctcccatggcctttgccctaattactc 

ccttcaggtccctcggtgctagccaaggacccctttgtgctccctataaactggccagct 

atagtgttgctctttctgtgccaggactctgtgcctctgtcccctgtagatcacagtgct 

gtaaacccaattttctggcaggcagaaatgccacctgatagcacacattatctaccgagg 

gcagcctcctcactaggcccctcgggcaagtcaggcacattatgtccctgtctgaacttg 

gaagtgttctagaccaattgagggaagggcagggttgggttggagatgtgactagagggc 

acctcaggccagaaacagcaccagcaggccccaggagccagtgagacggtctggggaaag 

ccaggtagtgcctggggggtgggggcggtgtctgcagacagggaaacaggtggagtgaca 

gttgggagggggcacttcagaggggtggcagctgcacaccgttatcgggatagggtgtca 

aggacagtgggaatatctggatgaaccatccaaagaaatggcaaggtctgtgagaaaggt 

cccggcagttcagagtctagactggcagagcacggctaaacggagcatgggaaggcacag 

ttcccaccaagggagacatctctgcacttcagctctaggtgggccctcggtgacgcctac 

atctagactgagtgggggctggagtggagtcacctcgggagaaaggatgtccacagccct 

ggaggccctgggacataagtgaggtctggacatcttaaggacagaacaggaatgtggaat 

tcgaagcttgagtggaatggagaagcaatccccctttgtctcatacacggtcactttcca 

accttctgaactcttatctggtctgtcactggcccctgcagagacatacccctgggcagt 

gtgcatccggtggagtttggctccatcatcactggagagccctcgccccctcgtaagtgt 

gccccttgggacatggagagggaagcaaggggtgggtgcttgccatctgcctccttctaa 

atgctctccttaacacaccttcaatcctctgccctgctcagccccatggacctcttgcca 
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acagcgttcgcaccctccatctcttgacttgctctcagaccctgcgaactctgagcttac 
ctaagaactaccctctgaagcagcctggtctacagattgagttccagacctgccctatcc 
ctatggtatggaagcaccctgaggacctcctgttgcccagtcacctacctctgtctcagt 
ttgttgtcccctcctcagatttacaggcttgcatcaataaagaaatgagacatgggcctc 
agagaagctgttgtcatagagaccatgatgctggaagccctaggggcagggaagggagac 
actgtggttcttcttgggtccttatagagggaggacaaatgtgccctgccatgtgacttg 
cagtcctcagtttctcagacgcactcttataattcctatgggctgtatgctgagctctta 
ctcagcataggaaccccagagcccgatcatgttgtatcccgcctgccctgagagctgtgc 
tattctgaaatgttagaatgtatctaataacaataaatccacaagttatatcagtgttgt 
tggctgtgacctgttaaaagggtctaagttgtttattaaaaagatatggagatggattac 
tgagaaagaaattagaaacaacatctggacagtggaggagccagcactggggaggaaagg 
gcagacagatctctgagttcaaggccagcctggtctacagattgagttccaggatagcta 
gggccacacagaggaaaccctgttttgaaaaccaaacagtcaaataataaaaacaaactt 
agtagcaccttgtacagacagagagaaaggttcccaggagagctcagaccccagcaccag 
aggtggcaaggcagagtctcaaagcccaggtggcaaaagtgggatctgttagcataaacc 
caaagggcgcgtgggacaggcagggagtaccctttgatgcaagctcactctggtgagggc 
ccctcacccctggatgtctgttagcaagggaatcagtcagtgtctcagtctgttaacatc 
tgtgagaggggaaaggctgctgcagacatggcctgagcagcatctggattcgaacatttg 
cactttagggcctgctctctcccctgggtggggcactcgccattcattgcctttacgaca 
gctgtgagagagaggttgctgcaagtgtgtatggctgggttagctcgagcccaccaggca 
atcatgatttccctcgacatctagtcattaacaaacacaggttcttttactacaatttta 
actaccaatattaactaataatgtgtataaaatatataacacagtacacatatataagta 
catagatgttagtacatataaatattatatatatatatataattgtaattaatattactt 
aattttattttatcatttgatattattttactattagttataacaatgtgcataatatat 
gtgtaatataaaatataattttattatttaatattattatataaatttaattaatattaa 
ttatacctatatatttagtacatatacataggt'tacagaatggctacaaaagtgccagga 
gccatcaaggagaagctaaaagccagcaagtgatcttcctgagacggttctgccatggac 
tgtacaattagtgatggatttgcttctgtaggcaaggacgaggagatttcattttaggaa 
agattcctgctattaatatgcttttcctggtattattaaatatatataacaatcactagg 
tattagcccaccgttttgaacagaatgttctgcagaacaatgaagatgtactctcttgta 
atgatgctatatagacaaatagattatttcttttttaaaaaagaaaaaagagccgggcga 
tggtggcacatgcctttaatcccagcacttgggaggcagaggcaggcagatttctgagtt 
caaggccagcctggactacagagtgagttccaggacagccagggctactcagagaaactc 
tgtcttggaaaaaaaaaaagaggaagaaagaaaaaagatttatttatttattttatacat 
atgagtacaccatcagacacacaagaagagggcaccagaccccattacagatggttgtga 
gccaccatgtggttgctgggaattgaactcaggacctctggaagaacagttggtgctctt 
aacccctgagccatctctccagcccaaatagatgatttcttaattcttaaggatgatcct 
ataagaattcctaaacttacattagtaattattaagctcttttacaataggacttctatt 
aagtcttctctaatatgaaaacttcaataagaactctgccagtcttcaagtgtcatgagt 
tagttgcttctgagatagcaagtaggcatcaacaacttagagcacattctaggaggttgt 
aaaaccattaaccagtggtcttaaaaagggaactaacaataggctataggtgcaaggaca 
gaagataaaatattgactaggtttatcaatacaaaatttacccacaaaagttatgttttt 
gacttttcataaaaactctttatgaacctgtagaactggtgaaagatgacgaatgcttag 
ccagataattactcctaatagatatgcatgtgaatattctgtgctgtaaacttatttatg 
tttgaacttccagtgaacttttgtttaaaaaagggggggggttgaaaaagccatgtgatc 
tattctcctagaaagggtacagaagactaagaaagattacattggagatgtaaccttgga 
gagaaagctttgggagcaagagcatagagagcaaggccattgtggcatcagagcaggagg 
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agagagcaagattagaaggagatgcagagtggaataacttagaaactataaggcaacata 

aaaaattaagagagccatatgcagaatgcagagggaaagagaaaaaaaaaaaaaaaaaga 

agctgcagggagagcagaaggagcaggcaggcttctcctgaccatggggtagaacagggc 

ttttcttaataccaaggcaggcttagtcttaaggataataaagcttttctttcttacaga 

cttggttttaattcatttagcaataaaagtgtaaaagtgttttctttccctatgcaataa 

agattggagcttatttttcagccagaatgagtgagttctctctgcaacggtgcttggtct 

tttgcttcatatacacacataagtgtgtgtgtgcgcgcatgcgtgtgtgtgtgtgtgtgt 

gtgtgtgtgtgtgtaagtgtgcaattatcagatggcatggaagctgggctcaattggttc 

aaatggggacttgtgagggtatatgcatgaatctgtatatgaattcatgtgagcttatat 

atatttgcttgtgtaaaagtttttccttctgtgagtgtgactctcttctcctggttcaat 

agaggtttattgcttcaaacttccccctagcctgacagtcgagaggcatctggacaagag 

agaaaaggctctagccattaatccttttcttagatccattttcttagagaactttcttag 

gaaactgtttagagagaacatagaaacaggctgaaatcacttgtcaaactgtcccctttt 

cttcctaaggacttctactagcagactgggagttagagctgcacagtccctgaggagata 

gaacaaaggctgctttactgaatcccctgctgttttaagatgaggttctaaaggagattg 

cagtttctgacccccaaaaggaactcaggcaggtcagctacagtatcaaagtgacttaaa 

cttaagatagggatatgttttattattaaacagctaccctaaatatctcataagatcaag 

cttaccccggtgacacttccccctctgttgcctcaagaggaaccaagcagaaagaaccgc 

cagggctggctcctggcacaaatgggttaaagatgttgtagcatggggaaatgaagagat 

ggctcagctattaagagaatatcttactcttccagaggaccagtgttcaattcccagcaa 

acatatcaggtgccacaccatcacttgtagctccagctgcagatctgctacatctggcct 

ccataggcacccacacacaggtggcacccacaattaaaaaaataagataaatctaaaaca 

gcaaagttaaagcatgagctgaaactagtaaagtgcttgtgtggcatagaccaagacctg 

ggtttggtccctacctgttagaaatagtctcagtatcacacaaaggaacacccaagcgaa 

gcaaaagctcccagcaagacaaaactacagtcttcattgagagtgtgcacgctgaagacc 

gagcacactgggtgcaaaatgtacttggattctgtttgcttgttttgttccagacagggt 

ttctctgcataacagccttggctgtccctgaaattcactgtgtagaccaggctggcctca 

aaccctcagagatccgttcacccacgcttatctaggcttcagtctcaccctgtgagatgg 

cctgaaagttgttagaaccgcgcgggatctatttctgacagactggctggcatcttttcc 

ttctctcagcatgagattcctggggcgttcccatttcagcatcaagcatggtagcagagt 

tggaacctgagggctgagggctcagactcagaccataaactggaagcagagagaactgga 

gattgtgggaggctttgaaacctcagctcctgccccagcaaataccttccagcaaggcca 

cacctcttaaacctccccaaacagggtcaccaactggggacctaatattcaaatgcccac 

aaatatgggagacatgacatccaaaccgccaggacaggtgtatacctccatgcttggttt 

ccgtagtaagaaacactaaacattagcctttcctaataaacactgatataaagccctgct 

attctcgatgtttttcctctgttctgctcctccttctccacctgcttctctgttctctga 

cctcttctgtgtcacagatagccctgccatgtccatctgccagccatgttctgtctactt 

gcctctctctctgctctggactcttctagatgcctctggctgttctttctcatatctaca 
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>ltl216_cDNA 

AAAGGCACTCCCAGCCATGGGCTCGAGGTGCGGGAGACCCGGGGGGTCTCCG 

GTGCTGCTATTGCTGCCGCTGTTGCTGCCCTCGTGTCCGCTGCGGAGCGCTCG 

GATGTTTCCAGGAAATGCCCATGGAGAGCTAGTCACTCCCCACTGGATCCTGG 

AGGGCAGACTCTGGCTCAAGGTCACCCTGGAGGAGCCGATCTTGAAGCCTGAC 

TCGGTGCTGGTGGCTTTAGAGGCTGAAGGCCAGGATCTCCTGCTTGAACTGGA 

G AAG AAGC ACAAGCTTC TGG C CCC AGG ATAC AC AGAAACC C ACTAC AGGCC AG 

ATGGGCATCCGGTAGTGCTGTCCCCCAACCACACGGATCATTGCCAATATCAC 

GGGCGTGTGAGGGGCTTCCGGGAATCCTGGGTGGTTCTCAGCACCTGCTCTG 

GGATGAGTGGCCTTATTGTGCTCAGCAGCAAAGTCAGCTATTATCTGCAACCTC 

GGACTCCTGGGGATACCAAAGACTTCCCAACCCACGAGATCTTCCGGATGGAG 

CAGTTGTTCACCTGGAGAGGGGTCCAGAGAGACAAGAACTCCCAATACAAAGC 

AGGAATGGCCAGTCTTCCTCATGTCCCCCAGAGCCGGGTGAGGCGAGAGGCG 

CGCAGGAGTCCCAGGTACCTGGAACTGTACATAGTGGCTGACCACACCCTGAA 

CTTGAACCACACGAGACAGCGCCTCCTGGAGGTTGCCAATTGCGTGGACCAGA 

TTCTCAGGACTCTGGATATACAGTTGGTGTTGACCGGGCTGGAAGTGTGGACC 

GAGCAGGATCTCAGTCGCATCACTCAGGACGCAAACGAAACGCTCTGGGCTTT 

CCTACAGTGGCGCCGCGGGGTGTGGGCCAGGAGACCACACGACTCCACACAA 

CTGCTCACGGGCCGCACCTTCCAGGGTACCACGGTGGGCCTGGCACCTGTGG 

AGGGC ATATGCCGC GC GGAG AGCTCCGG AGGTGTGAGC AC AGACC ACTCGG A 

ACTCCCCATCGGCACAGCAGCCACCATGGCCCACGAGATAGGCCACAGCCTGG 

GCCTCCACCATGATCCCGAGGGCTGCTGCGTGCAGGCCGATGCAGAGCAAGG 

AGGCTGCGTCATGGAGGCAGCCACAGGGCACCCTTTCCCGCGCGTCTTCAGCG 

C CTGC AGCCGCCGC C AGCTGCGC ACCTTCTTCCGC AAAGGGGGCGGTCCTTGC 

CTCTCCAACACCTCGGCGCCGGGGCTCCTGGTGCTGCCCAGCCGCTGCGGAAA 

CGGCTTCTTGGAAGCAGGAGAAGAGTGCGACTGCGGTTCTGGCCAGAAGTGC 

C CGGACCCC TGCTGC TTTGCC C AC AATTGCTCCCTGCGTGCGGGGGC TC AATG 

TGCCCACGGTGATTGCTGTGCGAAGTGCCTGTTAAAGTCCGCGGGCACGCCTT 

GTCGTCCTGCTGCGACTGACTGCGATCTCCCCGAGTTCTGCACCGGCACCTCC 

CCGTATTGCCCCGCAGATGTTTACCTACTGGATGGCTCACCCTGCGCTGAGGG 

TCGCGGCTATTGCCTAGACGGCTGGTGTCCCACGCTGGAGCAGCAGTGCCAGC 

AGCTATGGGGGCCTGGGTCCAAGCCGGCCCCAGAGCCATGTTTCCAGCAGAT 

GAACTCCATGGGGAATTCGCAAGGGAACTGTGGCCAGGACCACAAGGGTAGC 

TTCCTGCCTTGTGCTCAGAGGGACGCTCTGTGTGGGAAACTGCTGTGCCAGGG 

AGGGG AGCC GAAC CC ACT AGTGCCGC AC AT AGTG AC T ATGG ACTCC AC AATTC 

TCCTAGAGGGCCGCGAAGTGGTTTGCCGAGGGGCCTTTGTGCTCCCAGATAGT 

CACCTGGACCAGCTTGACTTGGGTCTGGTAGAGCCAGGCACCGGCTGTGGACC 

TAGAATGGTGTGCCAGGACAGGCACTGTCAGAATGCTACCTCCCAGGAGCTGG 

AACGTTGCTTGACTGCCTGCCATAACGGTGGGGTTTGCAATAGCAATCGTAACT 

GTCACTGTGCTGCTGGCTGGGCTCCACCCTTCTGTGACAAGCCTGGCTTGGGT 

GGTAGCGTGGATAGTGGCCCTGCACAGTCTGCAAACCGAGATGCCTTCCCCTT 

GGCCATGCTCCTCAGCTTCCTGCTGCCTCTGCTCCCTGGGGCTGGCCTAGCCT 

GGTGCTACTACCAGCTCCCAACATTCTGTCATCGAAGGGGACTGTGCTGCAGG 

AGGGACCCCCTATGGAATAGAGACATACCCCTGGGCAGTGTGCATCCGGTGGA 

GTTTGGCTCCATCATCACTGGAGAGCCCTCGCCCCCTCCCCCATGGACCTCTTG 

CCAACAGCGTTCGCACCCTCCATCTCTTGACTTGCTCTCAGACCCTGCGAACTC 

TG AGCTT ACCT AAG AACT ACCC TCTGAAGC AGCCTGGTC T AC AGATTGAGTTCC 

AGACCTGCCCTATCCCTATGGTATGGAAGCACCCTGAGGACCTCCTGTTGCCCA 

GTCACCTACCTCTGTCTCAGTTTGTTGTCCCCTCCTCAGATTTACAGGCTTGCAT 

C AATAAAG AAATGAG AC ATGGGCCTC AG AGAAS C TGTTGTC ATAGAGACC ATG 

ATGCTGGAACCCCTAGGGGCAGGGAAGGGAGACACTGTGGTTCTTCTTGGGTC 

CTTATAGAGGGAGGACAAATGTGCCCTGCCATGTGACTTGCAGTCCTCAGTTTC 

TCAGACGCACTCTTATAATTCCTATGGGCTGTATGCTGAGCTCTTACTCAGCATA 

GGAACCCCAGAGCCCGATCATGTTGTATCCCSCCTGCCCTGAGAGCTGTGCTAT 

TCTGAAATGTTAGAATGTATCTAATAACAATAAATCCACAAGTTATATCAGHAAA 

AAAAAAAA 
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QILRTLDIQLVLTGLEVWTEQDLSRITQDANETLWAFLQWRRGVWARRPHDSTQIiliTGRTFQGTTVGLAPVEGICRAESS 

ggvstdhselpigtaatmaheighsl'glhhdpegccvqadaeqggcvmeaatghpfprvfsacsrrqlrtffrkgggpcl 
sntsapgllvlpsrcgngfleageecdcgsgqkgpdpccfahncslragaqcahgdccakcllksagtpcrpaatdcdlp 
efctgtspycpadvylldgspcaegrgycldgwcptleqqcqqlwgpgskpapepcfqqmnsmgnsqgncgqdhkgsflp 
caqrdalcgkllcqggepnplvphivtmdstillegrewcrgafvlpdshldqldlgbvepgtgcgprmvcqdrhcqna 

TSQELERCI/TACHNGGVCNSNRNCHCAAGWAPPFCDKPGLGGSVDSGPAQSAl^ 

QLPTFCHRRGLCCRRDPLWNRDIPLGSVHPVEFGSIITGEPSPPPPWTSCQQRSHPPSLDLLSDPANSELT 
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mGene216 582 VPHIVTMDSTILLEGREWCRGAFVLPDSHLDQLDLGLVEPGTGCGPR>IV 631 

ii. i -ill* hi-n \\\\ ii . ii i Miiiiii mill 
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• • » • . 

mGene216 632 CQDHHCQNATSQELERCLTACHNGGVCNSNRNCHCAAGWAPPFCDKPGLG 681 

II I I- Mhlllllll- llllll Mill I Mill, II | 
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Mill -11 III MM I Ihl IIM 

hGene216 7 50 WGCRRDPACSGPKDGPHRDHPLGGVHPMELGPTATGQPWPLDPENSHEPS 799 

mGene216 774 SHP . . PSLDLLSDPANSELT 791 

III I -II 

hGene216 800 SHPEKPLPAVSPDPQADQVQMPRSCLW 82 6 
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10 30 50 
»•••«. 
CGGGCACGGGTCGGCCGCAATCCAGCCTGGGCGGAGCCGGAGTTGCGAGCCGCTGCCTAG 
+ + + + + + 



70 90 110 

AGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCCCGTTGC 
+ + + + + + 

MetGlyTrpArgProArgArgAlaArgGlyThrProLeuL 



130 150 170 

TGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGAC 

• + + + h + + 

euLeuLeuLeuLeuI.euLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGlyH 

190 210 230 

ATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACCG 

+ H + + (■ + 

isIleProGlyGlnProValThrProHisTrpValLeuAspGlyGlnProTrpArgThrV 

250 270 290 

TCAGCCTGGAGGAGCCGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAAG 

+ + + + + + 

alSerLeuGluGluProValSerLysProAspMetGlyLeuValAlaLeuGliiAlaGluG 

310 330 350 

• 

GCCAGGAGCTCCTGCTTGAGCTGGAGAAGAACCACAGGCTGCTGGCCCCAGGATACATAG 

+ + 1 H 1- f- 

lyGlnGluLeuLeuLeuGluLeuGluLysAsnHisArgLeuLeuAlaProGlyTyrlleG 

370 390 410 

AAACCCACTACGGCCCAGATGGGCAGCCAGTGGTGCTGGCCCCCAACCACACGGTGAGAT 

+ + + + + + 

luThrHisTy_rGlyProAspGlyGlnProValValLeuAlaProAsnHisThrValArgC 

430 450 * 470 

GCTTCCATGGGCTCTGGGATGCACCGCCAGAGGATCATTGCCACTACCAAGGGCGAGTAA 

+ + + + + + 

ysPheHisGlyLeuTrpAspAlaProProGluAspHisCysHisTyrGlnGlyArgValA 
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490 510 530 

GGGGCTTCCCCGACTCCTGGGTAGTCCTCTGCACCTGCTCTGGGATGAGTGGCCTGATCA 

+ + + + + — • + 

rgGlyPheProAspSerTrpValValLeuCysThrCysSerGlyMetSerGlyLeuIleT 

550 570 590 

CCCTCAGCAGGAATGCCAGCTATTATCTGCGTCCCTGGCCACCCCGGGGCTCCAAGGACT 
+ + + . — + + + 

hrLeuSerArgAsnAlaSerTyrTyrLeuArgProTrpProProArgGlySerLysAspP 

610 630 650 

TCTCAACCCACGAGATCTTTCGGATGGAGCAGCTGCTCACCTGGAAAGGAACCTGTGGCC 
— . . — + + + » + + + 

heSerThrHisGluIlePheArgMetGluGlnLeuLeuThrTrpLysGlyThrCysGlyH 

670 690 710 

ACAGGGATCCTGGGAACAAAGCGGGCATGACCAGCCTTCCTGGTGGTCCCCAGAGCAGGG 
+ + + + + + 

IsArgAspProGlyAsnLysAlaGlyMetThrSerLeuProGlyGlyProGlnSerArgG 

730 750 770 

GCAGGCGAGAAGCGCGCAGGACCCGGAAGTACCTGGAACTGTACATTGTGGCAGACCACA 
+ — . + + + + 

lyArgArgGluAlaArgArgThrArgLysTyrLeuGluLeuTyrlleValAlaAspHisT 

790 810 830 

CCCTGTTCTTGACTCGGCACCGAAACTTGAACCACACCAAACAGCGTCTCCTGGAAGTCG 

+ + + + + . + 

hrLeuPheLeuThrArgHisArgAsnLeuAsnHisThrLysGlnArgLeuLeuGluValA 

850 870 890 

CCAACTACGTGGACCAGCTTCTCAGGACTCTGGACATTCAGGTGGCGCTGACCGGCCTGG 

+ + + + + 

laAsnTyrValAspGlnLeuLeuArgThrLeuAspIleGlnValAlaLeuThrGlyLeuG 

910 930 950 

AGGTGTGGACCGAGCGGGACCGCAGCCGCGTCACGCAGGACGCCAACGCCACGCTCTGGG 

+ + + + + + 

luVa 1 TrpThrG luArgAspArgS e r Ar gVa 1 ThrG InAsp Al a AsnAl aThrL euTrpA 

970 990 1010 
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CCTTCCTGCAGTGGCGCCGGGGGCTGTGGGCGCAGCGGCCCCACGACTCCGCGCAGCTGC 

+ + + + + + 

laPheLeuGlnTrpArgArgGlyLeuTrpAlaGlnArgProHisAspSerAlaGlnLeuL 

1030 1050 1070 

TCACGGGCCGCGCCTTCCAGGGCGCCACAGTGGGCCTGGCGCCCGTCGAGGGCATGTGCC 

+ + + — + + + 

euThrGlyArgAlaPheGlnGlyAlaThrValGlyLeuAlaProValGluGlyMetCysA 

1090 1110 1130 

GCGCCGAGAGCTCGGGAGGCGTGAGCACGGACCACTCGGAGCTCCCCATCGGCGCCGCAG 

+ + — + + + ■ + 

rgAlaGluSerSerGlyGlyValSerThrAspHisSerGluLeuProIleGlyAlaAlaA 

1150 1170 1190 

CCACCATGGCCCATGAGATCGGCCACAGCCTCGGCCTCAGCCACGACCCCGACGGCTGCT 
+ + + + + + 

1 aThrMe tAlaHisGluIleGlyHis SerLeuG ly LeuS er Hi s Asp P r o AspG ly Cy s C 

1210 1230 1250 

GCGTGGAGGCTGCGGCCGAGTCCGGAGGCTGCGTCATGGCTGCGGCCACCGGGCACCCGT 

+ + + + -+ + 

ysValGluAlaAlaAlaGluSerGlyGlyCysValMetAlaAlaAlaThrGlyHisProP 

1270 1290 1310 

TTCCGCGCGTGTTCAGCGCCTGCAGCCGCCGCCAGCTGCGCGCCTTCTTCCGCAAGGGGG 

+ . + + + + + 

heProArgValPheSerAlaCysSerArgArgGlnLe\iArgAlaPbePheArgLysGlyG 

1330 1350 1370 

GCGGC GCTTGC CTC TCC AATGCCCCGG ACCCCGG ACTCC CGGTGCCGCCGGCGCTCTGC G 

i. + + , — w+ + + 

lyGlyAlaCysLeuSerAsnAlaProAspProGlyLeuProValProProAlaLeuCysG 

1390 1410 1430 

GGAACGGCTTCGTGGAAGCGGGCGAGGAGTGTGACTGCGGCCCTGGCCAGGAGTGCCGCG 

f. + ■- + h + + 

lyAsnGlyPheValGluAlaGlyGluGluCysAspCysGlyProGlyGlnGluCysArgA 

1450 1470 1490 



FIG. 24.3 



WO 02/083077 



86/157 



PCT/US02/12063 



ACCTCTGCTGCTTTGCTCACAACTGCTCGCTGCGCCCGGGGGCCCAGTGCGCCCACGGGG 
+ + + + + + 

spLeuCysCysPheAlaHisAsnCysSerLeuArgProGlyAlaGlnCysAlaHisGlyA 

1510 1530 1550 

ACTGCTGCGTGCGCTGCCTGCTGAAGCCGGCTGGAGCGCTGTGCCGCCAGGCCATGGGTG 
+ + + + + + 

spCy sCysVa 1 ArgCy sLeuLeuLys Pro Al aG lyAl aLeuCy s ArgGlnAlaMe tGlyA 

1570 1590 1610 

ACTGTGACCTCCCTGAGTTTTGCACGGGCACCTCCTCCCACTGTCCCCCAGACGTTTACC 
+ + + + + + 

spCysAspLeuProGluPheCysThrGlyThrSerSerHisCysProProAspValTyrL 

1630 1650 1670 

TACTGGACGGCTCACCCTGTGCCAGGGGCAGTGGCTACTGCTGGGATGGCGCATGTCCCA 
+ + + + + + 

euLeuAspGlySerProCysAlaArgGlySerGlyTyrCysTrpAspGlyAlaCysProT 

1690 1710 1730 

CGCTGGAGCAGCAGTGCCAGCAGCTCTGGGGGCCTGGCTCCCACCCAGCTCCCGAGGCCT 
+ + + + + + 

hrLeuGluGlnGlnCysGlnGlnLeuTrpGlyProGlySerHisProAlaProGluAlaC 

1750 1770 1790 

GTTTCCAGGTGGTGAACTCTGCGGGAGATGCTCATGGAAACTGCGGCCAGGACAGCGAGG 



ysPheGlnValValAsnSerAlaGlyAspAlaHisGlyAsnCysGlyGlnAspSerGluG 



1810 1830 1850 




lyHisPheLeuProCysAlaGlyArgAspAlaLeuCysGlyLysLeuGlnCysGlnGlyG 



1870 1890 1910 

GAAAGCCCAGCCTGCTCGCACCGCACATGGTGCCAGTGGACTCTACCGTTCACCTAGATG 

+ + + + + + 

lyLysProSerLeuLeuAlaProHisMetValProValAspSerThrValHisLeuAspG 



1930 1950 1970 

GCCAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCTGGACCTGCTTG 
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lyGlnGluValThrCysArgGlyAlaLeuAlaLeuProSerAlaGlnLeiiAspLeuLeuG 

1990 2010 2030 

GCCTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGTGCCAGAGCAGGC 
+ + + + — . + + 

lyLeuGlyLeuValGluProGlyThrGlnCysGlyProArgMetValCysGlnSerArgA 

2050 2070 2090 

GCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTGCCACAGCCACG 
+ + + + + + 

rgCysArgLysAsnAlaPheGlnGluLeuGlnArgCysLeuThrAlaCysHisSerHisG 

2110 2130 2150 

GGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGCTCCACCCTTCTGTG 

-+ + ~ 1 _ 1- 1- 

lyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaProProPheCysA 

2170 2190 2210 

ACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGAAAACCATGACA 

+ + + + + _. + 

spLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGluAsnHisAspT 

2230 2250 2270 

+ + + + + 

hrPheLeuLeuAlaMefcLeuLeuSer Val LeuLeuProLeuLeuProGlyAlaGlyLeuA 

2290 2310 2330 

CCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTGGGGCTGCAGAA 

+ + + + + + 

laTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTrpGlyCysArgA 

2350 2370 2390 

GGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCACCCCCTGGGCGGCG 
+ + + + + + 

rgAspProAlaCysSerGlyProLysAspGlyProHisArgAspHisProLeuGlyGlyV 

2410 2430 2450 

TTCACCCCATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCTGGACCCTGAGA 
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alHisProMetGluLeuGlyProThrAlaThrGlyuxnt^roTrpProIieuAspProGluA 

2470 2490 2510 

ACTCTCATGAGCCCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTCGCCTGACCCCC 
h h [. + + + 

snSerHisGluProSerSerHisProGluLysProLeuProAlaValSerProAspProG 

2530 2550 2570 

AAGCAGATCAAGTCCAGATGCCAAGATCCTGCCTCTGGTGAGAGGTAGCTCCTAAAATGA 
+ + + + + + 

InAl a AspGlnVa 1G InMe t Pr o ArgS er Cy sL e uTrp End 

2590 2610 2630 

ACAGATTTAAAGACAGGTGGCCACTGACAGCCACTCCAGGAACTTGAACTGCAGGGGCAG 
+ + + + + + 

2650 2670 2690 

AGCCAGTGAATCACCGGACCTCCAGCACCTGCAGGCAGCTTGGAAGTTTCTTCCCCGAGT 
+ + + + + ■ + 

2710 2730 2750 

GGAGCTTCGACCCACCCACTCCAGGAACCCAGAGCCACATTAGAAGTTCCTGAGGGCTGG 
+ + + + + + 

2770 2790 2810 

AGAACACTGCTGGGCACACTCTCCAGCTCAATAAACCATCAGTCCCAGAAGCAAAGGTCA 
, + + + + + + 

2830 2850 2870 

CACAGCCCCTGACCTCCCTCACCAGTGGAGGCTGGGTAGTGCTGGCCATCCCAAAAGGGC 
+ + — + + + ■ + 

2890 2910 2930 

TCTGTCCTGGGAGTCTGGTGTGTCTCCTACATGCAATTTCCACGGACCCAGCTCTGTGGA 
+ + + + + + 

2950 2970 2990 

GGGCATGACTGCTGGCCAGAAGCTAGTGGTCCTGGGGCCCTATGGTTCGACTGAGTCCAC 
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3010 3030 3050 

ACTCCCCTGCAGCCTGGCTGGCCTCTGCAAACAAACATAATTTTGGGGACCTTCCTTCCT 
+ + . + + + + 

3070 3090 3110 

GTTTCTTCCCACCCTGTCTTCTCCCCTAGGTGGTTCCTGAGCCCCCACCCCCAATCCCAG 

+ + + H + + 

3130 3150 3170 

TGCTACACCTGAGGTTCTGGAGCTCAGAATCTGACAGCCTCTCCCCCATTCTGTGTGTGT 

3190 3210 ' 3230 

C GGGGGGAC AGAGGGAACC ATTT AAGAAAAG ATAC C AAAGTAG AAGTCAAAAGAAAGACA 
+. + + + + + 

3250 3270 3290 

TGTTGGCTATAGGCGTGGTGGCTCATGCCTATAATCCCAGCACTTTGGGAAGCCGGGGTA 
+ . + + + + + 

3310 3330 3350 

GGAGGATCACCAGAGGCCAGCAGGTCCACACCAGCCTGGGCAACACAGCAAGACACCGCA 
+_. + + + + + 

3370 3390 3410 

TCTACAGAAAAATTTTAAAATTAGCTGGGCGTGGTGGTGTGTACCTGTAGGCCTAGCTGC 
(. + h h + + 

3430 3450 3470 

TCAGGAGGCTGAAGCAGGAGGATCACTTGAGCCTGAGTTCAACACTGCAGTGAGCTATGG 
+ + h + + + 

3490 3510 3530 

TGGCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAATAAATTTT 
+ + + + + + 

3550 3570 3590 
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10 30 50 

CGCAACTGGAGTGAGTCTCTTTCTTGCAACCCAAACAGCTGGGCTAGAACAGAGGAGTCA 
+ + + + + + 

70 90 110 

CATGTTAGTGGGGTTTATTCTTAGACACCATCTGTAGGAGGGGGCCAGGGGCTGCAAGTT 
+ + + + + + 

130 150 170 

ACATGACATTTATCACAAATTACCTTACGCTGAAGTGATTTGTGTCCCTTTCTGCTCCTT 

+ + _ _ + , + + + 

190 210 230 

G AGG C TTCGTGTGTTG CTC ACATTTGC CACC CAC AATG AC C AGCCCCGTG C ATGGC ACGC 

+ ■ 4- ■ 4- 1 + 4- + 

250 270 290 

AGTAGGTGGGCACGTGAACTAAGCGTTTGTCTTGTCTACATTGCTGACAGTGGGGAGGGT 
+ . + . + + + + 

310 330 350 

GGAGAACATGGACTAAATCCAGAGGTGGGACAGGGAGAGAGCAAGTCAAATGCAGCCAGA 

+ , + , + + + + 

370 390 410 

TGTTTGGGGGCTGTCTGGGGAAGAGAGGGTGTAGGGAGGACCATAAGCAGCTGCTGAAGC 
+ + + + + + 

430 450 470 

TGCGGCTACTGGGGGGATTTGGGAATATCCGGCTTAGGAGTCTTCACCTGCTTCTGCATA 
+ + + __ + -4- ■ + 

490 510 530 

CTTCCAAGGGGACATATGTCCATCTTGGATTGGAGGCTCTCAGAGGGGTTAGTTGTGTCC 
+ + + + + + 

550 570 590 

GGTTCCAGTGGTCTGAGGCCTCCTGAGACTTTTTGTACATTCAATTAAAGTGTAACATAC 
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610 630 650 

AGAAAAGTTCACACATTATAAATGC^CTCAAGGACTTTTCCAAAAGCGAACACACCCAG 



670 690 710 

ATCAAGAAATAGACCATCCTACAGTCCCCCCTTACACTCTGTAGCAGTTGCAGCCCCCAC 
+ + + + + + 



730 750 770 

AAGGGTAACTACTGTCTTGACTTCGAACACCATAGATTTGATTTGCCTGTTTTTAAACTT 



790 810 830 

TACATAAGTAGAATCACAGAGTGTGTACAATGACTTTGGAAAACTGTTTGACAATATCTA 



850 870 890 

TTAAAGCTAAAATACCCTTGCCCTATGAACCTGAAATTCCACCCACCTTGCCAAGGGACA 



910 930 950 

AAAAGTTCCCCTCTAAATGCACCAGGCTGTCAGGGATGAAGCGTTGGCTTTGGGGCCCCC 
+ , + + + + + 



970 990. 1010 

ATTCACACACATGACCTTTTCTGGGGCACCCAAGCATCAGCCTGTGGTCACCAGGTGCCA 



1030 1050 1070 

CCCTGGCGATCTCTGAAGGCTGGAGTCGGAGTGCCTCCCTCAGACATCCTGTTCTGCGTC 
+ + + + + + 



1090 1110 1130 

ACTCCTTGGGAGAAGTCGTGTTTACAGATGGTGGGTGTCACCCATGCCAAGCACTTCTAA 



1150 1170 1190 

GGGTTAATGCTCACTGGTTTGCCTGGTTCCCAGGACATTTCCTGATGCCCCTCTGGAGGG 
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1210 1230 1250 

TGACGCCAACAAGCCAGTGGAGAAGCCATCTTTCCCAGGTGCTGTCAGGCGCCCCCGGAG 
+ + + + + + 

1270 1290 1310 

CTGCTCGGTGCATCCTAGGATCCCTCTTCCTCAGCTTTGGTTTGATGGCCTCATCTCCTC 
+ + + + + + 

1330 13S0 1370 

CCCTGCAACCTCAAAATGTAAATAAACCCTTTCTCAGAGACTTCGGCAGAAAATTCCTCT 
+ + + + + + 

1390 1410 1430 

GACCTGCACTTGGACACAGCTCATCTGGGTTTGGGAGGTGTCAACTGTGTAAGGATGACT 



1450 1470 1490 

/ . 

CTGATCCCCATGTGGCTTTTCGACTGTGTCCCCTCTACAGTCAGTTATTAGCAGTGACTG 



1510 1530 1550 

TGCTAGGAAGTGAGCAACACACATATTCCCAGACCACATGGAGCTCAGGAGCTTGGGGAG 



1570 1590 1610 

AGAGACAGGGAAGTGGACGACTACAGGGCCTTCTGAAACGTGTTGCAGGGAGAAGTGTCA 



1630 1650 1670 

GTCAGGGGATG CTAACCTGGCTTTGGGTAAGGGAC AGC CTCTGAATG ACAGGACATTAAA 



1690 1710 1730 

GCCATGGCCTGCAGTTTAAGTAGGAGTTGGCCAGTTCGAGGTAAGAATACCAGTAAGCAA 



1750 1770 1790 
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G AACGCC AG AGTAGCTC CTCG AGC TGCCTT C TGTAC i_ ± jTCCACACTGAAGCCAGCC 
+ + + ■ + + + 



1810 1830 1850 

CCTCTGTGTTCAGCCTTGCTTTACTGAAGAGGTGTCGCTGAGGGGCTGCTCTGGGGTGCT 
+ + + + + + 



1870 1890 1910 

GCTCTGCTTTCCTGTCCCCAACTTGTTCTGAGCTCGAGCCACCTCCATACTGGTGCTCCT 
+ + + + + + 



1930 1950 1970 

GGTTCTCAGGCCTTTGAACTCAAACTGAATCACACCACTGGCTTTCCTCGTTCTCCAGCT 
+ + 4. + + + 



1990 2010 2D30 

TGCAGATGGCAGATTCGGGAACTTTTTGGCCTCCATAATCACGTGAGCCAATTGCTATAA 
+ + + + + + 



2050 "2070 2090 

TAAATATCTCTCTCCCTCTTTCTTCCTCTCTCTCTCTCTGTGCAAATATAGTTCCAATTA 



2110 2130 2150 

TAAGAGCCCCTAACTGGAAAATAACCCTATGGTGCACTGGTGAGTAGAGAAACTGTGGTT 
+ + + 4. 4. 4. 



2170 2190 2210 

CCCTCAAACCACCGAACACTATTCAGCAATACGAAGGAACAAACTATTGATATGCAAAAT 



2230 2250 2270 

AGTGTAAATGAATCTCAAAAACATCGGAAAGAGGGAAGGAAGCCAGACACAGAAGAGTGC 
+ + + + - 4- + 



2290 2310 2330 

ATGCCGCATGATTCCATTTATATGAAATTCTAGAACAGGCAAAACTTATCTATAGACAGA 
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2350 2370 2390 

GAACAACAGATCAGTGGCTGTCTGGGGTTGGGAGTGGGGAAGTTTGGCTGGAAGGGCACA 
+ . + + + + + 

2410 2430 2450 

GGGCTCTTTCTGTGAGTGAGGGAATGTGTCTGCATTATAGTGATGCTTATGTAGTTATAT 



2470 2490 2510 

ACACTTATCGAAACTCATCTTACTGGCCACTTAAAATAAGTGCATTTTATTGTGTGTAAA 



2530 2550 2570 

TTAT AC CTT AATG AAGTTGATTTGAAAATC C AAAGTAGT AAT AATAAGTAAT AATCTCGT 



2590 2610 2630 

AGCTGGACAGCTGTGGTGACTCACTCCTGTAATTCCAGCGATTTGAGAAGCTGAGGCAGG 



2650 2670 2690 

AGGATCACTTAAGATCAGGAGTTCTTTTTATTTTTATTTTTATTTTTTGAGACGGAGTTT 



2710 2730 2750 

CGCTCTTGTTGCCCAGGCTGGAGTGCAATGGCATGATCTCGGCTCGCTGCAACCTCCACC 



2770 2790 2810 

TTCTGAGTTCAAGCGATTTTCCTGCCTCAGCCTCCCAAGTAGCTGGAACTACAGGCGCTC 



2830 2 85 0 2 87 0 

ACCACCATGCCCGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCAGCATGTTGGCG 
+ + + + + + 



2890 2910 2930 

AGACTGGTCTTGAACTCCTGACCTCCAGTGATCTGCCCGCCTCGGCCTCCCAAAGTGCTG 
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2950 2970 2990 

GGATTACAGGCATGAGACACTGCGCCTGGCCAAGACCAGGAGTTTGAGACCAGCCTGGGA 
+ + + + + + 

3010 3030 3050 

AAC AAAGTG AG ACC C C CTGTCTAC AGAAAAATTAAAAATTTTAG CTGGG C C TGGTG CCGT 
+ + + + + + 

3CT70 3090 3110 

GTGCCTGTAGTTCCAGCTACTCAGGAGGCTGAGGTGGGAGGATACCTTGAGCCCAGGATT 
__, + + + + + + 

3130 3150 3170 

TCAAGGCTGCAATGAGGCATGATCAGGCCACTGTCCTCTAGCGTGGGTGACAGAGTGAGA 
+ + + + + + 

3190 3210 3230 

C C CTGTCTCTAAATAATAATC AT AAGAACAAC AAGGACCCTCTAAACGCACTGATATCTA - 
+ + + + + 

3250 3270 3290 

AGGTGT ATT AAG CG ACC AAAAAAAAAAAGAAAATC AAAGTG C AG AAAAACGTTAATAAGA 
+ + + + + + 

3310 3330 3350 

GAAAAAAATATGTCTGTATTGTCTTGAGTGTGAAAAAATAATCTAAAAGCCTATGAAAGA 
+ + + + + + 

3370 3390 3410 

AACTAATCATATTGGTTTCCTGTTGGTGAGGAGGGCTAAGAGCACGGAGACTTTTCCCTA 
+ + + +- + . + 

3430 3450 3470 

TGCTTTCTGTACTTTTTGATTTTGAGATATGTGAATGTAGGTTTCTCTCACTGCTCGAAC 
+ + + + + + 

3490 3510 3530 
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TTTCACTAACCAAATTACTACATTCCAAATTCTCAAAAACAAATAGATTTACTTAAAAGT 



3550 3570 3590 

AGGCTGGGTGCGGTGTCTCACGCCTGTAATTCCAGCGCTTTGGGAGGCCGAGGCGGGCAG 
+ + + + + + 



3610 3630 3650 

ATCACCTGAGGTCGGGAGTTCGAGACCAGCCTGACCAACATGGAGAAACCCCATCTCTAC 



3670 3690 3710 

TAAAAATACAAAATTAGCCAGGCGTGGTGGCGAATGCCTGTAATGCCAGCTACTCGGGAG 



3730 3750 3770 

G CTGAGGCAGAAGAATCACTTGAATCTGGGAGGCAGAGGTTGCAGTGAGC CCAGATCATG 



3790 3810 3830 

CCATTGCACTCCAGTCTGGGTAACAAGAGAGAAACTCTGTCTCAAAAAAAAAAAAAAAAA 
+ + 4- + + + 



3850 3870 3890 

AAAAGATTTGCTTAAAAGTTAACATCTCCGGCCGGGCGCGGTGGCTCATGCCTGTAATCC 



3910 3930 3950 

CAGCGCTTTGAGAGGCCGAGGCGGGTGGATCACGAGATCAGGAGATTGAGACCATCCTGG 



3970 3990 4010 

CCAAAATGGTGAAACCTCGTCTCTGCTAAAAATACAAAAGTTAGCTGGGGGTGGTAGCGC 
+ + + + + + 



4030 4050 4070 

GCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCAGGGAGT 
^ + + + + + + 
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4090 4110 4130 

CGGAGGTTGCAGTGAGCCAAGATCGCGCCGCTGCACTCCAGCCTGGCGACAGAGGGAGAC 



4150 4170 4190 

TCCATCTCAAAAAAAAAAAAAAAAAAAAAGTTAACATCTCATCCAAATTTGCACCGAGTA 
+ + . + + + + 



4210 4230 4250 

GGAAAACAAAAGTTTAAAACATGAAACAGATGTTACTGAGGCCGAAAGGGTCTCCCAGGC ■ 



4270 4290 4310 

CTGGGAGTCTGCAGCTTTTATGCAATTCTGCCCTCTGGCCACCGCCAGGGAAGAAAGGTT 



4330 4350 4370 

GTCTCCGTCTGCTGCATCGCCTTTGCCCAGGAATGAAGCCCCCAAGAGAGCGGCAGCCGG 



4390 4410 4430 

TTGCCTGAACCTTCCTATCCTTGGGGGCACCCAGTGCAGGTGGATGACCCGACTCAACCT 
+ + + + + + 



4450 4470 4490 

CCGCCAGGGCACCCTCGGGGCAGGACGGGTAGCAAGGAGGGGACAGAGATCGGCCCCAGG 



4510 4530 4550 

AGACCACGGAAGATCGCGCTCCTGGGGCCAACTTCAGCAGCGAGAGGCGGCCTTTGCCCA 
+ + + + + + 



4570 4590 4610 

CCGCCTCATCCCACCACGCCGCGGTCCTCCAAGAACCTTCCCAGCGGTTCTCTCCTCCTC 
+ + + + + + 



4630 4650 4670 

TCAGGAGTAGAGGCCCTCTGAGACCGACGGGGAGGGACGGCTCGGGCCGGTCATCCGAGG 
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4690 4710 4730 

GGCCGCACGGATTCCCTCCTCCGCCCAGCTCCACCCCCTCGAGGGGCGGCGGTCCGGGAG 
+ + + + . + + 



4750 4770 4790 

TGGCGACCCGGCTCCCCCATGGCGGGCGCCGTCGGGGCCCCTGGCCAGGCTCCGAGCGGG 



4810 4830 4850 

GTTGGCGGGGAGGGGAGGCGGGAGCGAGGGCGGGCGGTGGGAGGTGGGGGCGGGAAGGTC 



4870 4890 4910 

CGAAGGCGGCGGCCTGAGGCTGCACCGGGCACGGGTCGGCCGCAATCCAGCCTGGGCGGA 



4930 4950 4970 

GCCGGAGTTGCGAGCCGCTGCCTAGAGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCC 



4990 5010 5030 

CGGAGAGCTCGGGGGACCCCGTTGCTGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTG 



CCAGGCGCCGGGGTGCTTCAAGGTGAGGACGCGGGCGGGGTGCGCCCTGAGGGGCAGGCT 



5110 5130 5150 

AGGCGCGGTGGTGGTGGCGGGGATGGGTTCTGCTCAGAGCTCGGGTCAGCGCGCGGAGGG 



5170 5190 5210 

TCTCACGGCCCCGGCACCATACGGCCAGTAGGTCAGGGCGTGGGGACTCTTTGGGGGGGT 



5230 5250 5270 
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CTCCGTGGGACCTGCCCAGGGACGCTCAAGTGTGCTTGGGCTGGCCCCGGGCCCGGACTT 



5290 5310 5330 

GCCCACACTGCCCGGCTGCCACTCCGCTGGCAAAGCAGAGGGCATGGCTCCCTCCCCCTC 

+ + + + ■ + + 



5350 5370 5390 

GGGGACAGCCCAGCCCCCAGCCCCCAGCCCATAGCCGTAGCCCCCTCTGCCTGGATTCTC 



5410 5430 5450 

GCTCTCACAACCAGCTTCCATCCGCAGGCCACCGTGTGACCCGCTCCTGCTCCTCCACCC 
+ + + + + + 



5470 5490 5510 

CTTAGGACTCAGCGGGGCTCCATCCTCTAGGAAGCCCCCATGCCCAAGAGTCCCCCAGAG 



5530 5550 5570 

TCCCTGCTTTGCTCTCAGGCTGCAGAACTAGCTGTGGCCTCCACCCTGCTCACCCCTCGT 



5590 5610 5630 

CCCTCCTCCCAGGGCAGCAGGGCAGTGTGTATGTTGTTTATATTGTTGCCTTGTTTGGTG 
+ + + + + + 



5650 5670 5690 

AGATAGAGAAGGGCCTCTCCAGATAGAAGGTGTCTGTTTAGCAGTGCTCTGGAAAGACTG 
+ + + + + + 

5710 5730 5750 

CAGCTGTCTCCTCGGGGTAACCCCTCCAAACAAAGATGTTAAGATGGGGCTGGAACAACC 
+ + + + + + 

5770 5790 5810 

TCTGCAAGCGGGTGGGAGGATTAGCCAGTCCTGCACAGCAAGTGCCTGGCCGGGAACAGG 
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5830 5850 5870 

GAGGGCAACCAGGGAGGGGGCATGCGGGGCTGGGCTGTGCTATGCAGACTGGGCGGTGGC 
+ + + + + + 



5890 5910 5930 

TTCCACAGCACTGTGTGGGGACCAAACAGGTACAGGGGCCTGGTCTGTTCTGGCCCCAGG 



5950 5970 5990 

GGAGGGCCCCAGGCGGTCCACTGCTCCCTCCCCTCTGAGCCCTATCCTGGGGTCAGGGGA 
+ + + + + + 



6010 6030 6050 

GGTGATGGGACCCCTGGGAGAGGGGCGTCTATGTGCCCAATACCAGCCTGGCTCCCTCGG 
+ + + . + + + 



6070 60.90 6110 

GTTCCACCCCCATTCACCCGGTCACCGGAGCTCCAGCTCCAGCTCCAGCTCTGCCCCTCT 



6130 6150 6170 

CTCCCTCATTGGGGTCAGGGTGCCCGTGGCCAGCACGTGCGCGCAAGGCCATGTGGACAG 
+ + + + + + 



6190 6210 6230 

CACCCACACACCACACTGCACCCACACCACACCTGTGCCCGGGCCCACCCTACCTCTTCC 



6250 6270 6290 

CCAAACCCTTAGAGGCCTAGGAGCAGCAAAGCTTGGTTCTCTACTCTGAGTTAAGTGCTC 
+ + + + + + 



6310 6330 6350 

TCTGGGCTGAGAGACCTCCCCTCCTTCCCCTCCCCGACATCCACTCAGAGCCCTCCCTGC 
_ + + + + + + 



6370 6390 6410 

ACTGGCCCCTCTAGCCTCCTTTCCAAGGTGGCAGACTCCTCTCGGCCGTGATCTGCCTGA 
h + + + + + 
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6430 6450 6470 

TGGCAATTCACTCATCCAATCAAGGAGGGCTTCTTGGAGGAAGGGTCTTTGATGTTTGTA 
+ + + + + + 



6490 6510 6530 

GTCTGGGAGAGAAGGTGGAGGAGAAAAAAGGAGTTGGGGTGGCCTAGCAGGAGCTGAGTC 
+ + + + + + 



6550 6570 6590 

ACTTCCACAGGCAGCCATCAGCCCAGCAGGACTGAGGCCAGGGCTGCGTGGAGGGGGGAG 
+ - + + + + - - + 



6610 6630 6650 

GCTGTCTGTTCTGGGAGCTGGGACTGGGTACCGGGGGAAGGAGGGCTGCTGCAGGCTCTG 



6670 6690 6710 

GGTGCCTGGGGCCTGGCTCCTGCAGGGCGGGCCTGTGAGAGTGGTTGGGGCCAGTGGAGG 



6730 6750 6770 

GGCTGGGAGCATTCCAGGGGAACATTCCAGGCGCCCTCTGAGTAATGCTTGGCTCTGGGA 



6790 6810 6830 

TTCCTCCTAGAGCCCCCTTAGGCACACCCGGCCAGGGAGCACCAAGGCTCCGTCCGGAAG 
+ + + + + + 



6850 6870 6890 

CGTCCCCTCCCCTTGAAGAGATGAGGAGGGGCCTTCTGGGCCAGGGTACCAAAACCCTGC 



6910 6930 6950 

CACCAGGACAGAGTCCCCGAGGGAGCTCTGGGCAAGGTGGACCTGGCAAGGCAACATCTG 
+ + + + + + 



6970 6990 7010 
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GCTGTTGTTTTTCTCAGATGATGGGGGGGGCACAAGTGTCCTCTCTTCGTACATCTCTCA 



7030 7050 7070 

CCCTAAAGGCATCTGCTGCCCATCTAAAAATCCCTAAGGCTGCCGCGCTCTTTCCTTCCC 



7090 7110 7130 

CTCTGCACTGGCGGCCTTGGCCTCTTCCTTGTGATGGCCGAGCCCAAGCCTGCCCCCCGA 



7150 7170 7190 

CAAAGGTCAGGGGACTCCCGTGTCCCCAGCTGAGCTGTCCCTTTCCAGCCTTCTCTTTTC 



7210 7230 7250 

CTCCTCCTTGATAGCTCCTCAGATCCAAGGATGCCCACGGGCGTCCCTCCTTCTCCAGGC 



7270 7290 7310 

TGAGCCCACGCGTGTTGAAGGTGAAGTCTGCCCCAAAAGGCCTCCAGTGCCTCCCTGGGG 
+ + + + + + 



7330 7350 7370 

ATGTCCTCTACCCCCCTCCCTCTGCTTTGTCCCATGCCCCTGTGTTCCTCAGGTCCCCCT 



7390 7410 7430 

CACCCTGTGCTCTGTCTTTACTCCAGGACATATCCCTGGGCAGCCAGTCACCCCGCACTG 
+ + + + + + 



7450 7470 7490 

GGTCCTGGATGGACAACCCTGGCGCACCGTCAGCCTGGAGGAGCCGGTCAGTGCCATGTC 



7510 7530 7550 
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7570 7590 7610 

CTTCTAGTTTTGCTGGGGTCCCCAGAACCACCAAGTCACTACTCCTATAGGCCCCTGCCT 
+ + + + + + 



7630 7650 7670 

CCCCTGCCCCTCAAGTGGGCAGAAGAAGGCACTGGGGTTTGGACATCTGGATCTCGTGAG 



7690 7710 7730 

CCCGCACACATGGAAGTCATTTCAGCTTTCTCCACCCCACCTCCCTCTTTCCCTCCCTCC 



7750 7770 7790 

C TGG ATGAT C TGGG C C ACCCCC ACCC C C AC C AGGCAGAAATGGGT C CAG AGTTTGTGGGT 
+ + + + + + 



7810 7830 7850 

CCTGAAGCTTTTCAGGAGCCTCTAAAAAAAAAAAAAAAAAAAAAAGCAC CAAAAAGAAAA 
+ + + + + + 



7870 7890 7910 

CCTTTTGCAAAGTTGACCAGAACATGTGACCCTGTGGACACACTGCTGTCCCTCTCAGGG 



7930 7950 7970 

CCCTGCCACGAAGGCCTGAACCTTCAGCCTCACTGGCTCCTGTGGAATCCACTTCTGGTA 



7990 8010 8030 

TGGGGGGGGCAGTGGTCACTCTCCTGATGTCCCCCAGATGTAAGAGCACCCCATGTGCTT 
+ + + + + + 



8050 8070 8090 

CTTCTGCAGGACGCTCTGCCCCAGCCTCTTCCCAATCCCGCTCTTCACACGCTTCCAGAA 
+ + ■ + + + 1- 



8110 8130 8150 

TAACCATGCCCCATCTGTTTGTGCCATAATATCTGTGCTGCAAACTAAGAGGGCAGTAGC 
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8170 8190 8210 

CTTGATATGCTCATTTTACAGAGGGGCAAACGGAAGCCCAGAGAGCTTGGGGAAATTGTC 
+ + + + + + 



8230 8250 8270 

CATGGTCACACAGCTCTTTAGGCTGGGAGCCTGAGACCCACTAAGGTCTGAACGATTTTA 



8290 8310 8330 

AACCATTGGCTACACCCCTGCCCCTCCTAGAGAGCCCTCTTTGTTTGGAATTTTCAGCCC 



8350 8370 8390 

TACTGTCCAAATCCAGCAAGAGGGAAGGCAGGGGAGCATTGCCATGAAGGCTGAGAGGCC 
+ + h -4- + + 



8410 8430 8450 

CCCAGAGACCCAGCAGCTCCCAACCCAGGGCCCTCACTGGGATCCCCTAGGCCCATAAGG 



8470 8490 8510 

CCCCCATTCGACTGGTCAAGCACGGCACTGGCCTGAGCTTTGAGATTGCCCTCCCCATCC 



8530 8550 8570 

CCAGGAGGGGAAGGCTGGACACACACTGGGGTCACTCTGCCTCTGGGCCTCCCTGTCTGT 



8590 8610 8630 

CTGGCCTGGGCTGTGACCAAGAGGAGAGCCCCAAAGGGGCTCTGCTTCCCCCACCGGTGG 



8650 8670 8690 

GCCCCTGCCCCCAGGAAGCCTGCCAAGATGGTACAGAAGAAAGAGTAGAGGCTAGGTATC 



8730 8750 
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CCCTCCAAAAGGCAGGAAACACTCACMTTCAAGATGAGGGGTATATATCAAGGGGCAGG 
+ + + + + + 



8770 8790 8810 

GT AC C AGG AGGG C AAGAGT AAAG AT AGC AGGGGC TGC AG AGG AAC AGG G AC C T CGAGT AT 

+ + + + + + 



8830 8850 8870 

GGCCTTTTTCCCGGTGCAGACCTTTCCCCAATAAAGCAAGTGGCATTCCAGCCTCATGAG 



8890 8910 8930 

CTCATGCTGGAGGCCTTGTGGGGCCTGTGGCCAGGGAGGCAAGGACCATCTGCTCCCCAC 



8950 8970 8990 

TTGCGAAGGAAGAACTCCCTCCAAAGACTCTGAGACCCTTGGACAGGGCCCCAGGCCAGT 
4 + + + + + 



9010 9030 9050 

GCATTTTTGGAGAAAAGGAGTCGGGGGTTAAACATTCCGAAGGCGCAGCAGCCTCCCAGG 
+ + + + + + 



9070 9090 9110 

AAGCTCCTGGGCCGGCTCCAACTCTGGGCCCCCAGCCAGGCTGAGTGGACAAGGGGGAAG 
+ + ^ + + + - + 



9130 9150 9170 

TGGGGTGTTCCCACAGGGTGGGAGACGCCAAGAGGGTGGGGGAAGGAGAGAGGGCTGGCC 
+ + + + + + 



9190 9210 9230 

GTCCAAGCCAGCCTCCTGACACCTAGCTGAGAGCCAGTGTGCTCTCTTGGCTGGAATGGC 
+ + + + + + 



9250 9270 9290 

GTCCATGTTTACTTCGTGGGTCCAGTGAAGCAGGTGTCGGAGCCGGAGGGACGGGGGCTG 
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9310 9330 9350 

CTGGAGGCCCAGGAAAACTTTGGAAGAGGGAGCAGTTTGCCAAAATTGGAAGTGGAGGAG 



9370 9390 9410 

TCAAATTTGAATTCTATAGGAAATGAGCAGCAGCTCATTTGGAACCAAGCCTCAGGTAGC 



9430 9450 9470 

AGAGGCTCTGAGGAGGCCCTGACCATGGCTACCCGATGCCCCCATAATGTCCTCAGCACC 



9490 9510 9530 

CCTCTGTCTTCCCCTGCTTTTGATGCCCCTTCTGGGCATGAAAGAAGAGGGCGGGGCCAG 



9550 9570 9590 

GGGAGGGGCACCTTTCTGGGACCTCTGGTCTCTAGGGAGGATGCTGGTGTGCCTGGCAGG 



9610 9630 9650 

CTGTGCCAACGCCCTTCCAAGTGGCTGTTGTCAGGACTGCAAACATCCTGAGTTTGGGAA 



9670 9690 9710 

CATCTTTGTATGTTCTCACCTCCTCCACGCCCTCCATAGTATGTGGGGGGTCCTGCTGAC 



9730 9750 9770 

TCCCCAGCCCACGTTCTCCCCAAGAACTTCCTCCCCAGCCGGCTCCACAGGCCACCTACT 
+ + + + + + 



9790 9810 9830 

CCCTGGCAGGCAGGAGGCCTGGAGGCCACCATCTCAGCTCCACACTCTTTCTTGCCCAGG 
+ + + + + + 

9850 9870 9890 

TCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAAGGCCAGGAGCTCCTGCTTG 
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9910 9930 9950 

AGCTGGAGAAGAAGCAGTGAGTGCCAGGCTGGGGTAGGGCTGGGAGGAGGGGATCAGTGT 



9970 9990 10010 

TGGGGGGCAGGGACTGACACAGATCTGTGCGGGTGGCTGGATGGGCAGAGGACCCCAGAG 



10030 10050 10070 

AGGGTGCAGATGACAGGGAGAGTCACGCAGGCCTGTGGTTGGCTCCCTGGAGGCTGAAGA 



10090 10110 10130 

GGACCGCTGAGGCTGTCAGCCCCGCTGTGGGGCACCTCCGCCCTCCCAACCCCAGGAGCG 



10150 10170 10190 

GCTTGTTAGCTCCCTGCTGGCGATGAGTGAGCACCACCTAGTGGACATTTGCAAGATATG 



10210 10230 10250 

CTGAGTCTAAAGAAATCCTAGAGGGAAAAGATGAGCCGGCACCCCAGGCTAAGGGAATGG 



10270 10290 10310 

CAGGGACCAAGATGCGGTGGCTTTGGGAGGCCGAGGCGGGCGGCTCACCTGAGGTCAGGA 



10330 * 10350 10370 

GTTTGAGACCAGCCTTGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATT 
+ , + + - - ■ + - + + 

10390 10410 10430 

AGCCAGGCGTGGTGGCGGCGCCTGTAATCCCAGCTACTTAGGGGGCTGAGACGGGAGAAT 



10450 10470 10490 



FIG. 29.18 



WO 02/083077 PCT/US02/12063 

113/157 



CGCTTGAACCCCGGAGGTAGAGGTTGTGGTGAGCCAAGATCACACCACTGCACCACTCCG 



10510 10530 10550 

G C C TGGG C AAAG AGTG AGACTC CGTCTGAAAAAAAGAG AAAAAAAAG AAAAAGAAAAAAA 



10570 10590 10610 

AAAAGAAAGAAAAGAAAAAAGAAAAAGATGCAGTGGCTACACTTGGGGGCAGCAGTTTGT 



10630 10650 10670 

CTGACCTGCCTGGAAGGTCTCCATCTACAGGGAGGGGAGCAGGGGGGAATGAATTTGGAG 



10690 10710 10730 

AGTCCCAGGAGGGCCAGATCACAGAAGGCCATTTTGGTGCTCAGTGTCCTGGACCATCCA 



10750 10770 10790 

GAGCCAAAGATTTTGAGCTGGGGAAGGGACAGGCAGACCTGTGCTCAGGAAGGTGCCTTG 



10810 10830 10850 

GGCTGGGTGGGGTGGGTGTCCGGGCTGGAGCGCAGGCTCTTAAAACCACCCAGATTATGT 



10870 10890 10910 

TATCAGTATATATCACCTACTGAGTGCTTGACCGCAGGCGCTGTTCTGAGCACTTGACAC 



10930 10950 10970 

GTATTTTATTCTCCCTCGTGGAGTCGGATGGACAGGGAACAAACTCTAGTTCCACTGTGC 



10990 11010 11030 

CCAACCATATTTTCCCGACGTCCCTACCCTTTCAATGGGGTGGTCACATCACCTACCTCC 
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11050 11070 11090 

TAGGGTGGCGGGTGTGTGTGGGGCAGGGGTAGGGGGCAGAGCTGGGGCAGGTGGTGGAAT 



11110 11130 11150 

GCCTGGGAGGGGGGAAGCAGCCATCATTAGCGGGTGGTCTGGAGGTAATGAGGCCAAGGT 



11170 11190 11210 

G AGGTTGGG TT AAGG ATTTTC TTT AAAG AAG AC AG ATTG AC TT ATGATTG ATC C AT CCGT 



11230 11250 11270 

GTGGGAAAGATCCTGTTGAGATGGAGCCTGAAGATGGAATCATTACCGGAGTGGGTGTGG 



11290 11310 11330 

AGAAGGCAGGGAGGGTGGAAGCAGCGTGGGCAGGTGGCGATTCTGTTTTCTCTGGAGGCA 

+ , ~ + , - + + - + 



11350 11370 11390 

GGGGGTGAGCATCAATCACTGAAGGACAGGTGGGAGGTATGTGGGGTCTAGAAGTCTGAG 



11410 11430 11450 

GAAAATATTTC AAGG ATC TAGGGCAGGTGGGGGCAAGAGGGTCGACCAGATGCCCAACAA 



11470 11490 11510 

AGGAGGGCAGCAGGCAGGGGAACTGGGGGAGGTCACCGCATTTCCCCAACTCCAAGTCCC 



11530 11550 11570 

ATTCTTCGGCAGTGTCTCCTGACTCCTCCCCTCCCGATCCTGTGGATCCTGCTGCCTGCT 



11590 11610 11630 

GCAGGTCCCCTGGGAACCACAAACTCTTCCCCTATTCCCACTCCTCCCCGGCGTCCTCCC 
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11650 11670 11690 

TGGTGCTTCCCATATTCACATCTCC CACAACTAAGCC ATCAC CAAGGCTCCTT C CTCTAG 
+ + ~ + + + + 

11710 11730 11750 

CCCCAAGAGTTTCTGATCTGAGCAAGTCACCATTGCTCCTGTCCCTTCCCTAAGACACAC 
+ + + + + + 

11770 11790 11810 

TGTG AGTGT C TCACTC ATAAAG CTG CTC C AT TAGC ATTT AGGG AGGAAGGC TGG GAGACA 
+ + + + + + 

11830 11850 11870 

TCCTGGAGGAGGCAGGAGGAAGCTGAATTCAGTGTTCCCTGTAACACCCCCTCTCAGCAG 

— : + + — + + + — , + 

11890 11910 11930 

GCTGCTGGCCCCAGGATACATAGAAACCCACTACGGCCCAGATGGGCAGCCAGTGGTGCT 
+ + + + + + 

11950 11970 11990 

GGCCCCCAACCACACGGTGAGATGCTTCCATGGGCTCTGGGATGCACCGCCAGAGGTACC 
+ + + + + + 

12010 12030 12050 

CCCCCACCATTCCTACCCCTACTCCTCCTTGCATTCCTAAGGGGCGGTTGGAGCCAGCCC 
+ + + + + + 

12070 12090 12110 

CTACCACACCCTCCCTCTTGCCCCTCTTGCTCCAGCCCTGGCTGAGATTTGGGGCTGGCC 
+ + + + + - + 

12130 12150 12170 

CCTTCCTCCCTAGGATCATTGCCACTACCAAGGGCGAGTAAGGGGCTTCCCCGACTCCTG 
+ + - + + + +, 

12190 12210 12230 
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GGTAGTCCTCTGCACCTGCTCTGGGATGAGGTGAGCTCTGGGAGAGGAGGCTGGGCCTGG 



12250 12270 12290 

GATGGGGAAAGAGCTCCCTCACACCCGCTCCTACCCCTCTGCACCCTAGTGGCCTGATCA 



12310 12330 12350 

CCCTCAGCAGGAATGCCAGCTATTATCTGCGTCCCTGGCCACCCCGGGGCTCCAAGGACT 



12370 12390 12410 

TCTCAACCCACGAGATCTTTCGGATGGAGCAGCTGCTCACCTGGAAAGGAACCTGTGGCC 



12430 12450 12470 

ACAGGGATCCTGGGAACAAAGCGGGCATGACCAGCCTTCCTGGTGGTCCCCAGAGCAGGG 



12490 12510 12530 

TCAGGGGCATCGATCGGATGGGAGTGGGAATGCTGTATCTATAGCCCTCCAAATCAGAAG 



12550 12570 12590 

AGACAGGAATTCACAGGCCTCGAGTCCCAGTATTTTTATTGAAGTCTGAAGAAACAAGTT 
+ + + + + + 



12610 12630 12650 

CCAGAAAACATGTTAAACTTCCTTCTGGGAGCTGGGGTTGGGGGTCAGGGCTCAAGCCCA 



12670 12690 12710 

GCAGCTTCCACTCAGGGTCCCCATTTGCACCTCCGCAGGGCAGGCGAGAAGCGCGCAGGA 
+ + + + + + 



12730 12750 12770 

CCCGGAAGTAC CTGGAACTGTAC ATTGTGGCAGACCACACCCTGGTGAGGAGAGACCC C A 
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12790 12810 12830 

GGGGTTGGCGGGGTCAGGGATGGGGCCAGCTCAGCCCCTCAAGCCACCGGGATTTCTGCC 



12850 12870 12890 

TTCCCAGTTCTTGACTCGGCACCGAAACTTGAACCACACCAAACAGCGTCTCCTGGAAGT 



12910 12930 12950 

CGCCAACTACGTGGACCAGGTTGGGGGCGGCGGGGAGAGAGCGGTGATGGGGGTGGCGGC 



12970 12990 13010 

GGCAGGACAGGCAGGTGCTGGTGGGGTTTGGGGAAGAGGAAGGGCGCCCCACGAAGGACC 



13030 13050 13070 

ACCGGCGCGATGGGGCGCCCTGTCCCGGCTTCAGCCCCGCCTCGCCCTCAGCTTCTCAGG 



13090 * 13110 13130 

ACTCTGGACATTCAGGTGGCGCTGACCGGCCTGGAGGTGTGGACCGAGCGGGACCGCAGC 



13150 13170 13190 

CGCGTCACGCAGGACGCCAACGCCACGCTCTGGGCCTTCCTGCAGTGGCGCCGGGGGCTG 



13210 13230 13250 

TGGGCGCAGCGGCCCCACGACTCCGCGCAGCTGCTCACGTGGGTGCCTCTGACCGGGACG 



13270 13290 13310 

CGGGTCCCGGGTGGGGCGGCCTCACCTCCCGGCCCCGCCTGGTCACGCCGCGCTCCGCCC 



13330 13350 13370 

CCAGGGGCCGCGCCTTCCAGGGCGCCACAGTGGGCCTGGCGCCCGTCGAGGGCATGTGCC 
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+ + + + + - + 

13390 13410 13430 

GCGCCGAGAGCTCGGGAGGCGTGAGCACGGTGAGCCCCGCGGGCGGGGGCGAGGGAGAGA 
+ + + + + + 

13450 13470 13490 

CAGGAGGCTCTACGGCCGCAGTGACCGCCCTCCCACGGCCCCCCAGGACCACTCGGAGCT 
+ + + + + + 

13510 13530 13550 

CCCCATCGGCGCCGCAGGCACCATGGCCCATGAGATCGGCCACAGCCTCGGCCTCAGCCA 
+ + + + + + 

13570 13590 13610 

CGACCCCGACGGCTGCTGCGTGGAGGCTGCGGCCGAGTCCGGAGGCTGCGTCATGGCTGC 
+ + + + + + 

13630 13650 13670 

GGCCACCGGGTACGCGGGTGGGGGGTCGGGGCTGCGGCGGGGCGGCTAGTCCTGGGGACT 
+ - + + + + + 

13690 13710 13730 

TCCTCCGCTGCGTTTCTTTGGTCGTCCCTCAGTTTCCTCTTCTGTAAAATGGGGATAATG 
+ + + + + + 

13750 13770 13790 

ATCATAGTGTCCGCTTCAGGGTGGTTTATGAGGCTTAAAGGGAAGAAGCTCAGGCAAAGT 
+ + + + + + 

13810 13830 13850 

GGATTCTCAACGGTATGAAGATTATTTTCCGAGTAACCTGGCGAGGTTACTCCTACACCG 
+ + + + + + 

13870 13890 13910 

GGAGGAGCACCGTCGGGTCGCGATTCCACCTTGGGTCCCGGGCTGCTCACTATTGGGGCC 
1- ). (- j_ _ + h 

13930 13950 13970 
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GCATCGTCCCCTGTCCCGCTTGTTGTGTGACTTTGCGCGGGTTACTTCCCCTCTCTGGGC 



13990 14010 14030 

TCTGCGCGTCTGGCGGCTGTAGCCAAGCCCAGGGGTGGGGATCAGAGAAGCGCGGGGGTT 
+ + + + + + 



14050 14070 14090 

GGGGGACTGTCCCTCCATGCCCAATGCCCTCCCCGTGCCGGTAGGCACCCGTTTCCGCGC 



14110 14130 14150 

GTGTTCAGCGCCTGCAGCCGCCGCCAGCTGCGCGCCTTCTTCCGCAAGGGGGGCGGCGCT 



14170 14190 14210 

TGCCTCTCCAATGCCCCGGACCCCGGACTCCCGGTGCCGCCGGCGCTCTGCGGGAACGGC 



14230 14250 14270 

TTCGTGGAAGCGGGCGAGGAGTGTGACTGCGGCCCTGGCCAGGTTAAGTCGGCTCGCCCG 



14290 14310 14330 

GCCCCCACTTGCCCTCTCCGCTCAGGTCTGGGGCGCTGCGCCCTCACCTGGGCCCTTCTT 



14350 14370 14390 

GCCTTTCTGGTCCCAGGAGTGCCGCGACCTCTGCTGCTTTGCTCACAACTGCTCGCTGCG 



14410 14430 14450 

CCCGGGGGCCCAGTGCGCCCACGGGGACTGCTGCGTGCGCTGCCTGGTGAGGGCATGGAA 



14470 14490 14510 

GGTTCAGGGTGAGGGTTTCGGGGAGCTTGGGAGCCGGCCTGTTGGCCTTAGTTAATTGGT 
+ + + + + + 
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14530 14550 14570 

G C C CTC AG GTTCCC CCGTTGGGTG CTGGG CTTGGGTAGGCCTGG CTC C C C C AG C T C CGAG 
+ + + + + + 



14590 14610 14630 

CCGCGCTCTCGGCATGGACCTCTCACTGCACGTGGCCTCTCTCTGCCTTCCCCACCACCC 
+ , + + + + + 



14650 14670 14690 

GTCACCTGCGCAGCTGAAGCCGGCTGGAGCGCTGTGCCGCCAGGCCATGGGTGACTGTGA 



14710 • 14730 14750 

CCTCCCTGAGTTTTGCACGGGCACCTCCTCCCACTGTCCCCCAGACGTTTACCTACTGGA 
+ + + + + + 



14770 14790 14810 

CGGCTCACGCTGTGCCAGGGGCAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGA 



14830 14850 14870 

GCAGCAGTGCCAGCAGCTCTGGGGGCCTGGTGAGAGGACACGAGCACCCTTGCACCCTGC 



14890 14910 14930 

CCCCCATCCTCTGGTGGGGCCAGTTTTCTACTGTGGGGAAGATGGGCAGGGGAAACTGAG 



14950 14970 14990 

GCCCGCTGAGCGCAGCCCCTCTCCGAGCTGCCCCCAGCCTGGCCCATGCTTCCTCAGGCT 



15010 15030 15050 

CCCACCCAGCTCCCGAGGCCTGTTTCCAGGTGGTGAACTCTGCGGGAGATGCTCATGGAA 



15070 15090 15110 

ACTGCGGCCAGGACAGCGAGGGCCACTTCCTGCCCTGTGCAGGGAGGTAGGGAGTGGAGC 
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15130 15150 15170 

TGAGTGGAGGGAGCAGAAGCTATGGAGTGGGTTTGGGGAAGGGGGGTACTGCAGCTGTTG 



15190 15210 15230 

ACCCCCCTCTACTTCCTCCCCAGGGATGCCCTGTGTGGGAAGCTGCAGTGCCAGGGTGGA 



15250 15270 15290 

AAGCCCAGCCTGCTCGCACCGCACATGGTGCCAGTGGACTCTACCGTTCACCTAGATGGC 



15310 15330 15350 

CAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCTGGACCTGCTTGGC 



15370 15390 15410 

CTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGAGCTCTGCCCACCCG 



15430 15450 15470 

ACCCCTCCTTGCCGTTTGAATCCCGCAGGCCAGTGTCCCCCTCACTGCCTGGTGCACTGC 



15490 15510 15530 

CCGTAGGTGTGCCAGAGCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGC 



15550 15570 15590 

CTGACTGCCTGCCACAGCCACGGGGTGAGAGCCCGAGGAGTGGGGGTGACCTTGGGGTTC 
+ + - + + + + 



15610 15630 15650 

CTAATCCTACGTGACCCTCCTCTTCTCTTCTCTGCAGGTTTGCAATAGCAACCATAACTG 
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15670 15690 15710 

CCACTGTGCTCCAGGGTGGGCTCCACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCAT 
+ + + + + + 



15730 15750 15770 

GGACAGTGGCCCTGTGCAGGCTGAAAGTATGCCAGTGGGGGGCATGTGGGCAGGAGCTGG 



15790 15810 15830 

GGTGGTGCACCTGCTCAGGACTCAGCGCCCCTTCCCCCAATCCCCGCAGACCATGACACC 
+ + + + + + 



15850 15870 15890 

TTCCTGCTGGCCATGCTCCTCAGCGTCCTGCTGCCTCTGCTCCCAGGGGCCGGCCTGGCC 
+ + + + + + 



15910 15930 15950 

TGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTGGGGCTGGAGAAGG 



15970 15990 16010 

GACCCTGCGTGCAGTGGGTAGGCTCCGAGCGCCTGCTTCCTGAGCCTACTCCTGCGGTTC 
+ + + + + + 



16030 16050 16070 

CCCTCCTCAGAGCTCTGCTGGGGCTGTGGGAGCTGGGGCAGGCCCTCAGCCTTGCCCCCA 

+ + + + + + 

16090 16110 16130 

GGTGCAGAGAGCAGCCCCAGAGGCCATGGAAAGAAGTAGCTTTGAACAGGAGGTTCCAGT 
+ + -- + + + + 

16150 16170 16190 

GGCCTCCCAGTCAAGCGAGGGGGTGGATCCCTGCCCCACCACCAGCACCGCAAGGCATGG 
+ + + + + + 

16210 16230 16250 

CCCTCTACCTCCCAGTACAGCTCCTCTTGTCCACTCTCCTGCTTCTCCCACCAGCTGGCT 
+ + + + + + 
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16270, 16290 16310 

GCCTCACCCTTGACTTCGCCCTGTTTTTCCCTGGCTCAGATTGCAGTCCCTGTACCATGC 
+ + + + + + 



16330 16350 16370 

TGCCCCCGGAGGCCTGTCCAGCCTCTGTCTCACCAGTTTTCGGCCCTTTGCCACTTCCTC 
+ + - + + + + 



16390 16410 16430 

TGCACAAATCACCTCTGTCACCCCCTTGAAGTTCCCAAATGCTGGGCCCAGCACATCTTT 
+ + + + + + 

16450 16470 16490 

TCACTCCATACCACTGGTCAGCTGCGGTGCTGGCTGCCCCTGTGCCAGGGCCCTGCCTTA 
+ + + + + + 

16510 16530 16550 

ACCCAGTTCTCTGTGACCTGGGTGGTGGCGGAGTGGGGAGTCACATAATACTAAGCATGG 
+ + _ + _ : + + + 

16570 16590 16610 

CTGTCCTAGGACTCACCCTGCACCAGGGCCCTAGGCAGGGCAGGCACTCTGTGGCCATGT 
+ + + + + + 

16630 16650 16670 

CTGACATAGCCTGGTCTTGGGAGTGCTCCGGGCAAGCCAAGGGAGATGGCATGATTTGGG 
+ + + + + + 

16690 16710 16730 

CCAGAGATGGGGGCAGAGGGCATAACAGACAGGGGCAGGGCACCACCTGGGCCGCGGGTG 
+ + + + + + 

16750 16770 16790 

GCAGCTAAGAGGACCCTGACAAAGCGAGTTGTGATTGAGGGTCTGTGGGCAGAGGAGCAA 
+ + + + + + 

16810 16830 16850 
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GGTGGCCAGAGCCTGGCGTGTCAGCACGGAGGGGGCGCTGCAGAGGGTGGCGGCTGCTTC 

+ + + + + + 



16870 16890 16910 

TCATCCCCAGGCGGGAGTCTCAGGGCAGGGGAGAATGTTTTGAAGGAACATCACAGGAAA 
+ + + + + + 



16930 16950 16970 

TGACAAGGCCTTGGGGGATGGGATGGGGACAGTCAAAGATGGCTTGGAATCATCAAGGGC 
+ + + + + + 



16990 17010 17030 

AGCAGGGCACCCAGGGGCAAGGAGAGCAGACATAGCTGCCGAAGGGGCGGACATCCAAGG 



17050 17070 17090 

TTCTTTGGAAGCTGAGCGATGCCAGCATCTGGAGAGTGCCAGGCTGCTGGGTGGTGTCAG 



17110 17130 17150 

AGCCTGGAGGAAATGTTAGGACTAGAGAGAGGAGGTGCCAGCCGAGGGCATGAGGCTCAC 



17170 17190 17210 

TTGGAGCCTGGATCCCAAGGCTCCCCTGAAGAGGGAGCAGGAAGGGAGCTGAGAGGGTGA 



17230 17250 17270 

CTTGGAGCAGATGGGTGCCCCAAGAAACTCAGTAAACGCAGAACTCCCTGGGCTGGACAC 
+ + + + + + 



17290 17310 17330 

CATGCTGCGGGGAGGCAATAACCCACTCAGGATCACTGTGCCAACCTCCTGGACTCTTAT 



17350 17370 17390 

CACGTTGCTCAGCCCCAAAGATGGCCCACACAGGGACCACCCCCTGGGCGGCGTTCACCC 
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17410 17430 17450 

CATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCTGGGTGAGTGAGGCACCAG 
+ + + + + + 

17470 17490 17510 

GGGGAGGTGGAGAGGGAAGGGAGAAGGGAAGGGCTCATGCCTCCTGCCTCCTTCCAGATG 
+ + + + + + 

17530 17550 17570 

GGCAGCACCCAGTCACCTTGAGTCCCCTATGCCCCTCCCCAGCCCCAGGGTCTCCTGCTG 
+ + + + + + 

17590 17610 17630 

ACCATATTCACAACATTTACCCTCCACCATTTCTCCCAGACCCTGAGAACTCTCATGAGC 



17650 17670 17690 

CCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTCGCCTGACCCCCAAGGTAGGCAGG 



17710 17730 17750 

GACCTGGATTCAAAGCCTCCCCCTCTCATCGCCCACCCTCCCACCTCTCCCACCCCTCAG 
+ + + + + + 



17770 17790 17810 

TTTGCTGCCCCCTAATCAGGTTTCTGGGCTCAGGTTATTATGGAAATGAGTTTATGACCT 



17830 17850 17870 

CTTGGTTATCATGGAGACCAGGATGCTGGAAGCCCCTGGGCTGGGGAGGGAGAAGCTGTG 



17890 17910 17930 

GCTTTTCCTGGATCACTGGTCCTCACTGAGTGAGG^TGGGCTCTCTGCCACACAGCTTGC 
+ + + + + -- + 

17950 17970 17990 

AGCCTGGGGCCCCAGTCCTTAGGGGACAACATATCCTCCTCATTCTCAGCAGATCAAGTC 
+ + + + + + 
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18010 18030 18050 

CAG AT GC C AAGATCCTG C CTCTGGTGAGAGGTAGCTCCTAAAATGAAC AG ATTT AAAG AC 
+ . + + + + + 

18070 18090 18110 

AGGTGGCCACTGACAGC CACTGCAGGAAGTTGAACTGCAGGGGCAGAGC CAGTGAATCAC 
+ + + + + + 

18130 18150 18170 

CGGACCTCCAGCACCTGCAGGCAGCTTGGAAGTTTCTTCCCCGAGTGGAGCTTCGACCCA 
+ + + + + + 

18190 18210 18230 

CCCACTCCAGGAACCCAGAGCCACATTAGAAGTTCCTGAGGGCTGGAGAACACTGCTGGG 
+ + + + + + 

18250 18270 18290 

CACACTCTCCAGCTCAATAAACGATCAGTCCCAGAAGCAAAGGTCACACAGCCCCTGACC 
- + + + + + + 

18310 18330 18350 

TCCCTCACCAGTGGAGGCTGGGTAGTGCTGGCCATCCCAAAAGGGCTCTGTCCTGGGAGT 
+ + + + + + 

18370 18390 18410 

CTGGTGTGTCTCCTACATGCAATTTCCACGGACCCAGCTCTGTGGAGGGCATGACTGCTG 
+ + + + + + 

18430 18450 18470 

GCCAGAAGCTAGTGGTCCTGGGGCCCTATGGTTCGACTGAGTCCACACTCCCCTGCAGCC 
+ + + + + : + 

18490 18510 18530 

TGGCTGGCCTCTGCAAACAAACATAATTTTGGGGACCTTCCTTCCTGTTTGTTCCCACCC 
+ + - + + + - + 

18550 18570 18590 



FIG. 29.32 



WO 02/083077 



127/157 



PCT/US02/12063 



TGTCTTCTCCCCTAGGTGGTTCCTGAGCGCCCACCCCCAATCCCAGTGCTACACCTGAGG 



18610 18630 18650 

TTCTGGAGCTCAGAATCTGACAGCCTCTCCCCCATTCTGTGTGTGTCGGGGGGACAGAGG 
+ + + + + + 

18670 18690 18710 

GAACCATTTAAGAAAAGATACCAAAGTAGAAGTCAAAAGAAAGACATGTTGGCTATAGGC 
+ + + + + + 

18730 18750 18770 

GTGGTGGCTCATGCCTATAATCCCAGCACTTTGGGAAGCCGGGGTAGGAGGATCACCAGA 
+ + + + + + 

18790 18810 18830 

GGCCAGCAGGTCCACACCAGCCTGGGCAACACAGCAAGACACCGCATCTACAGAAAAATT 
+ + + + + + 

18850 18870 18890 

TTAAAATTAGCTGGGCGTGGTGGTGTGTACCTGTAGGCCTAGCTGCTCAGGAGGCTGAAG 
+ + + + + + 

18910 18930 18950 

CAGGAGGATCACTTGAGCCTGAGTTCAACACTGCAGTGAGCTATGGTGGCACCACTGCAC 
+ + + + + + 

18970 18990 19010 

TCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAATAAATTTTAAAAAGACATATTA 
+ + + + + + 

19030 19050 19070 

ACTTGGACCTTGGTTAGTCTTTTCTGTATGTAAATTCAACCCATGGGGTGCCCTGAGGAC 
+ + , + + + + 

19090 19110 19130 

CCACACGGGGTGGTGGTTGGCGGGGTGGTGGTTGGTGGGGTGGTGGCTGACGGGGTGGTG 
+ + - + + + + 
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19150 19170 19190 

GCTGGCAGGCCGAGCCTAGATGGCAGCCAGAGCCCCAGGCATGTGTCTGGGCACAGGACG 
+ . + + + + + 

19210 19230 19250 

GTGTTGCCTAGTTTGAACACCCTCTTTGCTCTGTCACTCCTGG£TCCCTTGGGCGTTCAC 

+ + + + + + 

19270 19290 19310 

ATTCTCCCATTGCTTCATGCAAGAGCTGCTGAGTGGCCTATATCAGCCAGCTGTTGCCGC 
+ - + + + + + 

19330 19350 19370 

ATAACAAAACCATCCCAAAACTGAGTGCAGGGAGGCAACTTCACCTCGGGCTCCACTCCA 
+ + + + + + 

19390 19410 19430 

CAAGCCCAAGGGGCCAGGTGAGAGTGCTCTCTAAAGCCCCCTCCTGCCTCAGTTGTAGTT 
+ + + + + + 

19450 19470 19490 

GCAAAATTTTAATTTATGAAGGTGACTGATGACACAGAGGCCAATGCTGTTGAAATAAGT 
+ ^+ + + - + 4- 

19510 19530 19550 

TATTACTCACAGTTTCCCACCATGCAGGGCCACAGTGGGGAGGCACTAGGTTTGGTCCAG 
+ + + + + + 

19570 19590 19610 

GGACAGAATCAGGAGCGAGTGGAAGGCACAGGCCACAGCCCACAGTGCCGTTTCCACTGG 
+ + + + + + 

19630 19650 19670 

GGAGGCAAGGCAGGCCAGGGGAAGAGGGTAGGATTGGCATTTTGAATCATTCTGGTGGGG 
+ + + + H + 

19690 19710 19730 

TTTGGGGCGTGGGGTTGGGCTCTAATTGTCTGGGTAGGTGCCTGGCCCTGAGCTGGTTTA 
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19750 19770 19790 

GGGCAGGGGA?VATACTGGTTTCGTATGTGAGAGTTCCTTGAAGGGGGTGGTTGGTGTATG 
+ + + + + + 

19S10 19830 19850 

GACTCAAGACTGGTCGGTTTGCATATGAAAGGCATGAGTTGTTTCTGATCTCCAGGAATC 
. + + + + + + 

19870 19890 19910 

AAG C AGTTT C TCTC CAG C CAAC AAG CC C C CAC C C CGAG ATGTTAAAC CATC ATAAAATAG 
+ + + + + + 

19930 19950 19970 

AGAATCTAAGGCCAGGCATGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAG 
+ + + 4- + + 

19990 20010 20030 

GCGGGAGGATCATTTGAGGTCAGAAGTTCGAGACCAGCCTGGCCAATGTGGTGAAACCCC 
+ + + + + + 

20050 20070 20090 

ATCTCTACTAAAAATACAAAAATTAGCCCGGTGTGGTGGCACGTGCCTATAATCCCAGCT 
+ + + + + + 

20110 20130 20150 

ACTCGGGAGGCTGCGGCAGGAGAATTGTTTGAACATGGGAGGTGGAGGTTGCAGTGAGCT 
+ + + + + ■ + 

20170 20190 20210 

GAGATCGTGCCACTGCACTCCAGCCTGGGCAACAAGAGCAAGACTCCGTCTCAAAAAAAA 
+ + + + + . + 

20230 20250 20270 

AAAAAAAAAAAAGAGAGACTCTAAAAATACACGTTAATATACCTCCCCCGCTCTTACCCT 
+ + __, + + + + 

20290 20310 20330 
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TCAGGAGGGGGTGTCTAGACCCCGCGGGACTCCAGCTACAAGGGACCCTGGGGAGGCCAA 



20350 20370 20390 

CTCTGCCCTCTTGGCTAATCCCCAAGACTGCCCAGCACCCCCTCCACCCCTTCTCCATTC 
+ + + + + + 



20410 20430 20450 

AGTGG CG AACC CTGGGG AGGC C ACGTGGG AAGGAAAG AGGG CT CTAAG AGGGG AGGCC CC 
4- + + + + + 

20470 20490 20510 

AGACTGGGGGAGAGGCCTGTCTGGAGCCCAGGATCACCTGGCTGTGCTGCAGAACTGGAG 
+ + + + + + 

20530 20550 20570 

AAGAGAAGCTCAGCAGAAAGGAGCTGGCATGGGGCCAACAGCAGAAAAGCAGGAGGCACG 
(. + + + + + 

20590 20610 20630 

CAGAAGTGACTGGGAAGCAGGAGGGTAGGCATGGACCCTGAGGCTGAGCAGGAGGTACTG 
+ + + + + + 

20650 20670 20690 

AGGGGCAGAGTGGACGCTGAGCTGGGGGTAGCGAGCGAGCCCAGCTCAGCTGTGACGCCC 
+ + + - + + + 

20710 20730 20750 

TCTGTTTGGCCACCCAACTACCAGCTACTTGGGCTGCCCCGGGAGGAACTGGGCTTCCTC 
+ + + + + + 

20770 20790 20810 

TGACATTCTGTGGCCTGCGGCCATCTGTCACACCTTCTTGTCTCTCTGCCCCTCCCTTGA 
+ + + + - + + 

20830 20850 20870 

CTTGTGGCACCCACAGACAGGTGGGAGAGTGTACCTGCCCTGTGTGGTCAGAGCTTGGTT 
+ + + + 4. + 
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20890 20910 20930 

TTGAGTTTCCTTCCCTCACCCCTCTTTCCTCCCACACGCCAAAACACAAGAGGATGTGTC 



20950 20970 20990 

AGAGGC CTGTGAACCAGAGCAACTCGATG CTG AATAGGGG C TG AG C AAAATAAGGCTG AG 



21010 21030 21050 

ACCTACTGGGCTGCGTTTCCAGACAGTTACAGCATTCTGCGTCACAGGATGAGATAGGAG 

+ + + + + + 



21070 21090 21110 

ATACAGGTCATAAAGACCTTGCTGATAAAATAGTTTGCAGTAGGCCAGGCGCGGTAGCTC 



21130 21150 21170 

ACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACCTGAGGTCAGAAGT 



21190 21210 21230 

TCGAGACCAGCCTGGCCAACAAGGTGAAACCTCATCTCTACTAAAAATACAAAAACTAGC 
_ + + + + + - + 



21250 21270 21290 

CAGGCATGGTGGTGTGTGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCG 
+ + + + + + 



21310 21330 21350 

CTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCTGAGATCATGTCACTGCACTCCAGCCTG 



21370 21390 21410 

GAGCGAGACTCCGTCTCAAAAACCAAACCAACCAAAAAAATCAGCTTGCAATATAGAAGC 



21430 21450 21470 

TGGCTAAAACCCACCCAAACCAAGATGGTGATGAGAGTGACCTCTGGTCGGTCCCGACTG 
+ + + + + + 
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21490 21510 21530 

CTACACTCCCACCAGCACCATGACAGGTTACAGATGCCATGGCAGTATCAGGAAGTTACC 
+ + + + + + 

21550 21570 21590 

ATATATGGTCTAAAAAGGGGAGACATGAACAATCCACCCCTGTTTAGCAGATCATCCAGA 



21610 21630 21650 

AACAACCATAAAAATGGGCAACCAGCAGCCCTCAGGGCTGCGCTCTCTATGGAGTAGCCA 



21670 21690 21710 

TTCTTTTATTCTTTTACTTTCTTAATAAATGTGCTTTCACTTTATGGACTCGTCTCAAAT 



21730 21750 21770 

TCTTTCTTGCACGAGATCCAAGAACCCTCTCCTGGGGTCTGAATCTGGACCCCTTTCCGG 
+ + + + + + 



21790 21810 21830 

TAACAGATGTCGTAGAGTGAAGCACAACCACTGCAGGGGCATCTTGGTTTACATTTTGCT 



21850 21870 21890 

TCAGCGGCCATGGTTAGCACAGCGGAAAGCACATCACAGTCTTCTGATTCATTAAAAAAA 

, + + + + + + 



21910 21930 21950 

TTAGGAAATGGACCACCACAAACCACAGACAGATGTACTGAGACAGGATAGGTAGTCAAG 
+ __ + + + + . + 



21970 21990 22010 

AAAGTGACCATGTTCTAGGCGCGCAGCAGCAACTGTGGTGACCGTACAGTCAACAAGCCT 



22030 22050 22070 
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CAGCACTGGCATTGCAATTGAGCTCATTCAAGCAAAGCTATCTTCAGCAGGGACTTCTCC 
. + , + + - + + + 



22090 22110 22130 

CTCTAGGCAGCAAGCGCATTTTTATTTTACCTGTCCTCAAACTGATCCTTTGCTCCTTAT 
+ + + + + + 



22150 22170 22190 

AACAGTAAGGAACACACCCCTGTGTGGAGATTTAAGATGCTAATGAGGCCAAGCGCAGTT 



22210 22230 22250 

GCTCACGTCTGTAATTCCAGCACTTTGGGAGGCAGAGGTGGGCGGGTCACTTGAGGTTAG 
+ + + + + + 



22270 22290 22310 

AAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTCTACTAAAAATACAAAAAT 



22330 22350 22370 

TAGCCGGGCATGGTGGCGGGCGTCTGTAATCCCAGCTACCTGGGAGGCTGAGGCAGAAGA 

+ _ + _. + + + + 



22390 22410 22430 

ATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAG 



22450 22470 22490 

CCTGAGGGAGAGAGAGAGCAAGACATCGTTTTTGTTTGTTTGTTTGTTTGTTTGTTTGTT 



22510 22530 22550 

TTTTAAAAAAAAGTCAAGACAAATCATAGTGGGGGCTTTTCTGGTCACTTTTTAAATCTT 



22570 22590 22610 

AGTGTTGAGACTTTATTTGAGACAGGGCCTTACTCTGTTGCCCAGGTTGGATGAGATTTT 
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22630 22650 22670 

TAACCTCAATATTTACTTATAGAATAACTTTTTGGTTAGTCAAAACAATGCTGTGTCTCA 
+ + + + + 



22690 22710 22730 

TTCTGATCAGAATAAAACATCAGAGAACTGAAGAGAAACATTCTGCAAAATAACTGGCCA 



22750 22770 22790 

GGATTCTTCAAAAGTGTCAAGGGTAAAGATAAGGAAAGATGAAGGAACTCCCAGATTGAG 
+ + + + - + + 



22810 22830 22850 

GAGAATAAGGAGACAACTGTGATGTGGGATCCTAGAATGGATCTTGGAACAGAAAAAGGA 



22870 22890 22910 

CATTAGTGGAAAAATGAGAAATGCAAAACAGTCTACAGTTTCGTTAACAGGATTGTACCA 
+ + + + + + 



22930 22950 22970 

AGGTTAGTTTCCTAGCTGTAATGATTGGACTATGATTAAGTAAGATGGACCATCAGGGGA 
+ + + + + + 



22990 23010 23030 

AGCTGGGTGAAGGGTGTAAGGAAAATGCTTACATTTTCCAACTTTTCTGCAAGTCTAAAA 



23050 23070 23090 

TTAGTCAACAATAAG AAGTTTAAAATAGGCC AGG CATGGTGG CTCACACCTGTCATC CTA 
+ + + + + + 



23110 23130 23150 

GCACTTTGAGAGGCCGAGGTGGGAGGATGGTTTGAGCCCAGGAGTTCAAGACCAGCCTGC 



23170 23190 23210 

GCAATAGAGCGAGACCCCAACTCTATTCAAAAAAATTTTTTTAAGTTTAAAATAGAATTA 
+ + + + + + 
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23230 23250 23270 

TATAAAAAAGG AAAAGAAAAATG C TGTTTC ATAGCGTTCCTAGTTTAG C ATGGGAG AG AC 



23290 23310 23330 

CAGGTCTCCCTGGGTGGTTGTCTGTGTGTTGCTGGGTGTGCGTGGCAGGGCTAGTGTGTT 
+ + + + + + 

23350 23370 23390 

GGGGTCCGTCTAGGCACATTCAGGCGCCGAATCCCGTGGCTCCCAGGTTTACCTGACGGT 
+ + + + + + 

23410 23430 23450 

GCAGCCTGGGGTGGAGACTTAATGAGGGCGGGGAGTTGCTGCAGCAAAGGCTCCTCCCAG 
+ + + + + + 

23470 23490 23510 

GGGTATCAGCGCAGACAGCTGGGTTTTCACTGTGCTCCTGCTCCAGAGGCACTAGGAAGG 
+ + + + + + 

23530 23550 23570 

GGGCGCCTATCAGACTAGGACTCTGCCAGCCATCCTTCTCTGTTGAAGGTCCAGC 
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10 30 50 

• • • • ■ 

CAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCCCGTTGCTGCTGCTGCTACTAC 

+ +« + h + + 

MetGlyTrpArgProArgArgAlaArgGlyThrProLeuLeuLeuLeuLeuLeuL 

70 90 110 

TGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGACATATCCCTGGGCAGC 
+ + + + + + 

euLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGlyHisIleProGlyGlnP 

130 150 170 

CAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACCGTCAGCCTGGAGGAGC 
h + .+ + h + 

roValThrProHisTrpVallieuAspGlyGlnPrbTrpArgThrValSerLeuGluGluP 

190 210 230 

CGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAAGGCCAGGAGCTCCTGC 
+ + + + . . + + 

roValSerLysProAspMetGlyLeuValAlaLeuGluAlaGluGlyGlnGluLeuLeuL 

250 270 290 

TTGAGCTGGAGAAGAACCACAGGCTGCTGGCCCCAGGATACATAGAAACCCACTACGGCC 

+ + + ■ + + + 

euGluLeuGluIiysAsnHisArgLeuLeiiAlaProGlyTyrlleGluThrHis Tyr GlyP 

310 330 350 

CAGATGGGCAGCCAGTGGTGCTGGCCCCCAACCACACGGATCATTGCCACTACCAAGGGC 

+ + + + + 

roAspGlyGlnProValValLeuAlaProAsnHisThrAspHisCysHisTyrGlnGlyA 

370 390 410 

GAGTAAGGGGGTTCCCCGACTCCTGGGTAGTCCTCTGCACCTGCTCTGGGATGAGTGGCC 
+ + + + + + 

rgValArgGlyPheProAspSerTrpValValLeuCysThrCysSerGlyMetSerGlyL 

430 450 470 

TGATCACCCTCAGCAGGAATGCCAGCTATTATCTGCGTCCCTGGCCACCCCGGGGCTCCA 
+ + + + + + 

eul 1 eThrLeuSer Ar gAsnAl aSer Ty^TyrLeuAr gProTrpPr oP r o ArgGlySerL 
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490 510 530 

AGGACTTCTCAACCCACGAGATCTTTCGGATGGAGCAGCTGCTCACCTGGAAAGGAACCT 
+ — ■ + + + + + 

ysAspPheSerThrHisGluIlePheArgMetGluGlnLeuLeuThrTrpLysGlyThrC 

550 570 590 

GTGGCCACAGGGATCCTGGGAACAAAGCGGGCATGACCAGCCTTCCTGGTGGTCCCCAGA 

+ + + + h + 

ysGlyHisArgAspProGlyAsnLysAlaGlyMetThrSerLeuProGlyGlyProGlnS 

610 630 650 

GCAGGGGCAGGCGAGAAGCGCGCAGGACCCGGAAGTACCTGGAACTGTACATTGTGGCAG 

+ + + + + + 

erArgGlyArgArgGluAlaArgArgThrArgLysTyrLeuGluLeuTyrlleValAlaA 

670 690 710 

ACCACACCCTGTTCTTGACTCGGCACCGAAACTTGAACCACACCAAACAGCGTCTCCTGG 
+ + + h + + 

spHisThrLeuPhelieuThrArgHisArgAsnLeuAsnHisThrLysGlnArgLeuLeuG 

730 750 770 

• • • • 

AAGTCGCCAACTACGTGGACCAGCTTCTCAGGACTCTGGACATTCAGGTGGCGCTGACCG 

+ + + + + + 

luValAlaAsnTyrValAspGlnLeuLeiiArgThrLeuAspIleGlnValAlaLeuThrG 

790 810 830 

GCCTGGAGGTGTGGACCGAGCGGGACCGCAGCCGCGTCACGCAGGACGCCAACGCCACGC 
+ + + + + + 

lyLeuGluValTrpThrGluAr gAspAr gS er ArgVa 1 ThrGlnAspAlaAsnAlaThrL 

850 870 890 

TCTGGGCCTTCCTGCAGTGGCGCCGGGGGCTGTGGGCGCAGCGGCCCCACGACTCCGCGC 
+ ■ + h h + h 

euTrpAlaPheLeuGlnTrpArgArgGlyLeuTrpAl aGlnArgProHi sAspSerAlaG 

910 930 950 

AGCTGCTCACGGGCCGCGCCTTCCAGGGCGCCACAGTGGGCCTGGCGCCCGTCGAGGGCA 

+ + + + + h 

lnLeuLeuThrGlyArgAlaPheGlnGlyAlaThrValGlyLeuAlaProValGluGlyM 
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970 990 1010 

* 

TGTGCCGCGCCGAGAGCTCGGGAGGCGTGAGCACGGACCACTCGGAGCTCCCCATCGGCG 
+ + + + — + + 

etCysArgAlaGluSerSerGlyGlyValSerThrAspHisSerGluLeuProIleGlyA 

1030 1050 1070 

CCGCAGCCACCATGGCCCATGAGATCGGCCACAGCGTCGGCCTCAGCGACGACCCCGACG 

+ + + + + -t- 

laAlaAlaThrMetAlaHisGluIleGlyHisSerLeuGlyLeuSerHisAspProAspG 

1090 1110 1130 

GCTGCTGCGTGGAGGCTGCGGCCGAGTCCGGAGGCTGCGTCATGGCTGCGGCCACCGGGC 
+ + + + + + 

lyCysCysValGluAlaAlaAlaGluSerGlyGlyCysValMetAlaAlaAlaThrGlyH 

1150 1170 1190 

ACCCGTTTCCGCGCGTGTTCAGCGCCTGCAGCCGCCGCCAGCTGCGCGCCTTCTTCCGCA 
+ + + + + + 

IsProPheProArgValPheSerAlaCysSerArgArgGlnLeuArgAlaPhePheArgL 

1210 1230 1250 

AGGGGGGCGGCGCTTGCCTCTCCAATGCCCCGGACCCCGGACTCCCGGTGCCGCCGGCGC 
+ + + + + + 

ysGlyGlyGlyAlaCysLeuSerAsnAlaProAspProGlyLeuProValProProAlaL 

1270 1290 1310 

TCTGCGGGAACGGCTTCGTGGAAGCGGGCGAGGAGTGTGACTGCGGCCCTGGCCAGGAGT 
+ + + — , + + h 

euCysGlyAsnGlyPheValGluAlaGlyGluGluCysAspCysGlyProGlyGlnGluC 

1330 1350 1370 

GCCGCGACCTCTGCTGCTTTGCTCACAACTGCTCGCTGCGCCCGGGGGCCCAGTGCGCCC 

+ + + + + ■ — F- 

ysArgAspLeuCysCysPheAlaHisAsnCysSerLe\iArgProGlyAlaGlnCysAlaH 

1390 1410 1430 

ACGGGGACTGCTGCGTGCGCTGCCTGCTGAAGCCGGCTGGAGCGCTGTGCCGCCAGGCCA 

+ + + + + 

isGlyAspCysCysValArgCysLeuLeuLysProAlaGlyAlaLeuCysArgGlnAlaM 

1450 1470 1490 
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TGGGTGACTGTGACCTCCCTGAGTTTTGCACGGGGACCTCCTCCCACTGTCCCCCAGACG 

+ H + + H + 

etGlyAspCysAspLeuProGluPheCysThrGlyThrSerSerHisCysProProAspV 



1510 1530 1550 

TTTACCTACTGGACGGCTCACCCTGTGCCAGGGGCAGTGGCTACTGCTGGGATGGCGCAT 
+ + + + -|- + 

alTyrLeuLeiiAspGlySerProCysAlaArgGlySerGlyTyrCysTrpAspGlyAlaC 



1570 1590 1610 

GTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTGGGGGCCTGGCTCCCACCCAGCTCCCG 

+ _| y + + + 

ysProThrLeuGluGlnGlnCysGlnGlnLeuTrpGlyProGlySerHisProAlaProG 



1630 1650 1670 

AGGCCTGTTTCCAGGTGGTGAACTCTGCGGGAGATGCTCATGGAAACTGCGGCCAGGACA 

+ H + + H [- 

luAlaCysPheGlnValValAsnSerAlaGlyAspAlaHisGlyAsnCysGlyGlnAspS 

1690 1710 1730 

GCGAGGGCCACTTCCTGCCCTGTGCAGGGAGGGATGCCCTGTGTGGGAAGCTGCAGTGCC 
+ + + + + h 

erGluGlyHisPheLeuProCysAlaGlyArgAspAlaLeuCysGlyLysLeuGlnCysG 

1750 1770 1790 

AGGGTGGAAAGCCCAGCCTGCTCGCACCGCACATGGTGCCAGTGGACTCTACCGTTCACC 
+ + + + + + 

InGlyGlyLysProSerLeuLeuAlaProHisMetValProValAspSerThrValHisL 

1810 1830 1850 

I'AGATGGCCAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCTGGACC 
+ + + + + + 

euAspGlyGlnGluValThrCysArgGlyAlaLeuAlaLeuProSerAlaGlnLeuAspL 

1870 1890 1910 

TGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGTGCCAGA 
[_ + + h + + 

euLeuGlyLeuGlyLeuValGluProGlyThrGlnCysGlyProArgMetValCysGlnS 
1930 1950 1970 
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GCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTGCCACA 

+ + • + + + + 

erArgArgCysArgLysAsnAlaPheGlnGluLeuGlnArgCysLeuThrAlaCysHisS 

1990 2010 2030 



+ + + + + + 

erHisGlyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaProProP 



2050 2070 2090 

TCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGAAAACC 
+ + + + + + 

heCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGluAsnH 



2110 2130 2150 

ATGACACCTTCCTGCTGGCCATGCTCCTCAGCGTCCTGCTGCCTCTGCTCCCAGGGGCCG 
+ + + + + — . + 

isAspThrPheLeuLeiiAlaMetLeuLeuSer ValL euLeuProLeuLeuProGlyAlaG 

2170 2190 2210 

GCCTGGCCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTGGGGCT 

+ + + + + + 

lyLeuAlaTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTrpGlyC 

2230 2250 2270 

GCAGAAGGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCACCCCCTGG 

+ + + + + + 

ysArgArgAspProAlaCysSerGlyProLysAspGlyProHisArgAspHisProLeuG 

2290 2310 2330 

GCGGCGTTCACCCCATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCTGGACC 

+ + + + + f- 

lyGlyValHisProMetGluLeuGlyProThrAlaThrGlyGlnProTrpProLeuAspP 

2350 2370 2390 

CTGAGAACTCTCATGAGCCCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTCGCCTG 

+ + • + + + - + 

roGliiAsnSerHisGluProSerSerHisProGluLysProLeuProAlaValSerProA 

2410 2430 2450 

ACCCCCAAGATCAAGTCCAGATGCCAAGATCCTGCCTCTGGTGAGAGGTAGCTCCTAAAA 
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+ + + + + + 

spProGlnAspGlnValGlnMetProArgSerCysLeuTrpEnd 



2470 2490 2510 

TGAACAGATTTAAAGACAGGTGGCCACTGACAGCCACTCCAGGAACTTGAACTGCAGGGG 
+ + + + -i + 

2530 2550 2570 

CAGAGCCAGTGAATCACCGGACCTCCAGCACCTGCAGGCAGCTTGGAAGTTTCTTCCCCG 
+ + + + + + 

2590 2610 2630 

AGTGGAGCTTCGACCCACCCACTCCAGGAACCCAGAGCCACATTAGAAGTTCCTGAGGGC 
+ + ■ + + + + 

2650 2670 2690 

TGGAGAACACTGCTGGGCACACTCTCCAGCTCAATAAACCATCAGTCCCAGAAGCAAAGG 
.+ + + + + + 

2710 2730 2750 

TCACACAGCCCCTGACCTCCCTCACCAGTGGAGGCTGGGTAGTGCTGGCCATCCCAAAAG 
+ + + + + + 

2770 2790 2810 

GGCTCTGTCCTGGGAGTCTGGTGTGTCTCCTACATGCAATTTCCACGGACCCAGCTCTGT 
+ + + + + + 

2830 2850 2870 

GGAGGGCATGACTGCTGGCCAGAAGCTAGTGGTCCTGGGGCGCTATGGTTCGACTGAGTC 
+ + + + + + 

2890 2910 2930 

CACACTCCCCTGCAGCCTGGCTGGCCTCTGCAAACAAACATAATTTTGGGGACCTTCCTT 
+ + + + + + 

2950 2970 2990 

CCTGTTTCTTCCCACCCTGTCTTCTCCCCTAGGTGGTTCCTGAGCCCCCACCCCCAATCC 
+ + + + + + 
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3010 3030 3050 

CAGTGCTACACCTGAGGTTCTGGAGCTCAGAATCTGACAGCCTCTCCQCCATTCTGTGTG 
+ + + + + + 

3070 3090 3110 

TGTCGGGGGGACAGAGGGAACCATTTAAGAAAAGATACCAAAGTAGAAGTCAAAAGAAAG 
+ + — « + + + + 

3130 3150 3170 

ACATGTTGGCTATAGGCGTGGTGGCTCATGCCTATAATCCCAGCACTTTGGGAAGCCGGG 
+ + + + + + 

3190 3210 3230 

GTAGGAGGATCACCAGAGGCCAGCAGGTCCACACCAGCCTGGGCAACACAGCAAGACACC 
h — + + + + + 

3250 3270 3290 

GCATCTACAGAAAAATTTTAAAATTAGCTGGGCGTGGTGGTGTGTACCTGTAGGCCTAGC 
+ + + h h + 

3310 3330 3350 

TGCTCAGGAGGCTGAAGCAGGAGGATCACTTGAGCCTGAGTTCAACACTGCAGTGAGCTA 
+ + + + + + 

3370 3390 3410 

TGGTGGC AC C AC TGC ACTCCAGCCTGGGTG AC AG AGC AAG ACCCTGTCTCTAAAATAAAT 
+ + + + + 

3430 3450 3470 

TTTAAAAAGACATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
+ + + + + + 

3490 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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10 30 50 

CGGGCACGGGTCGGCCGC2\ATCCAGCCTGGGCGGAGCCGGAGTTGCGAGCCGCTGCCTAG 

70 90 110 

...*•* 
AGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCCCGTTGC 

MetGlyTrpArgProArgArgAlaArgGlyThrProLeuIi 

130 150 170 

TGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGAC 
euLeuLeuLeuLeuLeuLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGlyH 

190 210 230 

ATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACCG 
isIleProGlyGlnProValThrProHisTrpValLeuAspGlyGlnProTrpArgThrV 

250 270 290 

TCAGCCTGGAGGAGCCGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAAG 
alSerLeuGluGluProValSerLysProAspMetGlyLeuValAlaLeuGluAlaGluG 

310 330 

GCCAGGAGCTCCTGCTTGAGCTGGAGAAGAACCACAGGC 
lyGlnGluLeuLeuLeuGluLeuGluLysAsnHisArg 
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10 30 50 

CGGGCACGGGTCGGCCGCAATCCAGCCTGGGCGGAGCCGGAGTTGCGAGCCGCTGCCTAG 

70 90 110 

AGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCCCGTTGC 

MetGlyTrpArgProArgArgAlaArgGlyThrProLeuL 

130 150 170 

TGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGAC 
euLeuLeuLeuLeuLeuLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGlyH 

190 210 

ATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGAC 
IsIleProGlyGlnProValThrProHisTrpValLeuAspGly 



FIG. 32 



WO 02/083077 



145/157 



PCT/US02/12063 



10 30 50 

• • • ■ • * 

GCCTAGAGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCC 

MetGlyTrpArgProArgArgAlaArgGlyThrP 

70 90 110 

CGTTGCTGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTC 
roLeuLeuLeuLeuLeuLeuLeuLeuLeuLeuTrpProValProGlyAlaGlyValLeuG 

130 150 170 

AAGG AC ATATC C C TGGG C AGCC AGTC ACC CC G C ACTGGGTC CTGGATG G AC AACC C TGGC 
lnGlyHisIleProGlyGlnProValThrProHisTrpValLeuAspGlyGlnProTrpA 

190 210 230 

GCACCGTCAGCCTGGAGGAGCCGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGG 
rgThrValSerLeuGluGluProValSerLysProAspMetGlyLeuValAlaLeuGlxiA 

250 270 290 

CTGAAGGCCAGGAGCTCCTGCTTGAGCTGGAGAAGAACCACAGGCTGCTGGCCCCAGGAT 
laGluGlyGlnGluLeuLeuLeuGluLeuGluLysAsnHisArgLeuLeuAlaProGlyT 

310 330 350 

ACATAGAAACCCACTACGGCCCAGATGGGCAGCCAGTGGTGCTGGCCCCCAACCACACGG 
yrlleGluThrHisTyrGlyProAspGlyGlnProValValLeuAlaProAsnHisThrV 

370 390 410 

• • « • . . 

TGAGATGCTTCCATGGGCTCTGGGATGCACCGCCAGAGGATCATTGCCACTACCAAGGGC 
alArgCysPheHisGlyLeuTrpAspAlaProProGlxoAspHisCysHisTyrGlnGlyA 

430 450 470 

GAGTAAGGGGCTTCCCCGACTCCTGGGTAGTCCTCTGCACCTGCTCTGGGATGAGTGGCC 
rgValArgGlyPheProAspSerTrpValValLeuCysThrCysSerGlyMetSerGlyL 

490 510 530 

TGATCACCCTCAGCAGGAATGCCAGCTATTATCTGCGTCCCTGGCCACCCCGGGGCTCCA 
euIleThrLeuSerArgAsnAlaSerTyrTyrLeuArgProTrpProProArgGlySerL 

550 

GACTTCTCAACCCACGAGAT Clf* QQ 

AspPheSerThrHisGlu l"IV3- OO 
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10 30 50 

G AGGC C G AGG AG C TC AC AGCTAT GGGC TGG AGGC C C CGG AG AGC TC GGGGG AC CC C GTTG 

Me tG lyTr pAr gPr oAr gAr gAl aAr gGl yThr Pro Leu 



70 90 110 

.... 
CTGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGA 
LeuLeuLeuLeuLeuLeuLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGly 



130 150 170 

» • • a . 

CATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACC 
HisIleProGlyGlnProValThrProHisTrpValLeuAspGlyGlnProTrpArgThr" 



190 210 230 

■ •*.». 
GTCAGCCTGGAGGAGCCGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAA 
ValSerLeuGluGluProValSerLysProAspMetGlyLeuValAlaLeuGluAlaGlu 



250 270 290 

GG C C AGG AGC TC C TGC TTG AGCTGG AG AAG AACC ATGGC C TG ATC AC C C TC AGC AGG AAT 
GlyGlnGluLeuLeuLeuGluLeuGluLysAsnHisGlyLeuIleThrLeuSerArgAsn 



310 330 350 

GCCAGCTATTATCTGCGTCCCTGGCCACCCCGGGGCTCCAAGGACTTCTCAACCCACGAG 
AlaSerTyrTyrLeuArgProTrpProProArgGlySerLysAspPheSerThrHisGlu 



AT 
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GAGGC C GAGG AGCTC AC AGC TATGGGC TGG AGGC CCC GGAG AGCTC GG GGG AC C C C GT TG 

Me tGlyTrpAr gPr o Ar gAr gAlaAr gGlyThr Pr oLeu 

70 90 110 

CTGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGA 
LeuIieuLeuLeuLeuLeuLeuIieuLeuTrpProValProGlyAlaGlyValLeuGlnGly 

130 150 170 

CATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACC 
HisIleProGlyGlnProValThrProHisTrpValLeuAspGlyGlnProTrpArgThr 

190 210 230 

GTCAGCCTGGAGGAGCCGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAA 
ValSerLeuGluGluProValSerLysProAspMetGlyLeuValAlaLeuGluAlaGlu 

250 270 290 

GGCCAGGAGCTCCTGCTTGAGCTGGAGAAGAACCACAGGCTGCTGGCCCCAGGATACATA 
GlyGlnGluLeuLeuLeuGluLeuGluLysAsnHisArgLeuLeuAlaProGlyTyrlle 

310 330 350 

GAAACCCACTACGGCCCAGATGGGCAGCCAGTGGTGCTGGCCCCCAACCACACGGATCAT 
GluTlirHisTyrGlyProAspGlyGlnProValValLeuAlaProAsnHisThrAspHis 

370 390 410 

TGCCACTACCAAGGGCGAGTAAGGGGCTTCCCCGACTCCTGGGTAGTCCTCTGCACCTGC 
CysHisTyrGlnGlyArgValArgGlyPheProAspSerTrpValValLeuCysThrCys 

430 450 470 

TCTGGGATGAGTGGCCTGATCACCCTCAGCAGGAATGCCAGCTATTATCTGCGTCCCTGG 
SerGlyMetSerGlyLeuIleThrLeuSerArgAsnAlaSerTyrTyrLeuArgProTrp 

490 510 

CC ACCCCGGGGCTCCAAGGACTTCTCAACCC ACGAGAT ~ 
ProProArgGlySerLysAspPheSerThrHisGlu MVJ. vJ!D 
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10 30 50 

CTGGCCCCAGGATACATAGAAACCCACTACGGCCCAGATGGGCAGCCAGTGGTGCTGGCC 
LeuAlaProGlyTyrlleGluThrHisTyrGlyProAspGlyGlnProValValLeuAla 



70 90 110 

CCCAACCACACGGATCATTGCCACTACCAAGGGCGAGTAAGGGGCTTCCCCGACTCCTGG 
ProAsnHisThrAspHisCysHisTyrGlnGlyArgValArgGlyPlieProAspSerTrp 



130 150 170 

GTAGTCCTCTGCACCTGCTCTGGGATGAGTGGCCTGATCACCCTCAGCAGGAATGCCAGC 
ValValLeuCysThrCysSerGlyMetSerGlyLeuIleThrLeuSerArgAsnAlaSer 



190 210 230 

TATTATCTGCGTCCCTGGCCACCCCGGGGCTCCAAGGACTTCTCAACCCACGAGATCTTT 
TyrTyrLeuArgProTrpProProArgGlySerLysAspPheSerThrHisGluIlePhe ' 



250 270 290 

CGGATGGAGCAGCTGCTCACCTGGAAAGGAACCTGTGGCCACAGGGATCCTGGGAACAAA 
ArgMetGluGlnLeuLeuThrTrpLysGlyThrCysGlyHisArgAspProGlyAsnLys 



310 330 350 

G 

GCGGGCATGACCAGCCTTCCTGGTGGTCCCCAGAGCAGGGGCAGGCGAAAAGCGCGCAGG 
AlaGlyMetThrSerLeuProGlyGlyProGlnSerArgGlyArgArgLysAlaArgArg 

Glu 

370 390 410 

ACC C GG AAGT AC CTGG AAC TGTAC ATTGTGGC AG AC C AC AC C CTGTTCTTG ACTC GGC AC 
Thr Ar gLy sTyr L euG luLeuTyr 1 1 eVal Al aAspHi s ThrL euPheLeuThr Ar gH i s 



430 450 470 

CGAAACTTGAACCACACCAAACAGCGTCTCCTGGAAGTCGCCAACTACGTGGACCAGCTT 
ArgAsnLeuAsnHisThrLysGlnArgLeuLeuGluValAlaAsnTyrValAspGlnLeu 



490 



CTCAGGACTCTGGACATTCAGGTGGC tZ\r* 
LeuArgThrLeuAspIleGlnVal I Iv3. uD 



WO 02/083077 PCT/US02/12063 

149/157 

10 30 50 

CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrLeuGluGlnGlnCysGlnGlnLeuTr 

70 90 110 

GGGGCCTGGCTCCCACCCAGCTCCCGAGGCCTGTTTCCAGGTGGTGAACTCTGCGGGAGA 
pGlyProGlySerHisProAlaProGluAlaCysPheGlnValValAsnSerAlaGlyAs 

130 150 170 

TGCTCATGGAAACTGCGGCCAGGACAGCGAGGGCCACTTCCTGCCCTGTGCAGGGAGGGA 
pAlaHisGlyAsnCysGlyGlnAspSerGluGlyHisPheLeuProCysAlaGlyArgAs 

190 210 230 

TGCCCTGTGTGGGAAGCTGCAGTGCCAGGGTGGAAAGCCCAGCCTGCTCGCACCGCACAT 
pAlaLeuCysGlyLysLeuGlnCysGlnGlyGlyLysProSerLeuLeuAlaProHisMe 

250 270 290 

GGTGCCAGTGGACTCTACCGTTCACCTAGATGGCCAGGAAGTGACTTGTCGGGGAGCCTT 
tValProValAspSerThrValHisLeuAspGlyGlnGluValThrCysArgGlyAlaLe 

310 330 350 

GGCACTCCCCAGTGCCCAGCTGGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCA 
uAlaLeuProSerAlaGlnLeuAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGl 

370 390 410 

GTGTGGACCTAGAATGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGC 
nCysGlyProArgMetValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAl 



430 450 470 

TCCACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGC 
aProProPheCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAl 



490 



TGAAAACCATGACACCTTCCTGC Clf* 0"7 

aGluAsnHisAspThrPheLeu I IV3. Of 
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10 30 50 

CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrLeuGluGlnGlnCysGlnGlnLeuTr 

70 90 110 

GGGGCCTGGCTCCCACCCAGCTCCCGAGGCCTGTTTCCAGGTGGTGAACTCTGCGGGAGA 
pGlyProGlySerHisProAlaProGluAlaCysPiieGlnValValAsnSerAlaGlyAs 

130 150 170 

TGCTCATGGAAACTGCGGCCAGGACAGCGAGGGCCACTTCCTGCCCTGTGCAGGGAGGGA 
pAlaHisGlyAsnCysGlyGlnAspSerGluGlyHisPheLeuProCysAlaGlyArgAs 

190 210 230 

TGCCCTGTGTGGGAAGCTGCAGTGCCAGGGTGGAAAGCCCAGCCTGCTCGCACCGCACAT 
pAlaLeuCysGlyLysLeuGlnCysGlnGlyGlyLysProSerLeuLeuAlaProHisMe 

250 270 290 

GGTGCCAGTGGACTCTACCGTTCACCTAGATGGCCAGGAAGTGACTTGTCGGGGAGCCTT 
tValProValAspSerThrValHisLeuAspGlyGlnGluValThrCysArgGlyAlaLe 

310 330 350 

GGCACTCCCCAGTGCCCAGCTGGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCA 
uAlaLeuProSerAlaGlnLeuAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGl 

370 390 410 

GTGTGGACCTAGAATGGTGTGCCAGAGCAGGCGCTGGAGGAAGAATGCCTTCCAGGAGCT 
nCysGlyProArgMetValCysGlnSerArgArgCysArgLysAsnAlaPheGlnGluLe 

430 450 470 

TCAGCGCTGCCTGACTGCCTGCCACAGCCACGGGGTTTGCAATAGC2^ACCATAACTGCCA 
uGlnArgCysLeuThrAlaCysHisSerHisGlyValCysAsnSerAsnHisAsnCysHi 

490 510 530 

CTGTGCTCCAGGCTGGGCTCCACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGA 
sCysAlaProGlyTrpAlaProProPheCysAspLysProGlyPlieGlyGlySerMetAs 

550 570 

« • • « 

kGTGGCCCTGTGCAGGCTGAAAACCATGACACCTTCCTGC _ f _ 

SerGlyProValGlnAlaGluAsnHisAspThrPheLeu FIG. 38 
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10 30 50 

>•■••« 
CAGTGGCTACTGC TGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGC CAGC AGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrlieuGluGlnGlnCysGlnGlnLeuTr 

70 90 110 

GGGGCCTGATGGCCAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCT 
pGlyProAspGlyGlnGluValThrCysArgGlyAlaLeuAlaLeuProSerAlaGlnLe 

130 150 170 

GGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGTG 
uAspLeuLeuGlyLeuGlylieuValGluProGlyThrGlnCysGlyProArgMetValCy 

190 210 230 

CCAGAGCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTG 
sGlnSerArgArgCysArgLysAsnAlaPheGlnGluLeuGlnArgCysLeuThrAlaCy 

250 270 290 

CCACAGCCACGGGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGCTCC 
sHisSerHisGlyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaPr 

310 330 350 

ACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGA 
oProPheCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGl 

370 

AAACCATGACACCTTCCTGC _ 
uAsnHisAspThrPheLeu FIG- 39 



WO 02/083077 



152/157 



PCT/US02/12063 



10 30 50 

•••••« 
GGCCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTGGGGCTGCAG 
AlaTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTrpGlyCysAr 



70 90 110 

AAGGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCACCCCCTGGGCGG 
gArgAspPr oAlaCysSerGlyProLy sAspGlyProHi sArgAspHi sProLeuGlyGl 



130 150 170 

CGTTCACCCCATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCTGGACCCTGA 
yValHisProMetGluLeuGlyProThrAlaThrGlyGlnProTrpProLeuAspProGl 



190 210 230 

GAACTCTCATGAGCCCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTCGCCTGACCC 
uAsnSerHisGluProSerSerHisProGluLysProLeuProAlaValSerProAspPr 



250 270 290 

* 

CCAAGCAGATCAAGTCCAGATGCCAAGATCCTGCCTCTGGTGAGAGGTAGCTCCTAAAAT 
oGlnAlaAspGlnValGlnMetProArgSerCysLeuTrpEnd 



310 

GAACAGATTTAAAGACAGGTGGCC 

FIG. 40 



WO 02/083077 



153/157 



PCT/US02/12063 



10 30 50 

CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrLeuGluGlnGlnCysGl nGlnLeuTr 

70 90 110 

GGGGCCTGATGGCCAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCT 
pGl y ProAspGl yGl nGl uVa lThrCysArgGl yAl a LeuAl aLeu ProSerAl aGl r\Le 

130 150 170 

GGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGTG 
uAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGlnCysGlyProArgMetValCy 

190 210 230 

CCAGAGCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTG 
sGlnSerArgArgCysArgLysAsnAlaPheGlnGluLeuGlnArgCysLeuThrAlaCy 

250 270 290 

CCACAGCCACGGGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGCTCC 
sHisSerHisGlyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaPr 

310 330 350 

ACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGA 
oProPheCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGl 

370 390 410 

AAACCATGACACCTTCCTGCTGGCCATGCTCCTCAGCGTCCTGCTGCCTCTGCTCCCAGG 
uAsnHisAspThrPheLeuLeuAl aMetLeuLeuSerValLeuLeuProLeuLeuProGl 

430 450 470 

• • • . * . 

GGCCGGCCTGGCCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTG 
yAlaGlyLeuAlaTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTr 

490 510 530 

GGGCTGCAGAAGGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCACCC 
pGlyCysArgArgAspProAlaCysSerGlyProLysAspGlyProHisArgAspHisPr 

550 570 590 

CCTGGGCGGCGTTCACCCCATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCT 
oLeuGlyGlyValHisProMetGluLeuGlyProThrAlaThrGlyGlnProTrpProLe 

610 630 650 

GGACCCTGAGAACTCTCATGAGCCCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTC 
uAspProGluAsnSerHisGluProSerSerHisProGluLysProLeuProAlaValSe 

670 690 710 

GCCTGACdTCCCAAGC AGATC AAGTCC AG AT GCC AAGATCCTGCCTCTGGT G AG AGGT AGC 
rProAspProGlnAlaAspGlnValGlnMetProArgSerCysLeuTrpEnd 

730 750 

TCCTAAAATGAACAGATTTAAAGACAGGTGGCC FIG. 41 
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10 30 50 

CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCys TrpAspGlyAlaCy s Pr oThrLeuGluG InGlnCy s GlnGlnLeuTr 

70 90 110 

GGGGCCTGGCTCCCACCCAGCTCCCGAGGCCTGTTTCCAGGTGGTGAACTCTGCGGGAGA 
pGlyProGlySerHisProAlaProGluAlaCysPheGlnValValAsnSerAlaGlyAs 

130 150 170 

TGCTCATGGAAACTGCGGCCAGGACAGCGAGGGCCACTTCCTGCCCTGTGCAGGGAGGGA 
pAlaHisGlyAsnCysGlyGlnAspSerGluGlyHisPheLeuProCysAlaGlyArgAs 

190 210 230 

TGCCCTGTGTGGGAAGCTGCAGTGCCAGGGTGGAAAGCCCAGCCTGCTCGCACCGCACAT 
pAlaLeuCysGlyLysLeuGlnCysGlnGlyGlyLysProSerlieuLe-uAlaProHisMe 

250 270 290 

GGTGCCAGTGGACTCTACCGTTCACCTAGATGGCCAGGAAGTGACTTGTCGGGGAGCCTT 
tValProValAspSerThrValHisLeuAspGlyGlnGluValThrCysArgGlyAlaLe 

310 330 350 

GGCACTCCCCAGTGCCCAGCTGGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCA 
uAla!.euProSerAlaGlnLe\iAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGl 

370 390 410 

GTGTGGACCTAGAATGGTGTGCCAGAGCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCT 
nCysGlyProArgMetValCysGlnSerArgArgCysArgLysAsnAlaPheGlnGluLe 

430 450 470 

TCAGCGCTGCCTGACTGCCTGCCACAGCCACGGGGTTTGCAATAGCAACCATAACTGCCA 
uGlnArgCysLeuThrAlaCysHisSerHisGlyValCysAsnSerAsnHisAsnCysHi 

490 510 530 

CTGTGCTCCAGGCTGGGCTCCACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGA 
sCysAlaProGlyTrpAlaProProPheCysAspLysProGlyPheGlyGlySerMetAs 

550 570 590 

CAGTGGCCCTGTGCAGGCTGAAAACCATGACACCTTCCTGCTGGCCATGCTCCTCAGCGT 
pSerGlyProValGlnAlaGluAsnHisAspThrPheLeuLeuAlaMetLeviIjeuSerVa 

610 630 650 

CCTGCTGCCTCTGCTCCCAGGGGCCGGCCTGGCCTGGTGTTGCTACCGACTCCCAGGAGC 
lLeuLeuProLeuLeuProGlyAlaGlyLeuAlaTrpCysCysTyrArgLeuProGlyAl 



FIG. 42.1 
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CCATCTGCAGCGATGCAGCTGGGGCTGCAGAAGGGACCCTGCGTGCAGTGGCCCCAAAGA 
aHisLeuGlnArgCysSerTrpGlyCysArgArgAspProAlaCysSerGlyProLysAs 

730 750 770 

TGGCCCACACAGGGACCACCCCCTGGGCGGCGTTCACCCCATGGAGTTGGGCCCCACAGC 
pGlyProHisArgAspHisProLeuGlyGlyValHisProMetGluLeuGlyProTlirAl 

790 810 830 

* 

CACTGGACAGCCCTGGCCCCTGGACCCTGAGAACTCTCATGAGCCCAGCAGCCACCCTGA 
aThrGlyGlnProTrpProLeuAspProGluAsnSerHisGluProSerSerHisProGl 

850 870 890 

GAAGCCTCTGCCAGCAGTCTCGCCTGACCGCCAAGATCAAGTCCAGATGCCAAGATCCTG 
uLysProLeuProAlaValSerProAspProGlnAspGlnValGlnMetProArgSerCy 

910 930 950 

CCTCTGGTGAGAGGTAGCTCCTAAAATGAACAGATTTAAAGACAGGTGGCCACTGACAGC 
sLeuTrpEnd 

970 990 1010 

CACTCCAGGAACTTGAACTGCAGGGGCAGAGCCAGTGAATCACCGGACCTCCAGCACCTG 

1030 1050 1070 

• - • . 

CAGGCAGCTTGGAAGTTTCTTCCCCGAGTGGAGCTTCGACCCACCCACTCCAGGAACCCA 

1090 1110 1130 

GAGCCACATTAGAAGTTCCTGAGGGCTGGAGAACACTGCTGGGCACACTCTCCAGCTCAA 
1150 

TAAACCATCAGTCC FIG. 42-2 
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CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrLeuGluGlnGlnCysGlnGlnLeuTr 



70 90 110 

GGGGCCTGATGGCCAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCT 
pGlyProAspGlyGlnGluValThrCysArgGlyAlaLeuAlaLeuProSerAlaGlnLe 



130 150 170 

■ 

GGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGTG 
iiAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGlnCysGlyProArgMetValCy 

190 210 230 

CCAGAGCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTG 
sGlnSerArgArgCysArgLysAsnAlaPheGlnGluLeuGlnArgCysLeuThrAlaCy 

250 270 290 

CCACAGCCACGGGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGCTCC 
sHisSerHisGlyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaPr 

310 330 350 

ACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGA 
oProPheCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGl 

370 390 410 

AAAC C ATG AC ACCTTCCTGC TGGCC ATGCTCCTC AGCGTCC TGC TGCCTC TGCTC CC AGG 
uAsnHisAspThrPheLeuLeixAlaMetLeuLeuSerVallieuLeuProLeuLeuProGl 

430 450 470 

- 

GGCCGGCCTGGCCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTG 
yAlaGlyLeuAlaTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTr 

490 510 530 

GGGCTGCAGAAGGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCACCC 
pGlyCysArgArgAspProAlaCysSerGlyProLysAspGlyProHisArgAspHisPr 



FIG. 43.1 
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CCTGGGCGGCGTTCACCCCATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCT 
oLeuGlyGlyValHisProMetGluLeuGlyProThrAlaThrGlyGlnProTrpProLe 



610 630 650 

GGACCCTGAGAACTCTCATGAGCCCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTC 
TiAspProGliaAsnSerHisGluProSerSerHisProGluLysProLeuProAlaValSe 



670 690 710 

GCCTGACCCCCAAGATCAAGTCCAGATGCCAAGATCCTGCCTCTGGTGAGAGGTAGCTCC 
rProAspProGlnAspGlnValGlnMetProArgSerCysLeuTrpEnd 



730 750 770 

■ 

TAAAATGAACAGATTTAAAGACAGGTGGCCACTGACAGCCACTCCAGGAACTTGAACTGC 

790 810 830 

AGGGGCAGAGCCAGTGAATCACCGGACCTCGAGCACCTGCAGGCAGCTTGGAAGTTTCTT 

850 870 890 

CCCGGAGTGGAGCTTCGACCCACCCACTCCAGGAACCCAGAGCCACATTAGAAGTTCCTG 

910 930 950 

AGGGCTGGAGAACACTGCTGGGCACACTCTCCAGCTCAATAAACCATCAGTCC 

FIG. 43.2 
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SEQUENCE LISTING 

<110> SCHERING CORPORATION 

GENOME THERAPEUTICS CORPORATION 

<12 0> NOVEL HUMAN GENE RELATING TO RESPIRATORY DISEASES, 
^OBESITY, AND INFLAMMATORY BOWEL DISEASE 

<130> 2976-4039PC2 

<140> 
<141> 

<150> 09/834,597 

<151> 2001-04-13 

<160> 420 

<170> Patentln Vet. 2.1 

<210> 1 
<211> 3626 
<212> DNA 

<213> Homo sapiens 
<400> 1 

cgggcacggg tcggccgcaa tccagcctgg gcggagccgg agttgcgagc cgctgcctag 6 0 
aggccgagga gctcacagct atgggctgga ggccccggag agctcggggg accccgttgc 12 0 
tgctgctgct actactgctg ctgctctggc cagtgccagg cgccggggtg cttcaaggac 18 0 
atatccctgg gcagccagtc accccgcact gggtcctgga tggacaaccc tggcgcaccg 240 
tcagcctgga ggagccggtc tcgaagccag acatggggct ggtggccctg gaggctgaag 3 00 
gccaggagct cctgcttgag ctggagaaga accacaggct gctggcccca ggatacatag 3 60 
aaacccacta cggcccagat gggcagccag tggtgctggc ccccaaccac acggtgagat 42 0 
gcttccatgg gctctgggat gcaccgccag aggatcattg ccactaccaa gggcgagtaa 48 0 
ggggcttccc cgactcctgg gtagtcctct gcacctgctc tgggatgagt ggcctgatca 540 
ccctcagcag gaatgccagc tattatctgc gtccctggcc accccggggc tccaaggact 60 0 
tctcaaccca cgagatcttt cggatggagc agctgctcac ctggaaagga acctgtggcc 660 
acagggatcc tgggaacaaa gcgggcatga ccagccttcc tggtggtccc cagagcaggg 72 0 
gcaggcgaga agcgcgcagg acccggaagt acctggaact gtacattgtg gcagaccaca 78 0 
ccctgttctt gactcggcac cgaaacttga accacaccaa acagcgtctc ctggaagtcg 840 
ccaactacgt ggaccagctt ctcaggactc tggacattca ggtggcgctg accggcctgg 90 0 
a ggtgtggac cgagcgggac cgcagccgcg tcacgcagga cgccaacgcc acgctctggg 960 
ccttcctgca gtggcgccgg gggctgtggg cgcagcggcc ccacgactcc gcgcagctgc 1020 
tcacgggccg cgccttccag ggcgccacag tgggcctggc gcccgtcgag ggcatgtgcc 1080 
gcgccgagag ctcgggaggc gtgagcacgg accactcgga gctccccatc ggcgccgcag 1140 
ccaccatggc ccatgagatc ggccacagcc tcggcctcag ccacgacccc gacggctgct 1200 
gcgtggaggc tgcggccgag tccggaggct gcgtcatggc tgcggccacc gggcacccgt 12 60 
ttccgcgcgt gttcagcgcc tgcagccgcc gccagctgcg cgccttcttc cgcaaggggg 1320 
gcggcgcttg cctctccaat gccccggacc ccggactccc ggtgccgccg gcgctctgcg 13 80 
ggaacggctt cgtggaagcg ggcgaggagt gtgactgcgg ccctggccag gagtgccgcg 1440 
acctctgctg ctttgctcac aactgctcgc tgcgcccggg ggcccagtgc gcccacgggg 1500 
actgctgcgt gcgctgcctg ctgaagccgg ctggagcgct gtgccgccag gccatgggtg 1560 
actgtgacct ccctgagttt tgcacgggca cctcctccca ctgtccccca gacgtttacc 162 0 
tactggacgg ctcaccctgt gccaggggca gtggctactg ctgggatggc gcatgtccca 1680 
cgctggagca gcagtgccag cagctctggg ggcctggctc ccacccagct cccgaggcct 174 0 
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gtttccaggt ggtgaactct gcgggagatg 
gccacttcct gccctgtgca gggagggatg 
gaaagcccag cctgctcgca ccgcacatgg 
gccaggaagt gacttgtcgg ggagccttgg 
gcctgggcct ggtagagcca ggcacccagt 
gctgcaggaa gaatgccttc caggagcttc 
gggtttgcaa tagcaaccat aactgccact 
acaagccagg ctttggtggc agcatggaca 
ccttcctgct ggccatgctc ctcagcgtcc 
cctggtgttg ctaccgactc ccaggagccc 
gggaccctgc gtgcagtggc cccaaagatg 
ttcaccccat ggagttgggc cccacagcca 
actctcatga gcccagcagc caccctgaga 
aagcagatca agtccagatg ccaagatcct 
acagatttaa agacaggtgg ccactgacag 
agccagtgaa tcaccggacc tccagcacct 
ggagcttcga cccacccact ccaggaaccc 
agaacactgc tgggcacact ctccagctca 
cacagcccct gacctccctc accagtggag 
tctgtcctgg gagtctggtg tgtctcctac 
gggcatgact gctggccaga agctagtggt 
actcccctgc agcctggctg gcctctgcaa 
gtttcttccc accctgtctt ctcccctagg 
tgctacacct gaggttctgg agctcagaat 
cggggggaca gagggaacca tttaagaaaa 
tgttggctat aggcgtggtg gctcatgcct 
ggaggatcac cagaggccag caggtccaca 
tctacagaaa aattttaaaa ttagctgggc 
tcaggaggct gaagcaggag gatcacttga 
tggcaccact gcactccagc ctgggtgaca 
aaaaagacat aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaa 



ctcatggaaa ctgcggccag gacagcgagg 180 0 
ccctgtgtgg gaagctgcag tgccagggtg 1860 
tgccagtgga ctctaccgtt cacctagatg 192 0 
cactccccag tgcccagctg gacctgcttg 198 0 
gtggacctag aatggtgtgc cagagcaggc 2 04 0 
agcgctgcct gactgcctgc cacagccacg 2100 
gtgctccagg ctgggctcca cccttctgtg 2160 
gtggccctgt gcaggctgaa aaccatgaca 222 0 
tgctgcctct gctcccaggg gccggcctgg 2280 
atctgcagcg atgcagctgg ggctgcagaa 2340 
gcccacacag ggaccacccc ctgggcggcg 24 0 0 
ctggacagcc ctggcccctg gaccctgaga 2460 
agcctctgcc agcagtctcg cctgaccccc 252 0 
gcctctggtg agaggtagct cctaaaatga 2580 
ccactccagg aacttgaact gcaggggcag 2 64 0 
gcaggcagct tggaagtttc ttccccgagt 2 70 0 
agagccacat tagaagttcc tgagggctgg 2760 
ataaaccatc agtcccagaa gcaaaggtca 282 0 
gctgggtagt gctggccatc ccaaaagggc 2 88 0 
atgcaatttc cacggaccca gctctgtgga 2 94 0 
cctggggccc tatggttcga ctgagtccac 3 0 00 
acaaacataa ttttggggac cttccttcct 3 060 
tggttcctga gcccccaccc ccaatcccag 312 0 
ctgacagcct ctcccccatt ctgtgtgtgt 3180 
gataccaaag tagaagtcaa aagaaagaca 3 240 
ataatcccag cactttggga agccggggta 33 00 
ccagcctggg caacacagca agacaccgca 33 60 
gtggtggtgt gtacctgtag gcctagctgc 3 42 0 
gcctgagttc aacactgcag tgagctatgg 3 480 
gagcaagacc ctgtctctaa aataaatttt 3540 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3 600 

3626 



<210> 2 

<211> 227 

<212> DNA 

<213> Homo sapiens 

<400> 2 

accgggcacg ggtcggccgc aatccagcct 
agaggccgag gagctcacag ctatgggctg 
gctgctgctg ctactactgc tgctgctctg 
acatatccct gggcagccag tcaccccgca 



gggcggagcc ggagttgcga gccgctgcct 60 
gaggccccgg agagctcggg ggaccccgtt 12 0 
gccagtgcca ggcgccgggg tgcttcaagg 18 0 
ctgggtcctg gatggac 227 



<210> 3 

<211> 3509 

<212> DNA 

<213> Homo sapiens 

<400> 3 

cagctatggg ctggaggccc cggagagctc gggggacccc gttgctgctg ctgctactac 60 
tgctgctgct ctggccagtg ccaggcgccg gggtgcttca aggacatatc cctgggcagc 120 
cagtcacccc gcactgggtc ctggatggac aaccctggcg caccgtcagc ctggaggagc 180 
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cggtctcgaa gccagacatg gggctggtgg 
ttgagctgga gaagaaccac aggctgctgg 
cagatgggca gccagtggtg ctggccccca 
gagtaagggg cttccccgac tcctgggtag 
tgatcaccct cagcaggaat gccagctatt 
aggacttctc aacccacgag atctttcgga 
gtggccacag ggatcctggg aacaaagcgg 
gcaggggcag gcgagaagcg cgcaggaccc 
accacaccct gttcttgact cggcaccgaa 
aagtcgccaa ctacgtggac cagcttctca 
gcctggaggt gtggaccgag cgggaccgca 
tctgggcctt cctgcagtgg cgccgggggc 
agctgctcac gggccgcgcc ttccagggcg 
tgtgccgcgc cgagagctcg ggaggcgtga 
ccgcagccac catggcccat gagatcggcc 
gctgctgcgt ggaggctgcg gccgagtccg 
acccgtttcc gcgcgtgttc agcgcctgca 
a 99ggg9cgg cgcttgcctc tccaatgccc 
tctgcgggaa cggcttcgtg gaagcgggcg 
gccgcgacct ctgctgcttt gctcacaact 
acggggactg ctgcgtgcgc tgcctgctga 
tgggtgactg tgacctccct gagttttgca 
tttacctact ggacggctca ccctgtgcca 
gtcccacgct ggagcagcag tgccagcagc 
aggcctgttt ccaggtggtg aactctgcgg 
gcgagggcca cttcctgccc tgtgcaggga 
agggtggaaa gcccagcctg ctcgcaccgc 
tagatggcca ggaagtgact tgtcggggag 
tgcttggcct gggcctggta gagccaggca 
gcaggcgctg caggaagaat gccttccagg 
gccacggggt ttgcaatagc aaccataact 
tctgtgacaa gccaggcttt ggtggcagca 
atgacacctt cctgctggcc atgctcctca 
gcctggcctg gtgttgctac cgactcccag 
gcagaaggga ccctgcgtgc agtggcccca 
gcggcgttca ccccatggag ttgggcccca 
ctgagaactc tcatgagccc agcagccacc 
acccccaaga tcaagtccag atgccaagat 
tgaacagatt taaagacagg tggccactga 
cagagccagt gaatcaccgg acctccagca 
agtggagctt cgacccaccc actccaggaa 
tggagaacac tgctgggcac actctccagc 
tcacacagcc cctgacctcc ctcaccagtg 
ggctctgtcc tgggagtctg gtgtgtctcc 
ggagggcatg actgctggcc agaagctagt 
cacactcccc tgcagcctgg ctggcctctg 
cctgtttctt cccaccctgt cttctcccct 
cagtgctaca cctgaggttc tggagctcag 
tgtcgggggg acagagggaa ccatttaaga 
acatgttggc tataggcgtg gtggctcatg 
gtaggaggat caccagaggc cagcaggtcc 
gcatctacag aaaaatttta aaattagctg 
tgctcaggag gctgaagcag gaggatcact 
tggtggcacc actgcactcc agcctgggtg 
tttaaaaaga cataaaaaaa aaaaaaaaaa 



ccctggaggc tgaaggccag gagctcctgc 24 0 
ccccaggata catagaaacc cactacggcc 300 
accacacgga tcattgccac taccaagggc 3 60 
tcctctgcac ctgctctggg atgagtggcc 42 0 
atctgcgt cc ctggccaccc cggggctcca 4 80 
tggagcagct gctcacctgg aaaggaacct 54 0 
gcatgaccag ccttcctggt ggtccccaga 600 
ggaagtacct ggaactgtac attgtggcag 66 0 
acttgaacca caccaaacag cgtctcctgg 72 0 
ggactctgga cattcaggtg gcgctgaccg 7 80 
gccgcgtcac gcaggacgcc aacgccacgc 840 
tgtgggcgca gcggccccac gactccgcgc 9 00 
ccacagtggg cctggcgccc gtcgagggca 960 
gcacggacca ctcggagctc cccatcggcg 1020 
acagcctcgg cctcagccac gaccccgacg 1080 
gaggctgcgt catggctgcg gccaccgggc 1140 
gccgccgcca gctgcgcgcc ttcttccgca 1200 
cggaccccgg actcccggtg ccgccggcgc 12 60 
aggagtgtga ctgcggccct ggccaggagt 13 2 0 
gctcgctgcg cccgggggcc cagtgcgccc 13 8 0 
agccggctgg agcgctgtgc cgccaggcca 1440 
cgggcacctc ctcccactgt cccccagacg 150 0 
ggggcagtgg ctactgctgg gatggcgcat 15 6 0 
tctgggggcc tggctcccac ccagctcccg 1620 
gagatgctca tggaaactgc ggccaggaca 1680 
gggatgccct gtgtgggaag ctgcagtgcc 1740 
acatggtgcc agtggactct accgttcacc 18 0 0 
ccttggcact ccccagtgcc cagctggacc 1860 
cccagtgtgg acctagaatg gtgtgccaga 192 0 
agcttcagcg ctgcctgact gcctgccaca 1980 
gccactgtgc tccaggctgg gctccaccct 2 040 
tggacagtgg ccctgtgcag gctgaaaacc 210 0 
gcgtcctgct gcctctgctc ccaggggccg 2160 
gagcccatct gcagcgatgc agctggggct 2220 
aagatggccc acacagggac caccccctgg 22 80 
cagccactgg acagccctgg cccctggacc 2 340 
ctgagaagcc tctgccagca gtctcgcctg 2400 
cctgcctctg gtgagaggta gctcctaaaa 2 460 
cagccactcc aggaacttga actgcagggg 252 0 
cctgcaggca gcttggaagt ttcttccccg 2 580 
cccagagcca cattagaagt tcctgagggc 2 640 
tcaataaacc atcagtccca gaagcaaagg 2 700 
gaggctgggt agtgctggcc atcccaaaag 2 7 60 
tacatgcaat ttccacggac ccagctctgt 2820 
ggtcctgggg ccctatggtt cgactgagtc 2 88 0 
caaacaaaca taattttggg gaccttcctt 2940 
aggtggttcc tgagccccca cccccaatcc 3 000 
aatctgacag cctctccccc attctgtgtg 3 060 
aaagatacca aagtagaagt caaaagaaag 3120 
cctataatcc cagcactttg ggaagccggg 3180 
acaccagcct gggcaacaca gcaagacacc 3240 
ggcgtggtgg tgtgtacctg taggcctagc 33 00 
tgagcctgag ttcaacactg cagtgagcta 33 60 
acagagcaag accctgtctc taaaataaat 342 0 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 34 8 0 
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aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 3509 



<210> 4 
<211> 826 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His He Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg Leu Leu Ala Pro Gly Tyr lie Glu Thr His 
85 90 " 95 

Tyr Gly Pro Asp Gly Gin Pro Val Val Leu Ala Pro Asn His Thr Val 
100 105 110 

Arg Cys Phe His Gly Leu Trp Asp Ala Pro Pro Glu Asp His Cys His 
115 * 120 125 

Tyr Gin Gly Arg Val Arg Gly Phe Pro Asp Ser Trp Val Val Leu Cys 
130 135 140 

Thr Cys Ser Gly Met Ser Gly Leu He Thr Leu Ser Arg Asn Ala Ser 
145 150 155 160 

Tyr Tyr Leu Arg Pro Trp Pro Pro Arg Gly Ser Lys Asp Phe Ser Thr 
165 170 " " 175 

His Glu He Phe Arg Met Glu Gin Leu Leu Thr Trp Lys Gly Thr Cys 
180 185 190 

Gly His Arg Asp Pro Gly Asn Lys Ala Gly Met Thr Ser Leu Pro Gly 
195 200 205 

Gly Pro Gin Ser Arg Gly Arg Arg Glu Ala Arg Arg Thr Arg Lys Tyr 
210 215 220 

Leu Glu Leu Tyr He Val Ala Asp His Thr Leu Phe Leu Thr Arg His 
225 230 " 235 240 

Arg Asn Leu Asn His Thr Lys Gin Arg Leu Leu Glu Val Ala Asn Tyr 
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245 250 255 

Val Asp Gin Leu Leu Arg Thr Leu Asp lie Gin Val Ala Leu Thr Gly 
260 265 270 

Leu Glu Val Trp Thr Glu Arg Asp Arg Ser Arg Val Thr Gin Asp Ala 
275 280 ~ 285 

Asn Ala Thr Leu Trp Ala Phe Leu Gin Trp Arg Arg Gly Leu Trp Ala 
290 295 300 

Gin Arg Pro His Asp Ser Ala Gin Leu Leu Thr Gly Arg Ala Phe Gin 
305 310 315 320 

Gly Ala Thr Val Gly Leu Ala Pro Val Glu Gly Met Cys Arg Ala Glu 
325 330 335 

Ser Ser Gly Gly Val Ser Thr Asp His Ser Glu Leu Pro lie Gly Ala 
340 345 350 

Ala Ala Thr Met Ala His Glu He Gly His Ser Leu Gly Leu Ser His 
355 360 365 

Asp Pro Asp Gly Cys Cys Val Glu Ala Ala Ala Glu Ser Gly Gly Cys 
370 375 380 

Val Met Ala Ala Ala Thr Gly His Pro Phe Pro Arg Val Phe Ser Ala 
385 390 395 400 

Cys Ser Arg Arg Gin Leu Arg Ala Phe Phe Arg Lys Gly Gly Gly Ala 
4 05 410 415 

Cys Leu Ser Asn Ala Pro Asp Pro Gly Leu Pro Val Pro Pro Ala Leu 
420 425 430 

Cys Gly Asn Gly Phe Val Glu Ala Gly Glu Glu Cys Asp Cys Gly Pro 
435 440 445 

Gly Gin Glu Cys Arg Asp Leu Cys Cys Phe Ala His Asn Cys Ser Leu 
450 *" 455 460 

Arg Pro Gly Ala Gin Cys Ala His Gly Asp Cys Cys Val Arg Cys Leu 
465 470 475 480 

Leu Lys Pro Ala Gly Ala Leu Cys Arg Gin Ala Met Gly Asp Cys Asp 
485 490 495 

Leu Pro Glu Phe Cys Thr Gly Thr Ser Ser His Cys Pro Pro Asp Val 
500 505 510 

Tyr Leu Leu Asp Gly Ser Pro Cys Ala Arg Gly Ser Gly Tyr Cys Trp 
515 520 525 

Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys Gin Gin Leu Trp Gly 
530 535 540 
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Pro Gly Ser His Pro Ala Pro Glu Ala Cys Phe Gin Val Val Asn Ser 
545 550 555 560 

Ala Gly Asp Ala His Gly Asn Cys Gly Gin Asp Ser Glu Gly His Phe 
565 570 575 

Leu Pro Cys Ala Gly Arg Asp Ala Leu Cys Gly Lys Leu Gin Cys Gin 
580 585 590 

Gly Gly Lys Pro Ser Leu Leu Ala Pro His Met Val Pro Val Asp Ser 
595 600 605 

Thr Val His Leu Asp Gly Gin Glu Val Thr Cys Arg Gly Ala Leu Ala 
610 615 620 

Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu Gly Leu Val Glu Pro 
625 630 635 640 

Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin Ser Arg Arg Cys Arg 
645 650 655 

Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu Thr Ala Cys His Ser 
660 665 670 

His Gly Val Cys Asn Ser Asn His Asn Cys His Cys Ala Pro Gly Trp 
675 680 685 

Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly Gly Ser Met Asp Ser 
690 695 700 

Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe Leu Leu Ala Met Leu 
705 710 715 720 

Leu Ser Val Leu Leu Pro Leu Leu Pro Gly Ala Gly Leu Ala Trp Cys 
725 730 735 

Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg Cys Ser Trp Gly Cys 
740 745 750 

Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp Gly Pro His Arg Asp 
755 760 ~ 765 



His Pro Leu Gly Gly Val His Pro 
770 775 

Gly Gin Pro Trp Pro Leu Asp Pro 
785 790 

His Pro Glu Lys Pro Leu Pro Ala 
805 

Gin Val Gin Met Pro Arg Ser Cys 
82 0 



Met Glu Leu Gly Pro Thr Ala Thr 
780 

Glu Asn Ser His Glu Pro Ser Ser 
795 800 

Val Ser Pro Asp Pro Gin Ala Asp 
810 815 

Leu Trp 
825 
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<210> 5 

<211> 207433 

<212> DETA 

<213> Homo sapiens 



<400> 5 

gctctaataa atttgcggcc gctaatacga ctcactatag ggagaggatc cgcggaattc 60 

ccccatgtgc catgtccacg agatggttga acagatgaga caacatggtt gtgcagcttc 12 0 

tctctttttt tttttttcag acagagtctc actctgtagc ccaggctgga gtgcagtggc 18 0 

gcaatttcag ctcactgcaa cctccgcctc ccaggtcctg attcaagcag ttctcctgcc 240 

tcagcctcct gagtagctgg gattacagga acgcgccact atgcccagct aatttttcta 3 00 

ggtttagtag agacggggtt tcaccatgtt gaccaggctg gtcttgaatt cctgaccttg 3 60 

tgatccgccc gcctcggcct cccgaagtgc tgagattaca ggcatgagcc accgcatccg 42 0 

gccatgcagc ttctcttttc tagagttaac aggagatgcg gcaggtctgt agtagcccag 480 

aaattctcag acttgcactc ttgaatttac acgcatgtac aaaatgcagc tcagcctaat 540 

gcctcctgtt cagttctgtg gtgctgccta cttcctgtgc ccaaattagc cttgcgttta 600 

taactaggag gaatgacttg gtgtcctaca atattttggc tcctgggctt ctttggtgtt 660 

cattctatta ctagttgatt ttttttttct tggaaagaaa agtattcaaa gagccagaat 72 0 

ttactttcat tatatcttac agttgttaaa ttaacctatc cttgttttca ctttctgtgt 780 

acttctttct ttggggtaca aaacagtgtt tctttagatt gtctatttct aaatattatt 840 

ttaaccagaa cataccaaat atattttcct gccagtttca ttcttcattt tacttcttaa 900 

ccattgttac gatttttttt ttaacttcgg tctcagattt tgctcaattg aagttgtccc 960 

agttcatgag attttgtttt ctttgcagct cttgaccatg acagatgtga ccagcacaca 102 0 

gattgttaca gctgcacagc caacaccaat gactgccact ggtgcaatga ccattgtgtc 108 0 

cccaggaacc acagctgctc agaaggccag gtcagaggct gtttcttaaa gatttcagaa 114 0 

aaatcccaat ttgtcatagg tttagctttt atagtgtata tggtataaat aatggcccag 120 0 

agttactttt caaatgggtt tctatttgga ttttattatc cctgaggttt tcctttagga 1260 

agagatgttc tgtatatttc aagggtccta ctagccctgg gaatttgtgt attgtgtttt 132 0 

agaagaaggt aggactgttg tccctgggca tggctgtgac taagcactgg atccttggtt 13 8 0 

tgaggcatct gatagtgacc tcactttacc agtaccaggt tttgataaag ttagggtttg 144 0 

agagtgagtc aggtgtacag cggcccatat tgaactgtgg aaatgacaga ctttgcaaaa 150 0 

tctccttgtt ttatatcttg gtttggcata atctcactgc tcatcatata tagattttta 1560 

aatttaagtt ataagatgac ccaggctttt atagtttttg acagcttaca agactttttt 1620 

ttttgtcagt ctcatacagt tcataaaata gaaaactttc agacttttga agtgagcatt 168 0 

tgaaaagcac caagttcaac attcccattt tacagatgag caggttgagg catggtgttt 1740 

aaaagagtgg gctgcttcct tgccagttaa aagcatgtcc ttgagcagaa cattccattg 1800 

cagtgttccc cactgcagag ctactgcaac ttatctatct atctgtctgt ctgtctgtct 1860 

atctatctat ctatctatct atctatctat ctatctatct gtctatctat ttatttgcag 1920 

tttgcttcct caattagatt ttgttgtcta taatatcctg ttggtcctca ccataagttc 1980 

tctctgtgga tacaaaagaa aagcttttgt ccactgtgtt ctagatctcc atttttaggt 2040 

atgagaattg ccccaaggat aaccccatgt actactgtaa caagaagacc agctgcagga 2100 

gctgtgccct ggaccagaac tgccagtggg agccccggaa tcaggagtgc attgccctgc 2160 

ccggtaggcc ttgcagggtc atcttggtgt gtgtgggtcc attacttcag cctgcttccc 2220 

ccaacactgt gcagcctaag ttgaacctag cagaggggaa gagctaattc tgtccattca 228 0 

tcccccacac gagtattatg ggcttttttg tttttaacta aaatacagtt cttaagtatt 234 0 

tgttcctact gtcctttgaa ataaagtgaa acatcctttg ctgctctgta gaattgagtg 240 0 

acagcttggt tctgtatcac tgagcctgct ccttgtctct ttcccatcca cctttgataa 2460 

cctgggacta gaccatgatg tctcagcagt cagtctgctg agcactttat ggagagtact 2520 

tcttattaac cactgggatt taattgttgg cacctgctaa tgggccttct ctgagaagga 258 0 

gaggatagat acttctgtca gcagcacctt ttaggggtga tctccagccc tgaaaacctc 2640 

aatatatcct gcttctgagg ttcaggatga tgaactcagg gcctgagacc agccagccat 2700 

gtgatatatt tggaccaggg tggtccagaa aggggaactt ccttgtctgt gcacaacatc 2 76 0 

tgagcttgtt cagggaagtt ggtttggacc aggccctttt tgaatgtccc ttggaagttt 2 82 0 



7/189 



WO 02/083077 



PCT/US02/12063 



ttgattcagc ttgcagagtg ggactctttt ctatatctca gtgtgtctca aattttaagg 288 0 

tacaggaaaa gacctgggaa tcttgcgaaa ggcatattct ggtgcaagtg tggggtgggg 2 940 

cctagagtgt gcattttatc tagtcgactg ctacactgtg aggagcaagt ctttgtctgt 3 00 0 

tcttaaatga tctctttccc atggtacctt tcttttatct cagtgactgt tactgttaat 3 060 

gaacattgtt gatgtctcca aagtactttg gttctggtga agttgctttg ttctttcatt 312 0 

gtcttccagg aagcattcat agcttctgtg cagtaccttg tgtgggttca ggatgatcac 318 0 

aggtagcaga ttacaagctt gtcttgtatg ctatagccat atcacttggg ttgtttctca 3240 

agaaggacct tctcaccttg cttttgggat gctttgtaca cttgattgta ccttccacct 3300 

gatgatatga aaacagtgca gcttttggag actatagatt tgttaattcc ttgattcatt 3360 

tccattcttg cagtttttac cccagccctc caatatgcat attcatttgt ctgctcttca 3420 

cttaggattt tagttttcta attgttcttc agaaggaagt gtaccagtct aatattggca 3480 

ccaaactggt gttttcatct aagacatagg ataagtgacc tcagaatatg ctttttagga 354 0 

tccgggagat atcaccagta aacattttaa aattcttgta ttctgcattt ggtccttaat 3 60 0 

aatgtgtcag aggctcccac atcctaatga agtacctaga atttaaatta gaaaggccat 3 66 0 

ttcggtattc agtaatttga actcataata cagtagtttt gtctgatttc taaaattctt 372 0 

ttctttctct tttcccctta atgaaagaaa atatctgtgg cattggctgg catttggttg 3780 

gaaactcatg tttgaaaatt actactgcca aggagaatta tgacaatgct aaattgttct 3 840 

gtaggaacca caatgccctt ttggcttctc ttacaaccca gaagaaggta gaatttgtcc 3 90 0 

ttaagcagct gcgaataatg cagtcatctc agagcatggt gagttaaaat cctcaaaact 3 960 

taagtttctg gttatccacc tttctaccaa gggcatgact gcagcttgca tgtggaaggc 4020 

tgtggatatg tgtaacgtgc ttggcaagaa ggggagtgct ggtgaacgca gcctgaggga 4 080 

ctgtgggttt gtgctgtcag agtctcttcc tcttaaaatt tttaatactt tgtatatata 4140 

agatctatga ataattatat gggggatgaa ttgtaacatg tatatgtgta cataatctgg 42 00 

tgacatcagt agattatttc atacctgttt tacctctgga ttctgctagg ggagaaagag 4260 

aggtcactga taattagcta ggttggatta agccacctga gttccttgga gttaaggtat 43 20 

tataatagtg cataagactg tataattacc actaagaagt gtacatctca gctggatgtg 43 80 

gtggctcaca cctgtaatcc cagcactttg ggaggctgag gtgggtggat cgcctgaggt 4440 

caggagttca agaccagcct gaccaatata gtaaaacccc gtctctacta aaaatacaaa 450 0 

aattagccag gcatggtggt gtgcgcctgt aatcccagct actcaggagg ccgaggcagg 45 60 

agaattgctt gaacccagga ggtggaggtg gcagtgagct gagattgcgc caccgcactc 4620 

cagcctgggt gacagagcga tactccgtct caaaacaaaa aaagaacagc aaaaaaaaga 46 80 

aatgtacatc tccttgggtc tttcatagcc cagccatctc aaaagaagag agcaccttct 4740 

tgtcaagagt ttctaagccc agaaaggctc aagttctctg cttgtccacc cagtgctctc 48 00 

agggggctta tagtcaatat tccatgatca cattttgtca tttttagtct ggagtcataa 4860 

attgtgatcc taccgtcagt taagtagact catagaacaa agctctttca gcagtttcag 492 0 

ctgtggtaca gaaacgttta gtggaaatgt tcttaccaag cagggaggag ttgagggcaa 498 0 

cacttccttg ggtacagcct ccttcatgtg aaggtatgga aatgtggcct gggtctcggc 5040 

tgcctgtggc ctctgtgtac cacctatagg gccattctga gactggtagg aggtgccctg 5100 

tatttagttt tctccaatta gtcccttttt tcagtgcaac ttagatgggg tatggacact 5160 

caaacattgg tgacatattc ttagtgtgtt tacctcaggc tactgtgacc acatttggta 5220 

tttcataata ttttgatagc tttttcagat ttcagaatct ccattggtga ctgtctctgt 52 80 

tgtttctctt tccatgtcca aatgtgggtc tcttccagca ttccatcctt ggctggcagc 5340 

tgacctttcc catagttatt cactccctca gaaaatggat ggcacccagc tttgcttatt 54 00 

accctggtgt ctctaacaat gactctcgag tccacggaag ttaaaagggt tcaaccaggt 5460 

ggccacagat atacttctgg taccctttct ctcttcctta ggttttctaa ctctaaacgt 5520 

ttctgggtat tctaatctgc tgtggccacg tttatgaaac agaaattcac agtcttaatg 5580 

ataagactga cagatgagag acaactgaag tgtaatgtcc ttccacagct atacctctag 5640 

atgtagccca gttagtagag cccagatttt tatggaaaaa caagaaagga cacctagcct 5700 

aacccttagg acagaggctg gctggtagaa agctggaagg aggtgacccc tgcaggtgag 5 760 

aaggagtgaa ttaggatgtc gaagacggaa gggctttctg tgatttaatt agtgccccca 5820 

tctgtgagat gtagagggag atgattaagg gagtggctct ttgagtgagc tgcaggtaag 5880 

tttgcatggt tggctgcagg ctggatggga ggggattttt atagttgagc ctcaggaagg 5940 

aaccaaggcc agaccctgcg aggccatggc tacaatagta atggatttga attgtatcgt 6000 

gaaggcaaag aaattattga agggccttta aaaagtattt ttaattgtct ttctttcttt 6060 

tggatctgtt tatttttagg tcttttagta ggcatgtgta ttttttcctc tcaaaaatgg 6120 
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aaataggctg ggtgtggtgg ctcctgcctg taatcccagc attttgggag gccagtgagg 618 0 

gaggattgct tgagcccagg agttcaagac tagcctgggc aacacaggga gacctcgtct 624 0 

ctacaaagga agttttttta aagaattagc cgggcatgat ggtggcacat acttgtagtt 63 00 

ccggctactt gggaggctga ggcaagaggg ttgcttgagc ccaggggttt gaggctgtag 63 60 

tgagccatga tcatgtccct gcactccagc atgggcaaca gagcgagacc ctgtctcaaa 642 0 

agaaagcaaa gggagggaaa tacagtatat cttttgtttt ataactacca aaattaggaa 6480 

tacttaccat ttcttggcta aactttatat tttgattttt aaaacttgtt aaaaattgca 6540 

atgagaagga aatttcagga gagcagaaga cagactgtcc caggtgtcac tgtcctatta 660 0 

ttccctataa aatccagtgc caggatggat gaatggataa agcaaatgtg gtataagttg 6660 

aatatccctt atctaaaatg ctttggacga ggagtgtttc gaattttaga atatttgcat 672 0 

tatactaacc agttaagcat ccctaatctg gaaatccaaa atgctctagt gaacattttc 678 0 

cttcagcatc catcatgttg gcactcaaaa agttttagat tttggagtgt tttggatttc 6840 

agattaggga tactcagcct gtgtttgggg gtagccatct cttcatatag acatttcaga 6900 

acttaaatat tgctttgcta taatttctgt gaatttttga tatattatct tctctgagct 6960 

acatttttat cctttataaa atggccatat tgaagtgatg atctatccta atctaccatg 7020 

gctgagtcaa gggataaaga ggttttcctg tgtctgtggg gtatacttaa cttggtggtt 7 08 0 

tttatctaga agcttgtttt ggtcaagatg ttggttatat tcaggccagg catggtggct 7140 

catgcctata atctcaacat tttgggaggc caaggtggga ggatcacttg agctcaggag 72 00 

tttgaaacta gccagggcaa catggcaaga ctccatctcc aaaatttaaa ttaaaaaaag 7260 

atactatctg tattcatagt tgtgtctctt ttgcctttag tccaagctca ccttaacccc 7320 

atgggtcggc cttcggaaga tcaatgtgtc ctactggtgc tgggaagata tgtccccatt 73 80 

tacaaatagt ttactacagt ggatgccgtc tgagcccagt gatgctggat tctgtggaat 7440 

tttatcagaa cccagtactc ggggactgaa ggctgcaacc tgcatcaacc cactcaatgg 7500 

tagtgtctgt gaaaggcctg gtaagttcac aggtgaatta ggtggtattc agagtttatt 7560 

gtgagagaaa ccataggagg catagttcat tgctgagatg tgtgaagtag tcatgaaaac 7620 

agatgaagta ttgatttcaa gcatgcaaag aagagtataa cccagatttc agaagcagaa 7680 

ggaaatattc tgggaccctg aatagtttta attataagca aaactaaaaa taactaacac 7740 

tactcgaaga aactgatatt ctaattaaca atgagattga taggtttatt gaccagaaaa 7800 

agtattgaga attgctctga aaagcaaatt tattggtgtt ggcagagaaa tgctgtggaa 7860 

gaagaaacaa aaaaggaaaa taaaaccaaa gaaaagatta gtaaagcaag caagtgactg 792 0 

cagggacagt gttcagaaag gtagtgtcaa cagggagaaa aatgatgaga gcagttctgt 798 0 

aagcgaggac agaagcaacc cagaactacg cgagagctgc aggagagtac tgagcagaca 8 04 0 

gacactcgga gtagcttccc agctttcagt ctctctgggc tacattttgg ctaactaaag 8100 

gcaggcccag gggtagctgc tgcagcatga agacaactca aaataagcct cctcctcctg 8160 

cagagggact cacaagcagt cggtggaatt gtgggttatt ttgggaggct ggactccatt 822 0 

tgcattgtgt ccaattttgg gaaagtaatt tgtctgagga attacagagg tataggagag 82 80 

aattcatagc attgtagatt tctaaatatt gatttctaaa cttctctagg ggagctgccc 8340 

agcagccttt cagaattctg aatcctttgt tgaacttaat gaatgccgta gaccttcttc 84 0 0 

cctccaaaat tgcaaacatg agattttaca tattcaagtg gattgtaaaa cccctaaagt 8460 

tcatcctggg tccccaaatt ctaaggggct tacagcccta cttttatgga aagattcact 8 52 0 

gtttcctcat ctttgtcctt aatgctcttt gaaaagaaaa tttatttttt ccacaagtgt 8580 

gtaaatacac ttgactaaac agtacttgat gacttttatt gttttctatt ttctctcatt 8640 

taattgattt catggcacat taatgggtca tcacccacat ttgaaaagtc ttggctgggc 8700 

acagtggctc acgcctgtaa tcccagcact ttaggaggcc aagacgggtg gatcacaagg 8760 

tcaagatatc aagaccatcc tggctaacat agtgaaaccc catctctact aaaaatacaa 882 0 

aaaattagcc agacgtgatg gcgcacccct gtagtcccag ctacttggga ggctgaggca 8880 

ggagaatcac ttgaaccggg gaggcggagg ttgcagtgag ccgaaattgt gccactgcac 8 940 

tctagcctgg agacagagtg agactccgtc tcaaaaaaaa aaaagaaaag aaagaaagaa 9000 

aagtctttga ccttagcgga catggtggga gctctaagtg tctctcttgg gtttcattcc 9060 

cagcaaacca cagtgctaag cagtgccgga caccatgtgc cttgaggaca gcatgtggag 912 0 

attgcaccag cggcagctct gagtgcatgt ggtgcagcaa catgaagcag tgtgtggact 9180 

ccaatgccta cgtggcctcc ttcccttttg gccagtgtat ggaatggtat acgatgagca 9240 

cctgcccccg taagtgaaaa agggagccct aggcacttat gcatgccctc tgtataggca 93 00 

acaactcagc catgaggctg tgctgtcagc ctctgaacat tttagaaaca agactggaca 9 3 60 

tgacctctgc tcaaacctga ccagagactg ccatcgagac cttgctgcct attgagaacc 942 0 
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ttcatacaga atcaggcaca ttgacagtaa ataaatgtaa gatagatcac agagtacaga 9480 

aataacttgt ccaacttcag tgtcatattg ctcaatccat gtaatatctc catatctgaa 9540 

tttcctaatt tgtaaagtaa atgctttcca atagataatc tctaaggtcc cttttgcctt 9600 

caacatcctg ggattgagag aggagggaag ggtcatctct gttatgtatt gggcaaaata 96 60 

ctgggctctt tacattcatt atctctttta aataatcaag acagaataat atttttgact 972 0 

caagccagtt gaatagtctg ttaaaaaaaa agtaaataca gtgaattcag atctacctgt 978 0 

gatagtcaat tgcaactttt tttttttaat agctgaaaat tgttcaggct actgtacctg 9840 

tagtcattgc ttggagcaac caggctgtgg ctggtgtact gatcccagca atactggcaa 990 0 

agggaaatgc atagagggtt cctataaagg accagtgaag atgccttcgc aagcccctac 9960 

aggaaatttc tatccacagc ccctgctcaa ttccagcatg tgtctagagg acagcagata 10 02 0 

caactggtct ttcattcact gtccaggtaa gatgccttgc atatccaaat tcaagtgttt 10080 

cactactgat ttatgaagaa taaaaccttg aaagctacgt tgtgtatatg taactccctg 10140 

ccctcagccc ctttccttcc tcctaatggt tggtacaaga aggaatagac cagaagctgg 102 0 0 

tccaaggcct gacctggacc tgctgagagt ggtggtgggt tcctaagaaa ccaattctaa 102 6 0 

gaaattggcc tttgattcag acttgaagtg accactcagc aatgtgtctg tgggtttcta 10320 

gaacagttgg gagaggctgg gctggtgcaa agactcctca gagattagca gtcaagaact 103 80 

tctctaagag cctgccattg acaacagggc tgtttgtgag gactttgtaa gggaaagtcc 10440 

actgtaaaca aagctaaaag ggcagagaca gactgggaga aaatacctgc actgcatgta 1050 0 

acaactgatg atcatccaga atatgtaaac tcccaccctc cagggaaaaa ataagaagct 10560 

aaatgtgaac ccaacagaaa aagtggttat tggagataaa gaagccattc tcagaaaagg 10 620 

aaatagaaca ggaaaatgta aactaacacc aggaaccaat tttttgtcta ctaaactgga 10680 

tcaaattttc cgtgttccct tttttccata ctaagatatg ggggacctgc aattctattt 10740 

ttaatctgct aggagttaac tttttaacga aatatttaaa tctctgcttt ttcatgaata 10800 

tcacgaatat atctggtaaa atgacaaccc agagaatggg agaaaatatt tgcaaagtat 10860 

atatctaata agaatccaat gtccagaaca cgtaactctt aaaactcaac aatagaaaga 10 92 0 

caacccaatt aagaaatgga taaaggattt aaatagacat atgtccagag aagaaataca 1098 0 

aatggccaat aagcacatga aaagatactc aatatcattc atcattacca agaaatgcaa 11040 

gtcaaagcca caatgagata ccactttaca cccactgaga tggctgtaat caataaaaca 1110 0 

ggtaataaca agtattggaa ggatatgtag aaattggaac tctcatgcag gttggcaggt 11160 

cctcaaaaag ttaaatatag agttatcata cggcccagca gttttactcc taggtacata 1122 0 

cccaagaaaa ttgaaaacat atgttaccca aaaacttgta tataaatgct tatgcttata 1128 0 

gcatcattct tcatagcatc atgctcaaag caacattatt cataataagc aaaagtgaaa 11340 

acaagccaaa tacctgttag ctggcaaatg gatgaacaaa gtgttgtata ttcatacagt 1140 0 

gtaatgttat ttggcaataa aaaggaatga agtactgaca tattttctga catggattac 1146 0 

cctaaaaaaa catgctatgt aaaagaagcc aggtgcaaaa gattatgcgt tgcatgatgc 1152 0 

catttatatg aaatgtccaa agacagaaag tagatgttta gtggtttcct agggctgggc 1158 0 

atggtaatga agaataatag gcatgaggtt ttctagagta gctgcagcat tattttctga 1164 0 

catggattac cctaaaaaaa atgctatgtg aaagaagcca ggtacaaaag attatgcatt 1170 0 

gcatgatgcc atttatatga aatgtccaaa ggcagaaagt agatgtttag tggtttccta 11760 

gggctgggga tggaaacaag gaataatagg catgaggttt tctagagtag ctgcagtatt 11820 

attttctgac atgaattacc ctaaaaaaac atgctatgta aaagaagcca ggtacaaaag 1188 0 

attatgcatt gcatgatgcc atttatatga aatgtccaaa ggcagaaagt agatgtttag 11940 

tggtttccta gggctgggga tggaaacaag gaataatagg catgagattt tgtagagtag 12 000 

ctgcaccatt ttatgctctc accaacatca tatgaaggtt catatctttc cacatccttg 12060 

ccaatacttg ttactatctt tttaatgaaa actgttctag tggatgagaa atggcatctc 1212 0 

actgttgttt tcatttgtat tttcctgata actaatgaaa ttgaacatca acttcattag 12180 

ttagcctttt gtatatcttc tttggagaaa tatttagtca aattctttgc ccatttttca 12240 

gttatgttgt cttttttatt attgagttgt aagagttctt tatgaattct gaagtccctt 12300 

attggatata ttatttgcaa atactttctc ccattatatg ggtttctttt cactttctta 12360 

atatgccttt tgaaatcaaa agttttcagt tttgattaag ttccatgtat cagtgtttta 1242 0 

ttttatccca tgtgcttttt ggtattgtat ctagaaaatc agtgcccaag taacccagga 1248 0 

tcacatagat ttgctcctaa gttttctttg aaaagtttca tagatttgtg tgttacatta 12540 

ggtccttgat ccattttgag ttaatttttg tgtatggtgt gaggtagggg tccaaactct 1260 0 

ctttttgcct gtggatgtgg tcctgcacca tttgttgaaa agattttttt ttttttttaa 12660 

ccattgaatt ttcatggcac atttgttgca gatcagttga ctgtgaatct aggaactttt 12720 
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t'tctaagctc tcatttttgt gcagttgacc aatgtttctc cttttgccaa tattacactg 12780 

ttttgattac tgtggctttg tataagtttt aaaattggga aggctaagtc ctccaccttt 12840 

gttcttattt tgcaagactg ttacagctat tctgggtttc ttgcatgttc atattaattt 12900 

taggatgagc ttgccatttt ctgcaacaac aaaaagccaa ctggtttgat aaggtttgca 12960 

ttgaatctac tgaccaattt ggggtgtatt gcctgtttgt ttgtttgttt gtttgtttgt 13 020 

ttgtttgttg agagagggtc tcactctgtc acccaggctg gagcgcagtg gtgctatctt 13 08 0 

ggctcactgc agcctccgct ttcccaggct caagtgattc tccagcctca gcctctcgag 13140 

taactggtac tacaggcatg agctaccaac acccagctaa tttttatatt tttttagaga 132 0 0 

cagggtttcg ccatgttgcc caggctggtc ttgaactcct gagctcaaag caatcctcct 13 2 60 

ggcctttgcc tccctagttc tgggattaca ggtgtgagcc accacgcctg ggccccttgc 133 2 0 

cattttaaca agggttaatc ttctagcctg tgaacctgga atgtcaattt attaccctct 133 8 0 

ttactttctt ttggcattgt tttatagttt tcagtgtaca tcattgtctt gttccttttt 13440 

ttaggggaaa ataattcagt ctttattaaa tatgatattt gtttagtttt ttttatacat 13 50 0 

gccctttatt gggttgagaa agtttccttc tattctagtt ttttgagggt tttgtaatga 135 6 0 

atgagtgcta gattttgtca aacacttttt ctgtgtctac ttagatgatc acgaagtttt 13 62 0 

gttctttatt aatactatgt attacattaa ttaagaatgt taaaccagtt ttgcattcct 13 68 0 

gggataaagc ccatttagtc atgatgtata cttttttatc atgatgtata tttttaatct 13740 

gctgttggat tctatttgct agttggggat tttgtgtgta cattcatgag ggggatataa 13 80 0 

gtttgtggtt ttcttctttt gtgattacta tctggttttc atagcagggt agtactggcc 13 8 60 

tcctagagta agttggaacg tattcctcct cctctattta ctggaagagt tcgtcaagga 13 92 0 

ttagcattaa ttcttcttta gatgttgatt gaattcacca gtgaaaccaa atgggcctgg 13 98 0 

cctttttctt tacaggaaaa tttttaatta ttaattcaat ctgtttgtta tagatctatt 14040 

cagattttct ttttctgagt gagtcagttt tggtaatctg tttttctagg aatttgtcta 14100 

ttctcagtaa tcaaatttgt taacatacag ttcgtagtat tttcttttgt tttttttttt 14160 

ttttttttga gacagagtct cactctttca cccagtctgg agtgcagtga cgtgatctct 14220 

gcttactgca acctccgcct ccagggttca agtgattctc ctgcctcagc cccccaagta 14280 

gctgggacta caggcgcacg ccaccactcc tagctaattt tttattttgt tttgttttgc 1434 0 

tttttttagt agagacaggg ttttaccatg tagaccagga tggtctcgat ctcctgacct 144 0 0 

cgtgatctgc ctgccttggc ctcccaaagt gctgggatta caggcgtgag ccaccacgcc 14460 

cggccagttc atagtatttt cttgcaggtt gctctttagc aatatgtagt ctcctgttta 1452 0 

acctgttaac gatttttttt tctgtctaat ttggttttat tattctttta cattttgcaa 14580 

gggctctaaa tattgtgagg gttttttttt ttaagagctt gctctcttat attgtggatg 1464 0 

caataattta agccttattg aatgtactaa aattcgtatt tttcattctc tcatatttct 14700 

tgcattaacc cttccttcag gatttgttgc tgtgcgtgtt tatcctggtg gtattcggag 1476 0 

attgagagtt ttcttatact ggttaatcct caaggttaat ttgtattaat aactaaatgt 14820 

gtagatctgt caatattggt cagtggttgg gtgtgctgag ctgttgtgaa agtgtggtat 148 80 

gatgtcctgg aaaagctagc aagccagcct caggacactc cagctctggt ccacttcgac 14940 

tgttgttgac tggtgtgcgc atctccctca ttggaagaag gaaggagcaa tggacctggg 150 00 

tggctgggtg caattagcat gttcatgggt gtggcaagtg cctcccagcc ttgggtggta 15 0 60 

gggccaccta aagatagctg gcacattgcc ttgagttttc tttcattctt gcttgcttgc 15120 

ttgcttttat ttttgagatg gagtttcact cttgttgccc aggctggagt gcaatggtgc 15180 

tttcttggct cactgcaacc tctgcctccc gggttcaagc ggttctcctg cctcagcctc 15240 

ccaagtagct gggattacag gcacctgcca ccacctctgg ctaatttttt gtatttttag 153 00 

tagagacagg gtttcactgt gttggccagg ctgatctcga actcctgacc tcaggtgatc 153 60 

cacctacctt ggcctcccaa agtgatggga ttacaggcat gagccaccac acccagctga 1542 0 

gttttctttc aaacactcaa atactaacag gtgcataaac agaacaagta caaggctatg 15480 

gaaggttgcc aagatggtga aaaactgcat cacactgatc aaccaaggca gatctgaaga 15540 

atcagatttt gcttacagtg caggttgtgc actatgtaaa tgtttccccc atcctattcc 15600 

cccctagagt tctgcagtgc acagcctgtg cagctgtcct gagcagtcct cagaggtcca 15660 

agcttcatat acctttctag ttgtaaagcc ctatatccat taggagtctt agtaatccca 1572 0 

cttgtggaaa tgtttatccg taaagatggt tatttttcta ataatgaaaa atgtaaagca 15780 

aactaaatat tagaatataa gggaatggtt aaataactgt taagattcat caaaatcatt 15 840 

ctaccatttg aagttttgtt tataattgat gaaataaggt ggtaaaatag ttgtgatcta 15900 

agttaagaaa ggtgaatatt aatatgaaca taattgtgca aattatgttc ttctggtata 15960 

gaaaaatacc agaagaaaat atatcaaatg tgaacagtgg tctctgagtg gttggattgt 16020 
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atatgatctt tccccatcct tctatttttt aagtatactg tcatgagcat acgttatttt 16080 

ttagtgaata acaacaaaaa atatttttta atatcactat gggctttcac cttgctgtga 1614 0 

tacattttca attttcctgg actagctctt ctttgatatt ttttatctta tttctcaaat 16200 

tagtttttag tgggccagtt ttggtgattt atcaaagata aattaactac agagatagtt 16260 

gcagataaat cttattgaac ttaactacat gcagttcttc catggtacag accccttgaa 1632 0 

actctttttc ttgcagcttg ccaatgcaac ggccacagta aatgcatcaa tcagagcatc 163 8 0 

tgtgagaagt gtgagaacct gaccacaggc aagcactgcg agacctgcat atctggcttc 1644 0 

tacggtgatc ccaccaatgg agggaaatgt cagcgtaagt caaattggtc aggtttactc 1650 0 

atggcaaatc ggtgtggaac acagcacttc tatttgactt gaatatctag gaggaaaaaa 16560 

gccactttgt tttggatacc acatttctta ttaaatcata ctggtcagaa gctcagctga 1662 0 

gctggaagaa agaaccaagg tttcagtgta gctggttaaa gcaacaagcc aaaaatgtta 1668 0 

gagtcaagcc tttgaccaca ttctgcagtg gtataagcaa gggcccaaaa ccagagaaag 1674 0 

tgccaacttc ccctgtttcc tttcctccca ttgaacaggg cacaggtcaa tagtcagagg 168 0 0 

gatcagctca gacatgggag tcttggggct catctcactg aggcccggac aaaaggttcc 16860 

agcagtatat ggacactaag gttgtgggaa gggcaaggtg taagggctag gactgggaaa 1692 0 

gtcctgggtt ggagtcagag ttaggaaagt gtcttctgaa ttccccctcc tccccccatc 1698 0 

aggaggcctt ggtgcctgaa gaagacctca gcccaaggcc tcagataggg agcctgggaa 17040 

taaaggctcc agggctagga atgtgaacat gtgaaggagc gtgacccgtg aggtagacct 17100 

gagtccttaa ctgcacctct ctcagagccc cacacttagg cccacctcac aggtcatgct 17160 

ctgccaggta aagcctgtgg cattgcctgt ggtcaggccc tgaaaaatca cacacgggct 17220 

ttttaaccag gagccagact cctcaatact tatagtgtat tttaaaggtt taaatagtcc 172 80 

cgtgttggct gggtgtagtg gctcacgcct gtaatcccag cactttggga gggccaaggc 17340 

aggtggatca cttgaggtta aggagtttga gaccagcctg accaacatgg tgaaacccca 174 0 0 

tctctactaa aaatagaaaa attagctggg catggtggcg ggggcctgta atccaagcca 17460 

ctcaggaggc tgaggcatga gaattgcttg aacctgggag gcggaggttg cagtgagcta 17520 

agatcacgcc actgcactcc agcctgggca acagagtgag actcagtctc aaaaaaaaaa 1758 0 

aaaaagaaaa aaaaaaaagt cctgtgtttc cccatttatg tcaaatagaa gcctgcaagc 1764 0 

aataggacat tttattatga gaacaaaaat tatgacaagt gatattaatg aactgctttt 177 0 0 

tttattctag tttccagcca aaaaacataa tgagtactat agtcgaaaga cttttcaaag 17760 

ttctgagcag gaaaagtaaa caacataggg aaatctctac ctactcccca gatcccaact 1782 0 

ccaaactctt tggcatggcc cttcagaatc tgatctgatc caagaccagc caggagagag 1788 0 

gcagcgttct agctgtgtgt cccttactct ccctggggtt cagtttcctc ctaataatag 17940 

ggtggttgtt aagatgaaag cagagcagca catgacacat gttccacagg atactagtca 1800 0 

atacttagta gatgtgggca attcttatta cccttctaat gccattctcc actactccta 18060. 

agtaaaagtt cttgctcaga ttaaaaggta tttggtgaaa atgttttccc cactctttgt 1812 0 

gaatagactt aatccgttag acagtaacca gagtacctaa cagagtggtg gagtgtacaa 1818 0 

ggcattgtgc tgttacccac tcaagacaca ggtgctttta ttatccccaa gatcaagtaa 1824 0 

actgcccagg gtctcacagt gagatgtggc caagctaaga accacgccca gtctgtgctg 18300 

accttaatta cttggctggt tgtgcttcca gatgccatca tacccacatc agcagctgct 18360 

atctagaagc atcacatttt cctctgtgag atctacaggg cctgagtaca atttgcctta 18420 

tttttcagag tcctaatcca cggtagaaag cgtggtgttt atagaatact gcagatggca 1848 0 

ccaagtcttt gatgttttct tcttaaaatt gtatagtcct taggtaacga tagtagccat 1854 0 

gatttcttga atgcttactg tgtttgagac atcatataca tatcatcaca aacaccccta 18 60 0 

tgagataggt actgttgtta tccccatcat atggacaagg aagctgaaac tcagaaaggt 1866 0 

taagtagcat tcccagagtg acagaagagc caagtagagg acgctgaatt caaacccagg 1872 0 

cagatggacg taaggccttt gcttttcact atacattaca caccttcccc cagtctcaac 1878 0 

agagccaagt gtcagacact catgattctg acttcagcat ccattctgtt aatctctgca 1884 0 

ctattctgtt ttatagtttt ctttgggtga ttccaaagga tttattttta aagcatacct 18 900 

ctctccagct gaatgccttt catttattca ccagcaaaac agggtataaa gtgaaaaggt 18 960 

gttgaccaaa aaggctttca cttttttcaa ctggagtata atatttatca cggcttgtat 1902 0 

tacgttggat gataaaagga gagatgttga gttggccaga tgagggagat gggaagagct 19080 

attttctgta aaggcaaagg aaaaggtgaa gcaagggtgg catgtcctga gggcccctaa 1914 0 

agcgctgacc ttggcattgg atcctggttc tgagccctgc ggtgattcca aatgtgacca 192 00 

tcgtggagaa ggcacctcag ccaagaccag ccctccttca gaggacagtc acctccagag 19260 

cacctcccta ggggcagcag ctatgtctgt gtccccacac ccagcacctg gcattctaga 193 2 0 
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gatgcttcac aaatatttat tgaatgaatc aaagaatcat agcagtgaaa aagagagtct 19380 

attgaaaaga tcaaaatagt cattgcttca gaaggcagtg ggtaccagca ttcactttcc 1944 0 

tcctgtcctt tgtttggttc ctttaatgtt acctatacat tattatcatt agctgaaaaa 19500 

tcatgaatct tacccattga atgctcgtac tttaatctgt ctttcctact gaactcaatt 19560 

cagataaaat tgcttgtttt ggaaaaagtc tccaatagtc agaatttttt aagccagttg 1962 0 

tgacctctgt tcctttttct tctctgcagc atgcaagtgc aatgggcacg cgtctctgtg 19680 

caacaccaac acgggcaagt gcttctgcac caccaagggc gtcaaggggg acgagtgcca 19740 

gctgtgagta ccatactccc tggaccacca gggaggacca agaggctgtg cagctgcctg 198 0 0 

aaccccaccc tgagagccac ccacttccct gtgtcttgtt gctgtgggct ctgagggatc 19860 

cctgggttga ttagtttgaa attttgccca ttctatttca gacaggtcag tccccaaaat 1992 0 

gaggaggtcg tcgagttagc agcaattcct taatggctct tgaattcaca tttttgttta 19980 

aatgatactg acatttcctg ggttgtccat ttggagtagt cattttaact tcagcaacta 2 004 0 

cttgattttt gtcatgtcaa gagattatac tctttcccaa agagtagaga tggaagagct 20100 

ggtgttttgg tgatggcatt tatttggcgt ttggttgtct gattgtggaa atgatcctct 20160 

tacctttaac atttcccatt actctagcat tttccttgtt gaagcatgtc aggcatcttc 2 022 0 

ctggagaggg tttgaagtcc ccatccatga ggatgcaggt gcagcagcat caggcatgtt 2 0280 

taaggtttca gacctggccc tgcctccact gagcaaggtg ctcttgaggc aaatcactgt 2 034 0 

tccccccatc tgtcagtagt gggactagca gtaattgttt ttggtgtcct ggtaggttga 2 040 0 

aacaggaagg atagttcctc aatcaatgta taccgggctg tctcactggg aaacctcata 2 0460 

aaatgcatgt atccgcattt actcattctc aagaaaaaac ttaaaaagtg tttagtgtca 2 052 0 

aaacactgag ctgattaatc attgtaaaca tcattatttc ttaaataaca gtagtaataa 2 058 0 

tgccaccagt gattgagaca tcattgctgg tgctactcag cagggtttgg tctgtctact 2 064 0 

ttttagttta tatctgcata tatgtttact agcttcatgt tgccctaaag ccaatatgtg 2 070 0 

aagaaaagtc taatccaagg ttaaatttaa cttttaggtc cacattccct taaacagaaa 2 076 0 

aatgtcatcg agcaaaatta atcactctta ccctcagtta gctgatgcaa caaagacatg 2 082 0 

aggtggcatc aagtaagact ttcaagttcc ttcgagatcg cagatgtagg caggtgctgc 2 0880 

tcatcaccct ggctaaaggg acacagcata cctgcctggg aaaggccatt acccacttcc 2 094 0 

cgccctttcc tcttcatgca tagctgttgt gtttacctaa atgcttcatg aagtgggagg 21000 

ctgggttttg ctgatgttta tagatgatct gtatggggaa attgttttct gagtagaaca 21060 

tatttttttt ctacagactg gtgaacactt gttccagggt ttaaaaaaac agtgattctt 2112 0 

ggtatttagt cttctctcac ttgtgtttaa agaagagaat tagtttatcc agggaacata 21180 

ggaaagaagt gagaaattag tatttgaaaa atgttttgac tgcactttta gaagaatatg 2124 0 

tagtccacac aaaattgaaa atgtttttca ttatagtaaa taggtttatt acatttggat 213 0 0 

ttctctcatt gctttggcaa atgcttgtca attgtcctca tatagttggt gtgtgaactg 21360 

ctgagcagtc agtattgaag cgttcatgca tgctcttcct agactgcctg agctgagtta 2142 0 

tggtgaagga tgcaattata atggctccag actatctgta ctttatataa aaggatctgt 2148 0 

ttggttttaa aatgaattca gtttctgttt taaatagcag tataaaatag tcttttttcc 21540 

cccccagatg tgaggtagaa aatcgatacc aaggaaaccc tctcagagga acatgttatt 21600 

gtaagtggtt ttgcaattct tatttctaga agcaaagtag ttcagtaaaa cttcattgtt 21660 

taaacgggtt tgagaatagt aagtgctata agactatagc agccaccaat gaagtgttcc 21720 

cagacttgat atgtttacat tctgttaagt ttactacata taggagcact gttttaagct 21780 

gttttaattg tgtttggggt taacgttaat gtgtccatag cagatagcag gagagagtag 21840 

a 9 a 9gcatgc atctttgtct atccacattt atgttctctt aaaactttac tttatttgcc 21900 

attacctagt tggggtcatc atatttcgtg ttttaggatg tagatcaaaa acagaattct 21960 

tacaatatgg ttgaactttt tgtcattctg tctggaactg atttgtgtta accaccttga 22020 

ggtgaaaggt gagtctgaca aggtgaccat gtttttatgg gtaaatgtgt tttctcttta 2 2 080 

tgggagaatc cacatggtag accagagtac gggaccagaa aaaaggagtt aatgttatgg 2214 0 

catatccatt gtaattatat acctgctgta ctgggttgaa taacatccct tcaaaattca 22200 

tgtccacttg gaacctgtga atatgacttc atatatatat atataaaact tctaaaagaa 22260 

aacaagagaa aaacatcttg accttgggtg gggaaaagtt gttagatagg attctaaaag 22320 

cataattcat aaataaaaag tagactaatg aaactatata aaaatttaaa acttgtagga 223 80 

aaaaaccatc caaaacataa aaagccacag actagagaaa atactcgcac atcatcatat 2244 0 

atttgataag gagtttatat ccagaataca tgaggaactc ttacaactga ataagatgac 22500 

aattccagtt aaaaacaggc aaagatttga aaagacattt catgaaagaa gatattacag 22560 

tggccaataa gcacattaaa aacgacttaa catcattcgt tatttggaat ggagatttta 22620 
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sfaaccacagt 
tagaagacag 
tggttaaaca 
gatacccttt 
tggacaactg 
cccacagact 
ttacttaatt 
tttagttttt 
tcacctttag 
ctcctgacga 
ttttcttggt 
cgagatgatc 
ataaaaggca 
attccacgca 
caccatcaga 
aaagacactg 
ccccaatcag 
tcctagagtg 
ttaagaagtt 
tttatctgct 
aggaaggcca 
tacagtttct 
ttggtaaaca 
acatacacaa 
catggaaaac 
aaacattgct 
tacaagtaaa 
aaaatgtact 
attactttaa 
ctgtattgac 
tatatgaaat 
atatacttca 
tcttaaccag 
tcttacgaca 
cacaagcatt 
aatacaggtt 
atgaaaatac 
tgcaagggtt 
gggtcctgga 
tgccagcaaa 
gaggtcttat 
tcacatttta 
atgatcaaaa 
ataatttcag 
caaaaataat 
tattaatgtt 
attcctcagc 
ttagccctgt 
atactcaatc 
catgaatata 
aaaggctgtt 
atatgggttt 
aggagtgaac 
ccagcacagg 
tttctcatgc 



cagatacagc 
tggtaatctc 
catgaaaaaa 
taatagcctt 
gtcatttctt 
tgattttctt 
tataagaaaa 
aaatatgcat 
tctatcccag 
agtaagattt 
cattacggat 
tctagaggta 
aaagtgatgt 
tacttactct 
gggcagatag 
ctgattgaaa 
acagcaaggg 
agttcttagc 
agccaggaat 
gtcattaacg 
cactgcggga 
tcaatgaaat 
taatggtaaa 
agccatttac 
aatatggcag 
aaatttaatt 
tgaaaatagg 
agaaaagata 
atgaattatt 
tggaattttt 
gtattcactg 
tgccattaat 
gtgggaagca 
gaatacttac 
ttaagtttat 
aaaactgtga 
agtatttgca 
ctatgggcag 
accgatgacc 
tactaaatac 
cagtaaatga 
aaaatagctg 
caacaccctt 
tacagtcgga 
tgtttgaccc 
ataatttacc 
aatatcttta 
gaactgctat 
atatttctct 
ataaaattat 
tccagttttg 
cagttataaa 
ttcagacatt 
tcaggagatg 
agttaggcat 



cctaaagtga 
atgtctgctt 
acctggcctt 
ttcagataat 
taaagcggtt 
tgaggaacct 
gcagcctgta 
ttctttcttt 
gaagatgat c 
tttaaagtct 
ggacgtactg 
t aggaaagaa 
taataagatt 
gcctgtcagc 
cagagcctgg 
tgccagtgat 
aggggacccc 
ctcattagga 
aatttaaatt 
atgaagccca 
catttaaaac 
ataaagtcat 
ttatttagga 
cattacaaaa 
taagacaaag 
ttaaaagtag 
atttgttttt 
cattctcaaa 
ttatggaaaa 
ttcttaacga 
cctcataagc 
gtcattaaaa 
accatgaaag 
tagagttatc 
tgattcttetc 
ggatccactt 
agatgtgaaa 
actgcggact 
cacgtatacc 
aaataagtaa 
gaaggtaaat 
tttggaaatg 
aaaaatctta 
tgcatctgta 
cagttaagag 
taacatcacc 
ttagtcatta 
atctctaagt 
tgaaataata 
tttttaaatt 
gagaagtttt 
taaatataat 
tgaaaacagc 
caaatcactg 
ttgataaagc 



ttagaatcca 
ctgcattcag 
accaggcatg 
tatggacgtt 
tgatgggtta 
gggtacagct 
tctctaaaga 
ccatagatac 
gctattacac 
tcctattttg 
ccttaacagt 
atgttagact 
taccaggatc 
ccatcatggc 
gtttactctc 
gcagccccaa 
aatcagacag 
ggcaaaacag 
gttaaaatat 
acctgcchta 
tgaaacatac 
gcagtaccca 
acttccttgg 
ttccattaac 
tatattgctt 
aacaaagtct 
aatattctat 
agtaaattgc 
actataggta 
ttttagaata 
aagcgtttga 
aaataaatat 
tatttggacc 
caaatgcata 
tgtggaaaga 
acacgtagac 
cctgagtata 
ggagtatgtg 
aaaggatgac 
acacttgagc 
gctttgtgag 
tttccattgt 
ttctaacctc 
aaagataaaa 
atgtactagt 
aacagtacac 
agaccaagga 
ggtagacaag 
tttttagtaa 
aagtacattt 
acactgtata 
tttgggcact 
tgcccagaga 
tgctctgata 
attaaattat 



acgctaatgc 
tctgttgcag 
cagttggaaa 
acttgatatt 
acccaaataa 
gagttaaagt 
cctgttctat 
tcttcttatt 
agctatcaat 
ttttgaattt 
gctctccaga 
ctacgttatc 
ttagacacag 
ctcatacaga 
tcaagagtga 
tcagacagca 
caagggaggg 
caaaggctta 
gtatgtaaaa 
aggtattacc 
agaacacgaa 
cttcagtatt 
agatttctta 
aataaatgtg 
tgttaacaaa 
atagtgtggt 
ttcaaagata 
tttttaagta 
ttaatatatc 
agattttaac 
agtgctcaaa 
agtagaatct 
tttctggatt 
tgttctgtcc 
catatagttg 
ttttttcaac 
cagagggtgg 
tggatttggg 
tgtaattgtt 
tgtcctttca 
agaaacttct 
tgtgattttg 
tcaagatctt 
atataacatt 
caaatttcag 
ttcctccact 
tcaaaataaa 
gt.tttcaaaa 
gagaaaaata 
cagctctata 
taatctatga 
tgatcaagtg 
atccttagga 
ggggtctttt 
gtatacttta 



catggctttt 
tacatctttt 
agggtatagt 
atgctgaaac 
aatgatactg 
gctttcctcg 
tttgtgtgtg 
gactatcagt 
tttgtggcta 
gtatggatct 
ctggagtaca 
tcctttccac 
actgacattg 
aaggggactc 
ccagaggctt 
agggagggga 
gacactctgc 
gttgggtcca 
tgtggatttt 
tagtggtaga 
gatgcacctg 
taaagaaatt 
cttctcatga 
acaaataata 
tgttatctat 
atgtttacta 
aaattaagaa 
aaaataataa 
atttgagtgg 
aaaagtacca 
atctttcagg 
ttgtaatact 
tcccagttta 
tctataaaag 
acccttgagc 
caagcgtgga 
acttttcata 
catgctcagg 
agttgtgtgc 
agatgaaggt 
ggtttaatga 
tgctattaaa 
ttaaaaatga 
gattagtttg 
tttgactaat 
ggcttaacag 
ttaagttaga 
ctaagaagcc 
tttttcaaat 
tgcctttttt 
gtttagaatt 
ttttattggt 

gggtgattcc 

taaaaggcac 
atgggaggga 



22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
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a€aaaatttt attggcatat attgcttgta ttaaggaaaa ttggttagaa attttgctaa 25 98 0 

ctcttctgtg agtttctcta aagacatcat aaaatgtttg tgattaagaa catttagagg 26040 

agtaaacttt attgctttat tttaaaatct agaaattgtt ttaattaatt ttctaataat 26100 

ttgtaccgct catcagcaaa acagggattt ggacatgttc atcaatgcct ccaagaattt 26160 

caacctcaac atcacctggg ctgccagttt ctcaggtaaa gacataccta gagaagaccc 2 622 0 

tgcaaatgaa ggtgtggtag attaagaaat gtaatatagg aattgagaaa gcgagctcag 2 6280 

gagacagatt ggtttgaaac ccacccttgc cacttactag ctatgagacc ttgagcaagt 2 634 0 

atctaaatcc ctctctaaac cttagcatta ttttattcat ctgtaaagtg aggataatga 26400 

tacttacctc ttagaattgt tgtatagatt aaattaggtt atacctacca gagcttgctg 2 64 60 

tggtgtcagg ctcagtgtgt ggttactacc ctagcccacc accaccattt ctgttcttgc 2 652 0 

tgtggccact ggcactacca tcattgtcta catccgtgct tcggaagtga aaaattcaaa 2 658 0 

tgattcgttt caataaatga aaacatttta aataaaatga gattttagta ggtacagaga 2 664 0 

aatgtaactt gggaattaca tcaagctcta aaagcacagc tcttgctgtc tgccttactg 2 67 00 

tgattcactg aagatctact gtatagaaaa tctaaagaaa taaaggatga aggccaagtg 2 67 60 

ctgtggctca tgcctttaat ctcaacactt tgggagactg aggcaggagg attgcttgag 2 682 0 

ctcaggagtt taagaccagc ctgggcaaca tagtgagacc ctgtctacaa aaaaaaaaaa 268 80 

aaaaaaaccc aggtgtagtg actcatacct gtaatgccaa ctactcagga ggctgagatg 26940 

agaggatcac ttgagcccag gggttggaga ttgcactgag ccatgatcgc accattgtac 27000 

tccagtgtgg gcaaacagag caagatccca tctctttaaa aaaaaaaaaa aaaaaagaaa 27060 

aaaggataat cactacttaa cttgataact caacaagtag atatgggttt gaaatttgtc 2712 0 

cattaaattt acttgcaccg tgctgttagg caagttactt aaggtttctg agccagtttc 2718 0 

ctcctgtata aagtaggata gtaaaaacac cgtcctggca gggcgtgata gctcatgcct 27240 

gtaatcccag cactgtggga agccaaggtg ggaagatcac ctgaggtcaa gagttttgag 27 3 00 

accagcctgg ccaacatggt gaaaccctgt ctctactaaa aatacaaaaa tcagccaggc 27 3 60 

gtggaggcac gtgcctgtaa tcccagctac tcaggaggct gaggtaggag aatcgcttga 2742 0 

accttgaagg tagaggttgc tgtgagccga gatcacgcca ctgcactcca gcttgggtga 2 7 480 

cagagtgaaa ctccatctca aaaaaaaaaa ataaaaaaac accttcccaa gtagagtgat 2 7 540 

gtgagaatta aatgagataa taaatgaagt actcaatata gtgcttgaaa tgtggtaaat 27 600 
ggtaactata ttttatcatt attactatta caatactggg tttttaaaaa tcaaaaacac . 27660 

aaagcaatga gattgatgca aaataagaat attgccttgt gcacgccact tacgtttatc 2 7 720 

atcttaaaac attgtgtaga atttgagaaa agttcagaaa ctctcaatga ggagggactt 27780 

ttaagaaaaa gtctgaatta tcagagtatt tggagaaagg caacatctcc aggcatgtga 2 7 840 

aagatttgca atgagccggg cggtggctca tgcctgtaat cctagcactt tgggaggctg 2 7 900 

gggcaggtgg attacctgag gtcaggagtt caagaccagc ctgaccaaca tggtgaaatc 27960 

ccgtccctac taaaaataca aaactagctg ggtgtggtgg ggcgtgcctg taatcccagc 2 8 020 

tacttgggag gctgaggcag gagaattgct tgaacctggg aggcggaggt tgcagtgacc 2 8 080 

caagattgca tcattgcact ccagcctggg caacaaaagc gaaattccat ctcaaaaaaa 28140 

aaaaaaaaaa aaaaaagatt tgccatgagt gtctcaatga agacgtgata atgtgggctc 282 00 

tagtcacagg gtctaactca gacatggaaa aaagtccatt tcattaatct ttatcggcac 28260 

ttgaattcct ggctaaggga gaatgtggaa cattgaagga ctctctggga ataggatgga 28320 

gttataccag attaggggga cttaaatact gtggtagctg gtggtagaag ggaggactga 2 8380 

gtgacccctt gaacccctcc tccctgctac agtgggttag gcagtgagcg gtacatcagc 2 844 0 

attactggca tgggagtctg gcgcattgcc aaggaggtgt aaaggggaaa tgcaaaggaa 28500 

ttgaagtggt gtgggcaaag tgaatgccag tgcttgttaa taggattcta gtggtatctg 28560 

tattttcatg atcatgtgtg tcacctgttt gggggtgggg caagggtgga agggagttac 28 620 

atggattcct ggtaaaacca tttt'tctttc tttctttttt tttttttgag acggaatttc 28680 

gctcttattg cctaggctgg agtgcaatgg cgcaatcttg gctcactgca acctctgcct 2874 0 

cccaggttca agcgattctc ctgcctcagc ctgctgagta gctgggatta caggcatgcg 28800 

ccaccacgcc tggctaattt tgtattttta gtagagacgg ggtttctcca tgtttgtcag 28860 

gctggtctca aactccgacc tcaggtgatc cacccgcctc ggtctcccaa agtgctggga 28 92 0 

ttacagatgt gagccaccgc acccggccag taaaaccatt ttggttaggg gcataggctt 28980 

gtatatcagc ctgcccagct ttaaatctta tctccatttc ttgttggctg tgtgacttga 29040 

gggaagttac ttaatttctg tgaacctcaa tttcccagtc tataaatgaa gataataata 2910 0 

gggcagctgt gaggattaaa tgagattgag ctcttaaagg ctattactgg aacacaggaa 29160 

atgtttgata aatgctattg tccttattat taatgaggcc agattctgtt ctccccctag 2922 0 
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c'cccccaaaa aatgtctctt ctctttcatg tttttctttt aacagctgga acccaggctg 29280 

gagaagagat gcctgttgtt tcaaaaacca acattaagga gtacaaagat agtttctcta 29340 

atgagaagtt tgattttcgc aaccacccaa atatcacttt ctttgtttat gtcagtaatt 29400 

tcacctggcc catcaaaatt caggtaagaa gaggcttttg gtctcatacc tgcaaaggtg 29460 

gtgaaatctc tttagtaaga ctaaatttac taatttggag cttgtggtaa atgagatgtg 2 952 0 

caatgtggct ttgcctttgt aacgtgtatg gcagaggagt gctgagcaca tgcatgctgc 2 958 0 

acagaagatt ggagtgggga tggactgtat cactcatgaa agacatttgc aaaagcactg 2 9640 

ttgaaagcaa gttggcatgt aacagatttg gtcattataa acgtattcac ttcttcagtg 29700 

agcatttgcc atgtgaaagc ctgtagggct acacaaagaa cttcaattct agagtaaggt 29760 

ggatgtaagt gaaacaaacc cacatataac aactgaaagc cagagtgtgg aaagtgacat 2 982 0 

gagatgatcc cagaaatgct atcaaagttt aaaggaggac aaatggggag actatgttga 2 988 0 

agaacatcag ccttctagtc agacagaggt ggattgattc ctggctccta tacaaatcat 29940 

acagcctttc caagtctcca gttctctgtg catgtgacct gaagggtagt tgtaaggggc 3 0000 

tgtgcagctg ctgcagtgtc ttgttagcct gctcctttcc tctgttccca gggggccagt 3 0 060 

gtactccctc ttgtccgaga cccatggccc cattttaact ttttatactc atgtcccctg 30120 

gggcctttcc tcaatacctt ctgcttctta ccttcttcat ttaggtgaat gtggaggtta 3 0180 

gggataggtg ggctttcaag gactggttca cctttaacca tggaagcatg gtcactggac 3 0240 

ggaggctgtt gctgtttgcc aatgttcaga agcataatca acctcagaag caagtcacca 3 0300 

caaacatatg aaaaaagttc aacatcactg atcattagag aaatgcaaat cagaaccaca 30360 

gtgagatacc atctcacacc agtcagaatg tctgttatta aaaagtcaaa aaagaacaga 3 0420 

tgctggcaag gctgtggaga aacaggaacg cttttacact gttggtggga gtgtaaatta 3 0480 

gttcaaccat tgtggaagac agtctggcaa ttcctcaaag acctagaggc agaaatacca 3 0540 

tttgacccag ccatcccatt actgggtgta tatccaaaag aatataaatc attctgtaac 3 0600 

aaagatacat gcacatgtat gttcattgca gcactattca cagtagcaaa gacatagaat 3 0660 

caacctaaat gcccatcagt gatagactgg ataaagaaaa tgtggtacat atacaccatg 3 0720 

gaatactatg cagccataaa aaggaatgag atcatgtcct ttgcagggac atggatggag 3 0 780 

ctggaggcca ttatcctcag caaactaatg caggaacgga aaaccaggta ccacatgttc 3 0840 

tcactcgtaa gtgggagccg agaacacatg gacacatggt ggggaacaac acacactggg 30900 

gcctattgga gagtgatggg gaggaaggag agcatcagga agaatagcta gtggatgctg 3 0 960 

ggcttaatac ctaggtgatg agatgatgtg tgtagcaaac cactgtggca cacgtttacc 3102 0 

tatgtaacaa acctatacat cctgcacatg tacccctgaa cttaaaaatt aacaataaca 31080 

aaaaaagcaa gttatcactc atcaattagg atgccttggg actgtgacta aagaacagtt 31140 

gatctttcca ttctggaaat atggaggaca aaaactatgt tgtagttttt ctcaccaccc 312 00 

tctctccttt ttcctgtctc catggtctaa gatttgttaa tcctctcatc aggttctccc 31260 

tttgccctcg catactccct gtgtccctcc tcagccagtc ttgtaagcat cagccacgct 3132 0 

tcttattcct gttttctctt gtctagtcac acatctgctc acaatggtct ggcctcttcc 31380 

ctcaccacat cctgaaactg tcctgtcata gtgggcccca gtgatggaat ttttcagtcc 31440 

cctttttatc tcattgcatg gctttgaatc atcatctttc ttgtcttcct ggatttttct 31500 

tttcctcgtt ttctggccag tctttttagg gaatcctctt cctcttcctt aaacactggg 31560 

cttgtctagg attcagacct ggtccttttc tctcttcact ctgtttcctg catgtgggac 31620 

ttgtggtgcc acatctattc tggtgattct cagtgctgtc tccagaccgg cactggcagt 3168 0 

tgctgcctgg acagccctac ctgggtagtc aaagctcccc gcgttcggtg catattcctt 31740 

cttgtgatct tagccccaga cttccatgct tcctttctct gtaaacagca ccaccatcac 318 0 0 

cctgttgtac aagctagggc ctgatgatca tttgcattcc tgggataaac ccaacttgag 31860 

catgacgtat tatttcctta ctttacacgt tgttggattc agtttgctaa tatttaggtt 31920 

agaattttgt atctatttca tgagtaagaa tgtctgataa ttttcccttc ctgtccttgt 3198 0 

tatggttttg atatcatgtt aaactaactt ttaataatta taatttgagg agttttcact 3 2 040 

ctttctcatc tctggagatg taagattaga gttaactgaa cctcaagtac ttggtagctc 3 210 0 

tctcctgtaa aatcatttga tcctagtgtc ttatttgtgg agatactttt tagctgctga 32160 

ttcagcttct ttaatagtta taggatattt tgaatttcct ctttatttga ttcacttttt 32220 

aatatttttt ctaggttttt atatcatgtt tttaaatgta tttttaattt tactttttaa 32280 

attgatacat aattgtacgt atttatgagg tacatgagat attttgatac atgcatacaa 32340 

tgtataatga tcaaataaga gtaattagga tatccatcac ctcaaacatt tatcatttct 32400 

ttgtgttgag tacatttcac atcttctagc tattttgaaa taatgaataa gttattgcta 32460 

actatagtca taatgactat tgtgccattg aacactagaa cttattcctt gtaactctat 32520 
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tt'ttataccc 
actctacctc 
tctttctgtg 
gcaaaggaca 
aacattttat 
cttgtgaata 
ttctttcttt 
tgagcagtct 
cagtgaacta 
ataatagcca 
ctgatggtta 
ttggacaaat 
gttgtttgaa 
atattttctc 
ctttttggct 
atcttacaca 
tctagtagtt 
tttgtatgtg 
tcccagcacc 
attgaaaacc 
ctggtctgtg 
gcatattttg 
tttggttggc 
tggtggatca 
ctctaataaa 
ttgggaggct 
gattacgcca 
agaaaaaaaa 
tttaggattg 
ctgtaggtag 
tgtggtgtct 
ttttccttgt 
gaagttttta 
tcgttgctcc 
ttactgagtt 
atatcaactg 
ctttcttttg 
atgctcgcag 
tcacagtaat 
aaatgtggta 
ataaaatgtt 
aaccttgaaa 
ccacttaaca 
aggttaccag 
agtttccctt 
attgtgaatg 
gttatatctt 
atagacaagc 
atttagaaaa 
ttaggtacag 
taggctgatc 
gatttgattt 
tcttgactcc 
ggtgcacacc 
agggtctcac 



attaacctct 
catgagatca 
cctggcttgt 
ggattccatt 
ttgtccattc 
gtgctgtgat 
tggatatata 
tcatactgtc 
gtatcgtctt 
ttttaactgg 
ctgatgttga 
ctctattcag 
ttgctggtat 
tcattctata 
tgatataatc 
aaacatcttt 
ttatggtttt 
gtgagagaga 
atttattgaa 
agttggccgt 
tatctgtttt 
aagtaaggtg 
tgagtacagc 
cctgaggtca 
aatacaaaaa 
ggggcaggag 
ctacattcta 
aaaaaagaaa 
ttttttctat 
cttttggtag 
ctttcaattt 
aaagagggct 
aaaaatctat 
tatatagaaa 
tgtcagttct 
caaataggga 
cctagttgct 
cgtctgtgtt 
caaaatgtgg 
tgtatgtgtg 
attcagcatt 
acatgctaag 
tgaaatatct 
cagctagggg 
tggggtgatg 
taattattgc 
actacaataa 
ttattctgaa 
agaagaataa 
taatcaagac 
tataggtaga 
tgtttgtggt 
tgggctcaag 
gccacgctag 
tttgtcgccc 



atctccccag 
gtttttttag 
ttcacttaac 
ctttttgtga 
atttgttgat 
aaacatttag 
cccagcagcg 
ttccatagtg 
tctctgtatc 
gatgagatga 
gatttttttc 
atcatttgcc 
attctggtta 
agctgtctct 
ccatttgtcg 
gcccagacca 
aggtcttata 
ggggcctagt 
gaacctgtcc 
gaatatgtgg 
tatgctggta 
atatgatgcc 
agctcgtgcc 
ggagttcgag 
ttagccatgt 
aatctcttga 
cc ctgggcaa 
tgctttggct 
ttctgtgaag 
tatggttttt 
ttttgtgacc 
ttcacctcct 
cgagaatggg 
tgattctgat 
tagtttggtg 
cactttgact 
ctggtcaaat 
catgaaaccc 
aaacaaccca 
tgtgtgtgtg 
aaaaaggaat 
tgaaataagc 
agagtagtac 
gagcgggaaa 
aaaaagcttt 
cactgatttg 
aaaaagtctt 
atttatatga 
aatgggaaga 
attgtttcag 
tcccagaaat 
gtttttttga 
taaccctctc 
ttccagctga 
aggccaaagt 



ccccccagca 
ctcccacatc 
ataatgacct 
ctgaataata 
ggacacgtag 
agcatgcagt 
ggattgctgg 
gctgtactaa 
ctcgccagca 
tttctcatta 
atatgcctgt 
catttttaaa 
taaatccctt 
tcactctctt 
atttttgctt 
atgtcctgaa 
cttaagcctt 
gttatttttc 
tttcccccac 
atttatttct 
tcatgctgtt 
tccagttttg 
tataatccca 
accagcctgg 
gcagtggtgg 
acccgggagg 
acaaagcgag 
atttgaggtc 
aatgtcattg 
ttcacaatat 
tcttcaattt 
tggttagatt 
attgctttct 
ttttgtgtgt 
gagcttttag 
tcctcctttc 
atgttgataa 
tttgttgtat 
aatatctatc 
tgtgtgtgtg 
gaaattctga 
tagttgcaag 
acagattctt 
tagggaatta 
ggaagtaggt 
tacacttaaa 
taaaaatctc 
aagggcaaaa 
atcaacctac 
gtagaggggt 
agaaccacat 
gacagggtct 
acctcagcct 
attttttttt 
gtagtggtgc 



gctggtaacc 
tgagtgagaa 
ccagttccat 
tttcattgta 
gttgattcca 
atctctccag 
atcatgtggt 
ataatttaca 
tctgttattt 
tggttttgat 
tggccatttg 
tcaagttttt 
gttggatgga 
gtttcctttg 
ttgttgcctg 
ggattttccc 
taatccattt 
tgcacatgga 
tgaatattct 
gagttcttta 
gtgggtacta 
ttctttttgc 
gcactttggg 
ccaacatggc 
gtgcctgtaa 
aggaggctgc 
actctgtctc 
ttctgtggtt 
gtgtagagat 
taattatttc 
cttacatcag 
tattcctacg 
tgatttcttc 
tgattttgta 
ggttttctgt 
agtttggatg 
cttgttgatg 
tccaatgttg 
agtggatgaa 
tgtgtgtgtg 
tacatgcgac 
aggacaaata 
agagacagaa 
ctgcttcatg 
agtgatattg 
aatggttaaa 
agcaagatgt 
gttctagaat 
ccaatgttaa 
agacacacag 
aaatatgtcc 
cactatgatg 
cctgaataac 
tttttttttt 
agtcacagca 



accattctgc 
aacatatcta 
tcatgttgct 
taatatatac 
tgtcttcact 
tatactgatt 
ggatctatta 
ttcccaccag 
tgtcttttta 
ttgcatttgc 
tatgtcttct 
ttcctattga 
tagtttgaac 
ctttgtagag 
tgatttcgac 
aatattttct 
aaatttgatt 
tatccagttt 
tgactccatt 
ttttgttcca 
tagctttgta 
tcagaaatgc 
aggccgaggc 
aaaaccctgt 
tcccagctac 
agtgagccaa 
aaaaaaaaaa 
ccatacaaat 
tacattggat 
agtccacaaa 
tgttttatag 
gttgtttttg 
ttttgctagt 
tcctgcaact 
atataagatt 
ccctttattt 
ctgtcctcta 
atagcattat 
tggataaaca 
tgtgtgtata 
aacacaagtg 
ttgaatgatt 
agtggattgg 
gttattaaag 
gttgcacaaa 
gtgattttat 
ttttgtagat 
agctataaca 
ggcttactat 
atcaatggga 
aactgattag 
tccaggctgg 
tgggattaca 
tttttgagac 
cactgcagcc 



32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
34080 
34140 
34200 
34260 
34320 
34380 
34440 
34500 
34560 
34620 
34680 
34740 
34800 
34860 
34920 
34980 
35040 
35100 
35160 
35220 
35280 
35340 
35400 
35460 
35520 
35580 
35640 
35700 
35760 
35820 
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Ccaacctccc 
gtgcatacca 
agatgagatc 
tgatcctccc 
gcccccagct 
ttttaacgaa 
ctcacatctt 
actataaaac 
agctttttag 
cacataaaac 
ctatggactg 
tataataact 
aagacatgaa 
atgatcggca 
ctttgggagg 
ccaacatggc 
atgcctatag 
cagaggttac 
ctgcctctca 
gataacacta 
ggagagagtg 
agtcactctg 
agcaattgca 
ctgtacaccc 
caaaactgga 
atacacacca 
acaacaactt 
ttatatactg 
acctatgcac 
ctgatctcta 
gagtagggaa 
gaattctccc 
ccgatccgaa 
tcctgacact 
tgattagggt 
tgtactgaca 
gctgacaaat 
acttttccca 
caatgccgag 
tggttttaga 
aattttaagt 
atcttttctg 
cttgtgaaaa 
caggtaagtt 
actgtgacta 
ctggttctct 
tgtcttcctg 
tttttcccca 
agcaactgtg 
gtctaaatag 
aagccgtttt 
tttttaaatg 
agattctcct 
ttaaaaattg 
taaagcaaag 



tggctcaagt 
ccatgcctgg 
tccctgtgtt 
gccttggtct 
gatttttgac 
tagtgctaga 
atataagaat 
ttttagaaaa 
actcgatact 
caaaagtttt 
ggagaaaata 
ctcaaaactc 
tagacattta 
tcaataacca 
ccaaggcagg 
aaaaccccgt 
tcccagctac 
agtgagctga 
aaataaaaat 
cacacttatc 
gggaaagatt 
gaaaatactt 
ttcctaggca 
gtgtttatag 
aacaagccaa 
tggcctaata 
tgataaatcc 
catgattgca 
atattgaata 
gaccctgttt 
gatacagagc 
tgctgtaggc 
aaaggttgag 
attaagatat 
tctagtaaat 
tacctctcca 
gattaatact 
gtctgtgcct 
tttttttcct 
gctttcattc 
ttcataatca 
gatcactaac 
taaacgatgg 
ctattgcaaa 
ggcccgtgtc 
ctgctttggg 
ggtaaataca 
gagacttagc 
caaattcagt 
tgcagtgtag 
gcctttgatt 
agattttaaa 
tttttaaatg 
agattctcct 
taactgagga 



gatcctcctg 
ctaattcttt 
gcctagtctg 
cccaaagtgc 
aaaggccctg 
gcagttggac 
taacgcaata 
aaatatagga 
aaaaacgtga 
tgctctgtga 
tttgcaaacc 
aacagtaaac 
accaaagagg 
ttagagaggc 
cagatcacca 
ctctactaaa 
tcaggatact 
gattgtgcta 
aaaataacca 
aaaatgacta 
ggatcactca 
tgtttcttaa 
tttatcctag 
cacttgaaag 
atattcctca 
cccagcaaca 
caagcaaatt 
tttatataac 
agtatcctga 
taaaggagct 
ggtgcccatc 
tgcatttgga 
ctctaggtcc 
agattatata 
aaaatttatt 
tattctcaat 
ttttagattc 
ataagcaggc 
cccagttccc 
tagagggagc 
aggagatgtt 
ttaaattaag 
catgcccacc 
cgtgtctttt 
accgtgtgtt 
gaccgtatct 
tcatgttctc 
ttttctaggt 
aaaaatggag 
gccattcagt 
ctcatttttt 
aattgagatt 
agattttaaa 
tttttaaatg 
aatctaattc 



cctcagcctc 
tttttttttt 
gtctcaaact 
taagattaca 
caaaggcaat 
aaccataggc 
ttgatcacag 
gaaaatcttt 
ttcataaaag 
gaaaccctgt 
atatttctgg 
aaacgaacaa 
atatacatat 
caggtgtggt 
cttgaggtca 
aatacaaaaa 
gaggcaggag 
ctgcactcca 
ttagagaaat 
aaattaaaaa 
tacattgttg 
aaaactaatg 
agaaatgaaa 
caatatttaa 
acaaatgaat 
aaaagaaatg 
ttgctgattg 
atttgaaaat 
agcttcaaat 
tatctgatta 
aagaaggtgg 
atagagcttg 
cagtcagtat 
tatttgttag 
tatttactca 
ctaattgaac 
ttagtgaaaa 
ttttgtgctg 
tattgctcca 
tcttccccac 
gattttcttc 
atccacttat 
atttagtttg 
tgctgtttgg 
ctcagtcctc 
ggctgtgcgc 
aacaagggct 
gtgtagcatg 
agacttattt 
taaatttcag 
aaattagatt 
ctcctttttt 
aattgagatt 
agattttaaa 
tctacttgga 



ccaagttgct 
tttttttttt 
ctcaaactcc 
gacatgagcc 
tcaatgaaga 
aactaaagga 
atctaaatgt 
gaatcccaaa 
gaaaacttga 
tatgaagatg 
gaaagcactt 
ttcagttaga 
ggcaaataat 
tcacacctga 
ggagttcaag 
ttagccaggc 
aatcgcttga 
gcctgggcaa 
gcaaattgaa 
tagtaataac 
gtgggaatat 
ttatacttat 
accagtgttc 
cagctttatt 
gcttaaataa 
aactgttgat 
aaaaagaaaa 
gaaaatttta 
ttctctgact 
ggtcaggcct 
aaatcttggg 
caggggtttc 
aatttcagtt 
gttattttca 
ttttaagttg 
atcgtgcttt 
tcttgactta 
gagatggcca 
ccctctgaaa 
acttacttct 
ctggttttct 
gaatgctgca 
ccaagctttc 
tacaaatgac 
gcatgagagc 
tatgtggagg 
gcatgtttct 
gctcttaaat 
aaggaagaca 
tatggctttc 
ttaaaaattg 
aaatgagatt 
ctcctttttt 
aattgagatt 
aatttgtgta 



gggactacag 
tttttggtag 
tgggctcaag 
gttgcgccca 
aaggatagcc 
atctctaaat 
aaaacataaa 
acgaaacaaa 
taaactggac 
aaaaggcaag 
gaatctagaa 
aaatgggcaa 
gacataaaag 
aatcccagca 
accagcctag 
atggtggcac 
acctgggagg 
cggaatgaga 
agcacaatga 
accaaatgtt 
aagatggtac 
catataaccc 
tcacaaatct 
tataatagcc 
atgatgctac 
atacagaaac 
tcttagaaat 
gatcttatat 
tcttaggact 
acccacacag 
ggcattctta 
ggtagtttct 
tggcaacatc 
tgtaaacact 
agattgtttt 
gttgcttata 
tgctgctgac 
tacttgttgc 
gtcactgaag 
aagaagtaat 
ggtgctcctt 
ttcaggagca 
cttgaaaacg 
tgcactattc 
ctggttgaca 
gctgagctcg 
ttattgctgt 
acacacattt 
gt agggaaat 
gatgcaggca 
agattctcct 
ttaaaaattg 
aaatgagatt 
ctcatttttt 
agcattgggg 



35880 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320- 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
38580 
38640 
38700 
38760 
38820 
38880 
38940 
39000 
39060 
39120 
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ctttaaaaat ggtaatacta aactcaagtt ccttcactct gaaccaatcc caggattctt 39180 

ttccataatt aggaacagat caaaacctat aactgctgag ggaatttgaa actgactata 3 9240 

aaaatctttg tagatgcagc taaaaaacaa acaaaaccca cagactgaaa agtcctgttt 3 9300 

ctgggagggt ggcgggtgtg ttggggccgg ttgtgcctgt tttttgattg gttgttcctg 39360 

tgggtgagca cccttccctc tgcccctgct ttctgaaagc tatcctagtt gttctcccca 3 9420 

gtaagccggg gctcagccgg cttcagctat gtccatagtt tgggggcgtg ggcggtaggt 3 948 0 

attcaagata gaagatggtt ggacagaaag tgtgtgaggt gggaaggtgg agtggtgagg 3 954 0 

ctctggctcg ttcccgactc tgcaacagag gttgttactg taatgtggag ctaatataga 3 9600 

agtctgcgtg ctgggtgttt catctactgt tttatgtaag aggccagtgt tcacaagtaa 3 966 0 

aggcaacaca gaattggtat tgagaactgg gcattcattt agattttaga gttctgttta 3972 0 

tgggtgctaa caggatcgct atactgagca agggatgtgt gaaacttttg ttcacttgga 3 978 0 

tgtgcttttt ttcttataac agctttattg agacatcatt tacataccat acagttcgct 39840 

catttaaagt acacaattca gtggttttca gtgtattcac agaagcgtgc agccacccgt 3 990 0 

cactacagcc aattttagaa tattttcacc acttccagaa gaagccccta gcagtcagtc 3 9960 

ctcattcccc cctcttccgg caacccccag tctgtatttc ctgtctctgt ggatttgcct 40020 

attctggaca ttttatataa attgagtcac atgatatgct ctcacttagc atactgtttt 4 0080 

caaggttcac ccgtgtttag tatgtttcag tacttattcc ttgtcatagc tgaataatct 4 0140 

tccttatatg tatataccac gttttgttta tccatttttc tatcgatgga tgcttgggtg 40200 

gttttcactc tttggctatt atgaatagtg tcagatatac ttttaatgtt tatatttatt 40260 

gtataataac agtgcatatc cgtacctcct ggtgttaatt gtctgtatct taagagtttt 40320 

aagaattata tggttgtgct gtaattctag cacttaggga ggccgaggcg ggcagatcgc 40380 

tttagctcag gagtttgaga ccaggctggg caacatagta aaaccctgtc tctactaaag 40440 

tacaaaaaat tagctgggcg tggtgaagtg ttcctgtagt cccagctact tgcaaggctg 40500 

aggcacaaga atcacttgaa cccaggaggc agatgttgca gcgagccaag attgcaccac 40560 

tgtactgcag cctgagcgac agagcaagac tcctgtctcc aaaaaaaaaa aaaatttttc 4062 0 

tttatgatga aacccacaag gctgtacaag aaccttaaat atagtttgaa acaaagaaag 4 0680 

acacatgctt ttaagtacag tttaattctt ttcttttctc ttatttcatt tttttttttt 40740 

ttggagacag agtctcactc tgtcgcccag gctggagtac agtggtgcaa tcttggctcg 4 0800 

ctgcaacatc tgcctcctgg gttcaagtga ttctcgtgcc tcagccttgc gagtagctgg 4 0860 

gactacagga acacttcacc acgcccagct aattttttgt attttagtag agacagggtt 4 092 0 

ttaccatgtt gcccccatca ttctcagcaa actatcgcaa ggacaaaaaa ctgaacaccg 4098 0 

cttgttctca ttcataggtg caaactgaac aatgaggacg catggacaca ggaaggggaa 41040 

catcacacac cagggcctgt tgtggggtgg ggggaagggg aagggatagc attaggggat 41100 

atacctaatg ttaaatgacg agttaatggg tgcagcacac caacatggca tatgtataca 41160 

tatgtaacaa acctgcatgt tgtgcacatg taccctaaaa cttaaagtat aattaaaaaa 4122 0 

aaaaaaagaa aaatgtgtta ttcctttgtc tcgaaagtcc cctgcttttc agcttttgct 41280 

ttggttgtct cttgtcagct ctttaatgct attgttccct tgggtttcat ctggactctt 41340 

tttccctggg caatctcatc cacaccatct gggaacattg cctttggctg gacaggtaaa 4140 0 

tctttgtctc tagcagactt ctctcctgtg atctccagag ccaggaaccc actagacttc 41460 

cttaaaattc attcagccag cagactttct ggcccagagc agcagctgtg agtggacagc 41520 

aggcacctcg ttagaagctt ggtgttggag gtgtgcccat gcattcctgg gagccatgag 41580 

a 999gagggg agtgccctgt gtcacctgag tccaagttaa gagagatcac accagagctg 41640 

attcctaagg agcatggggg cgggggaagg gggtgttcgt ggtggggaaa ggaaatgacc 41700 

agccctttca ggaactacca ggagtttagc agagctggtg acagtgctat gagcaagaag 41760 

gtggagagct ggggagggcc attatgaaga gactggagat tgcaggctgt gaacttgagt 4182 0 

cctgatgtga ctgggaattg ttgagatgtt ttaggccgaa aagtcaccga cgatcagtgt 4188 0 

ttagaaagac cccactggca gcagttggga gtgtggattg gaaggagcca ggcctaagtc 4194 0 

ctgggaggac ttaggaggtc caggagtcca gggagtccag gtggaattct tgtgtagaaa 42 000 

cccagggaag gggaccatct gatgacacac atcccatagc aacttccttt cctcctctgc 42 060 

acctccacag cagagtgtct gtttccacta tacgaagatt tcttaatggc aaggatcgca 4212 0 

tttttatccc tcagtgtcct cagtgtcatg tgctaatcag ttggagacac tccctaaatg 4218 0 

tatgccagat gagtatatag ttgttagtat aagggcttta cggtaggtat cagaactttt 42240 

tcttttttac aacttttata acttttataa aggagccgac acctttttct gatattgtca 42300 

gaacaattgc tgaccagtgc tatgggctgg ctgaggttgg caaccaagaa tctgttatct 42360 

cagtctacac agcgtggagt tagaagattg gattgttgtt ggttagaatt ttgtccagag 4242 0 
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acjttagaaga tagaagaaaa gggagggctg aggacataaa catccaaatt ttacaaagaa 4248 0 

gaaagttcag tgctcctgtc tatattgaaa tagttaagag ggaaatgatt gggtctgggg 42540 

aatgcttcaa aataatatga tcatagggaa agtaagtgta gtgtggatga aacaaggttg 42 600 

gccttgagct gataatttac taaagcagga tgatgagcgt gtgaggttca ttatagtgct 42 66 0 

ctttctactt tttaaaaagc ttaagatttt ccaaaatgaa aaagaagctg agcatggtag 42720 

ctcacgtctg taatcccagc actttgagag cctgagacgg aaggatcact tgaagccagg 42780 

agttcgagac cagcctgggc aatatagtga gacccccatc cctactaaaa aagaaaaaaa 42840 

attacacaat gaaaagaata atacaggtgg tagaagaaga aataaaggta tgagtatatg 4290 0 

atttggcctc ctacagacag ctacaggagc atgcctctgc tgggcgggcc tggagcatag 42960 

tgcaccctgg agctgaagtt ctgttcgcct ttacaggaag tacacagcat tgtacggggg 43 02 0 

ttttaccagt taacagtgga cagtctctag aacactcagg ggtcaggggc aaaactccct 43080 

ctccctcaaa ccagcccacc tgtcttggtg agaatggaaa acggagcttc ccagcaagcc 43140 

tcctgccttc accccgacgc acaggtgcat cagggtctga gggctcctcc ctgtggggcc 4320 0 

tgggaagccg cccttcagtg ggctatggct gcattgtctt ctgatcttgt agctgtgtgt 43260 

gggtaacagg cacatcgtct ccccatatcc cagagagtca tgtccacttc ctatctgtgg 43320 

tggcatcttt ccatcactgt caccctcagg ataatggcac aggaactaac agaggcacta 43380 

agagcagtga tatatactgg agggaggggc ggggctgatg accaaaatca ccttctgttt 43440 

cagtaagagg ttggttaaag tgatgcctaa aatggataga tcaggaaata acaataaaaa 4350 0 

cattaatgat aaaaaggcaa ctgccagaag aactgaaaag taacctagtt caaggtgggg 4 3560 

gtccctggag agaagccttc agaagagcag aggaagagca gggaacattg cttttcatga 4362 0 

ctagcctttc agtgcctttt gatatttcag ctaagtatgt agttctctaa tgatttctaa 43680 

attgttaatg agtgtgatag aataggaatt ctggagaatg gtcaaaagga attgctagga 43740 

gaaaattagg gaaagaaagc acagagcagt attgaggtgg gaggttggga gcagacagat 4380 0 

ggtgagtgtt cgtttgcatg gcaggaatga gtggctgaat ggcattaact ggccatgggg 43860 

ctctgaccag agcttgggta tacagtttct gtgccagcac aaaccagcat taggtgcgtg 43920 

gtgtgtgagg ccgggtgatg gtcatatgga tctggtccag agctgtggtc ttgacataga 4398 0 

gttccatgta tgaactttgg tggtggtggt ggttttgttt cgttttttaa gaaaaaaaga 44040 

agagttgttt tatgtcacag ccaaactcaa ggaggtgaca gcacaagacc caggttagaa 4410 0 

tgtacttagc cagggctcac ggaatcttcc atcctttgca aagccaagaa gagtagaggc 4416 0 

ctcaaggaga atagaggagc acatgggctc cttctgggct gagctctcac ccttggtggg 4422 0 

gccgctggtc ccagggggtc gttgggagcc tgcggagatt ctgcccagta catacatacc 4428 0 

ttgcgttttc tcggagcctg acaggtgctg ccttgaagcg gaaagacttc acctcacagg 4434 0 

gtggccagct gtcctttctc ttcctctccc tgtcctgtgt gctcatgata atggatagat 4440 0 

agacatctcc atggtagaaa taatgcagcc ctttctgcag tccatggttc cacagcatat 4446 0 

ggtcagtaga cttagatgag tgctacataa agtgtaatta gttctgcacc acggtgtctt 4452 0 

accttctgaa aaggaaactc tggtattgac ttagaaatgc cagttttatc cttccgacca 44580 

gaacgcaagt cctgaatgag gccagaggcc aatgaaaggt cctttcttct ctcttttcaa 4464 0 

atctgctgta ttaaacagtg acttttccta gtcattcagt attcattcat tcatccattt 44700 

attcaagttt ttgagtacct tctttgtgga tggcacatag tagatgcaaa aatacaatca 44 760 

tttttaaaat ggaaagcatt tgttatgtgg tagtggaata taagaaatgc catgcttcca 4482 0 

tatttttatg gtaatcttgg actcaggttc tacccaccat acttaattag aaattcattc 4488 0 

tcagcctctg ttttggcaga gacccattct cttccctcag gaacatacag aaacctcagg 4494 0 

gctgaactga gctcagaggt tggggaaggt cccctggtct gatcttcagt cgctgactgc 45000 

tgcgcccgat tccctggccc ctcactcttg agggcagcat cctcactctt gtgtggcaca 45060 

ggcttggaca tagaatgaca cttaagcacc agagcctccc tagcctgcca caaggacacc 4512 0 

ctacattccc aggcactttc agaatgcaaa acctcccctc cattccctcc ctttccctgc 45180 

ctccaggttt gggccctgcc tctccttttg tttcaaagca ctgtcctctc ttcctctacc 45240 

cttatccttc cccatctccc tcccactccc caaaagagcc aaactaatta tggggattta 453 0 0 

accttccaat taaaggcgag gatttttttt tttcaatgca gttcctgaaa tgctaaagaa 45360 

attgtccagg gaaaggagca caggaggagt tttgttttgt tttgttttgt tttgttttgt 45420 

tttgttttgt tttgttttga gacagtctcg ctctgtcacc caggctggag tgcagtggtg 45480 

cgatcttggc ttaccgcaac ctctccacct cctgggtttg agcaattctt gtgcctcagc 4554 0 

cccccaagta gctgggatca caggtgcaca ccaccacgcc agctaatttt tgtattttta 45600 

gtagaaatgg ggttttgcca tgttggccag gctggtttcg acctcctggc ctcaagtgat 45660 

ccacccacct cagcctccca aagtactggg attaggcgtg agccactgca tccagcctga 45720 
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tgtttaaata 
gaatcactgg 
ttgtaagcta 
gctcttgtat 
aagttaggaa 
tgtctttaca 
tacatgcagc 
catgattttt 
agtgtatttt 
ttggacaccc 
tgagggagcg 
ggaggcagtg 
aggctctctg 
acttttggct 
caaaccctgg 
cagaaggccc 
gttgtggaag 
tgtttggaaa 
ttgtattaaa 
ccaaatctat 
tctttaaaaa 
tatagttaca 
aaaaaatcag 
catttattca 
tggagatata 
ttttccataa 

gggggtgtgg 

ccttcacttg 
tcccaaagtc 
ctttatccct 
tggttttgtg 
ggattttaca 
tttgttaatc 
taacagcact 
tcagtcaaag 
atgtttctaa 
cacaatcttg 
agtcttgggt 
tggacatgaa 
ccagtgtggg 
caccagataa 
agtaaaagta 
aagattgaaa 
gtgttataac 
tatctaaaac 
agctaggact 
tgtatgaaat 
aaaaacttca 
tatttatgga 
tttctggccc 
taggaaagca 
ggattgtcac 
ctgagttccc 
tatcggtgat 
cgtgggctaa 



atttcttact 
aggagcttaa 
tccagagttt 

gggaggtagg 

ttggtatctt 
cagccatggt 
ccaggacggc 
ttattttatt 
atgtgtggcc 
ctatatagac 
tagaggagag 
atgtgctgag 
gtgacatctg 
gaagtgcaac 
ggctcactga 
atggggcaga 
tcgtaggacg 
cagagaagaa 
atagattttt 
ttcaccgttt 
tgaaaccaaa 
atcatgaagc 
atcttactga 
gcaaatgttt 
tggtcagtgg 
gtaagtgtac 
cttcagttgt 
agacttcaca 
ctcccttaat 
tcttcagttg 
cgagcagttc 
gttggttggc 
aagtttatgc 
ctccttcatc 
gttgaccaac 
gtggtcagtg 
taactatgga 
accagccata 
catggagtgg 
taaaggcggg 
gtcctcagac 
tctcatagat 
tccagacagg 
ttggattctt 
ttttatgtga 
ggttattgtg 
tgatgtaact 
aattccaagc 
atgtcaaaaa 
ctgtgagtct 
gatgctcgtc 
cttggtcagt 
taaaaaaaag 
agtaaggaca 
tatccctgaa 



tcagagtttc 
agatatagag 
gagagccatg 
aagttacctt 
catagacaga 
atatggtaga 
tttgaatgca 
tttttttaag 
aaagacaatt 
cataattatc 
cctgccagag 
ccagaaagga 
ccgtcaggta 
attagtgatt 
gcttccgagt 
tacaccttag 
tgaattctag 
tgacattatg 
afcataggaaa 
ccataataaa 
aagtatggga 
aggtatggtg 
gttctgtgag 
attaagtgcc 
gataagggga 
aaatacgggc 
gcacatggtg 
gcaggatgta 
ctcgtttaga 
ataactgaag 
tccaacagcg 
attgtatgac 
cttttacatt 
ttcagaagtg 
aaagactgac 
ctttcataaa 
gacttagatg 
ccaaaagctg 
tcctgcttca 
ggctgccctg 
agtgccctta 
tctttaagtg 
gctggggtgg 
cattccccaa 
ggaccatggc 
ataaatgaag 
ttgtatttct 
cccataatat 
gaaaataggg 
ggctctaaaa 
atttaccttc 
aattgtttcg 
caggcagcgc 
caattctcgt 
atcatgatgc 



ctgaagtgtg 
acagggatag 
gcctagttct 
ttcattcatt 
agaatgcagg 
ccaagtttgt 
gcccaacaca 
ctcatcagca 
cttccagtgt 
acacgccctg 
tttccagtgg 
cagggaggac 
cagtagtaca 
gaggaagtgc 
tcagagtttt 
acatattgat 
gtttcttgga 
agttttattt 
ttttcattac 
ttgtgaaaga 
aaatacttta 
ctgcttcatg 
acagatcatt 
aattaatgta 
agtctctgtc 
tatgaattaa 
cccagaaatc 
aaaagcaaca 
gaattatgaa 
ggctaaatct 
cttccactaa 
agtctgtgag 
gagtaatttt 
ttcagtttga 
cgcagtgggc 
cattttcacg 
ataaagattc 
ttttatatga 
gcaagaaatc 
aggagtgttg 
gcatccagtg 
taaacaccga 
actctaatct 
aatgaccttg 
tcatattccg 
gaaaagaaaa 
gattataaat 
attatggcct 
ccatcaaaaa 
acagtgcaag 
ctgtgtttgt 
tttatagtac 
accacaacat 
atttaatcaa 
atcttatggt 



gcctatggag 
ggcctgagat 
tggaatacaa 
cttctacaac 
aatcctcttc 
ccaacccatg 
attcataaac 
ctatcgttag 
ggcctaggga 
agagggaagc 
ggccggggat 
ccccatgcac 
gtagtagagt 
aggtccgtaa 
tcagaattga 
gaaagtggag 
gtgtggagta 
ataaaataag 
cttcagtatc 
aagatttgat 
attttagagt 
agccaagttc 
atttttctta 
tcagtggctg 
ttcaatcctc 
atagagcaag 
tgatagctcc 
gccaggaaac 
acacttccta 
gccaagtctt 
aatcttgtct 
tagggacagc 
taattacctc 
acaataagtt 
ggggtattca 
ttatgacctt 
aactaagagg 
ggttctttat 
aggctaataa 
tgtggcacag 
gtttttggca 
ggacacactt 
gtttccttct 
tttatataat 
cagctttcct 
gaaagtgggt 
ggtagccaga 
gaatagtgtt 
agcaaacagt 
gggttggggt 
catgagaatt 
cattgtctct 
ctggctagta 
tttggaggta 
cagtgactta 



actagcatca 
tctgactttg 
aagtgtatct 
agtaatttaa 
tcaggatgtc 
gcccgcgggc 
tttcttaaaa 
tgttagtgtt 
agccaaaaga 
agatcaatgg 
gcttaaaaac 
acaggcaggg 
gagcggaagc 
tatcttaccc 
tgaacacatc 
ttggaattgg 
tggtgtctac 
aatttcttct 
agaaaggc cc 
tgggatattg 
taacaacatt 
ataaaagaaa 
ttaacatatt 
ctttaggcac 
acagatctta 
ataaagagat 
atcatggagt 
tccttagttt 
tggcttcatc 
cctttgccaa 
tttacaagat 
ccaatgtatg 
gtgtaatttt 
tgtcatcttg 
gcagggttag 
tgcctcctca 
atccctttaa 
gattataatt 
ccattagcat 
ggcgcatgaa 
tctagcaggg 
aacgttttaa 
ctctctttat 
tctccagtct 
tctctcaagg 
ttctgtttat 

gggaggacac 

ggaatttatt 
ggaagtctat 
ttggtgggac 
gtctgtactg 
ccaaagaggg 
gcacgtatat 
tgcatttttt 
aaaagcatta 



45780 
45840 
45900 
45960 
46020 
46080 
46140 
46200 
46260 
46320 
46380 
46440 
46500 
46560 
46620 
46680 
46740 
46800 
46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
48480 
48540 
48600 
48660 
48720 
48780 
48840 
48900 
48960 
49020 
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tgtcatagtt 
ccaatgttat 
attaactgta 
tagcaaaacg 
tgagtacata 
agccatgaat 
agggagcgcc 
"cagaactggt 
gctatgggac 
aaaaagtttc 
agaagcgtga 
acagattgcc 
cttcaggtaa 
ttaatagagt 
aaatttactt 
tctagcatga 
gcatgtatcc 
agaatttgga 
cttatgcttg 
tttttagtgc 
tttatgaaaa 
aacagaaagc 
tgttttgttt 
gtcctactgt 
acaccacaaa 
atggctaaat 
ttgaacagtg 
aatgtagtat 
tgatttatca 
aagtatcatt 
ctctcttggt 
ccaagtgcac 
tcctcctata 
ctctttttac 
tctgtgagct 

gggggcaggt 

ggtcacttga 
caaaaaatac 
aggctgaggc 
caccactgca 
aaagtctgac 
atactatttt 
ctctttgtat 
agttacagaa 
acttctagag 
tcttctctgt 
gttccattct 
cttgtgtttc 
ttctcaaacc 
tttttttttt 
cttggctcac 
cccaagtagc 
cagagacggg 
cacccgcctc 
aatgtatata 



aaattgacag 
tttagagttg 
taataaaatt 
cgaggcacaa 
aatgttgacc 
tggtactgag 
aggcttcatg 
gtgaagctta 
tggcaactaa 
cttaacttct 
atccgtggtg 
ttctctcagc 
ttttgcttat 
atcagctcct 
catatgtgtt 
caaagtgaaa 
tgtattaatt 
ctaaaacaac 
ttcaagagat 
cctgggattt 
tactatagac 
agcaatagcc 
tttacttctc 
agaagatgag 
agcatacaaa 
cagtgatccc 
attacactac 
ttgccatgtt 
ctatactctt 
gtactgtgac 
ttgaatttct 
ttgttcctga 
ttgaagtctc 
tggagcacat 
ctcctgactg 
gctatggctc 
gctcaggagt 
aaaaattagc 
aggagaatgg 
ctccatcctg 
tacagaattc 
ttctagtgtc 
gggatccgtt 
acatttgcaa 
tattgataat 
cagtttcggc 
cttgttcact 
ccctcactca 
atacattgca 
ttgaaactta 
tgcaagcaac 
tgggattaca 
gtttcaccag 
tgcctcccaa 
aatgaaatca 



tgtttttcct 
atgaaatatg 
atttaagttc 
ttttttgtaa 
aaatgaagag 
gcaaacaagg 
gaggcatgcg 
gaaggggctg 
acaagtaaac 
ttaaagtgta 
gtttcatgac 
acagcaattt 
acagactctc 
aaaagataaa 
cctgaaaagt 
acgtttggtt 
ttgcatgcat 
atttaattaa 
gattctggtg 
tcttttattt 
aaataattga 
tgccttgcgc 
tgtgtttaca 
gatatactgt 
agccccatct 
tatctatgtt 
cacatttttt 
ttggcttttt 
gctgatgtac 
agaactgaga 
ctcggagttg 
ggcctgctgc 
tgtttcccag 
tctctagatt 
aaaatatatt 
acacctgtat 
tcgagaccag 
tgggtgaggt 
cttgagccta 
ggtgacagag 
taggttcaaa 
cagtataata 
ttctctctta 
atcctatgtg 
attatctctt 
aagttgtata 
gagaacaaga 
caacctcctg 
tttgctattt 
gtctctctct 
cgccgcctcc 
ggcacgtgcc 
gttggccagg 
agtgctggga 
caacataaat 



aaattataca 
ttgcattttt 
tgatgacgta 
aaaggacaat 
cacagaatgg 
agagctttgt 
agggtctcca 
caggaggctg 
taaaggggtg 
agctctgttt 
ctgcataatc 
tatggacctg 
tctgttcttt 
aagaataatc 
tagatgtcag 
tgtatgttct 
cctatagata 
taatggttac 
attatccaaa 
ccatagtttt 
aaaagacaaa 
cttccccacc 
cttggttttt 
gttacatggc 
ctttcattta 
atggccataa 
gggtagaagg 
aattgtttga 
ctttagaaca 
aatctcacaa 
cccaccctag 
acaactgtca 
atcccatgtc 
tatgagaaaa 
tattttatcc 
ttctagcact 
cctggacaac 
ggcgcaagcc 
ggaggcagag 
tgagaccctg 
atcattttcc 
ttgagaaggc 
aaaagaaaag 
tgtagctcag 
gttcacccag 
tagttttctt 
gttaatatta 
cctcttcccc 
ttgaaatgca 
gtcacccagg 
cgggttcaag 
accacgcatg 
ctggtctcga 
ttacaagcat 
tttaagatga 



cagaataatg 
atattcttgg 
gtgtcagatg 
atcttgcagt 
ggctgcccac 
gtgtgggtgg 
gggtcaaaca 
tgctaaagcc 
aggtactttt 
tcagcttaac 
acactacctg 
gtacagttct 
tgatttaggc 
ctgtgtttcc 
ttacagtaat 
ggcttccagc 
atttggctct 
tgatgtgagc 
gttttaaagc 
tttcgtagtc 
aaacgtgctg 
ccacgtcccc 
cagttaagtt 
ctcccctagc 
ataccattgt 
aactcttatt 
atcattcttt 
tttttctatg 
gtaaaaaata 
tggtcacaca 
aaccttccca 
tttggaaact 
ttcaccttcc 
gatgcctggg 
tacaatttga 
ttgggagacc 
atggcaaaac 
tgtagtccca 
gttgcagtga 
tctcaaaaaa 
attagaattt 
tagtgctgtt 
tatatttaag 
tgaattatca 
aacacttcag 
ctaatgatcc 
ccaatactct 
aaaggtacca 
tatgaattat 
ccagagtgca 
caattctcct 
gctaattttt 
actcctgacc 
gaaccaccgc 
tttggttgat 



cctttttcag 
tatgaagact 
atgttcagtg 
cagtacacac 
ctggaggcag 
aggcgcaggg 
gacagaacct 
tcagtgtatt 
ccttaagtga 
tcttgaaatg 
tttttctgtc 
tcgtgacttt 
actagatcca 
accacagatt 
ataaactcaa 

ggggggcttt 

tcatatgggg 
tacacatatg 
tgaaagtgat 
taatttfcatc 
cagtttttaa 
caatttaaat 
ctcttttgat 
atctcccctc 
atcataatgc 
cacagctgca 
gcttatccta 
tgcctacccc 
aaataaaata 
caggaaggtg 
cctcctgccc 
tcctttgcct 
tatccattcc 
agataacttt 
ttgaatttct 
aaagtgggtg 
cttgtctcta 
gctaatcagg 
gcggagattg 
aaaaaaaaaa 
agaagacgtt 
ataattccca 
tgtaatataa 
caaattgaac 
gcttcttatt 
cattccatct 
agaagttcct 
tttacctgac 
ttcttttttt 
gtgttgtgat 
gcctcagcct 
gtatttttag 
tcaagtgatc 
gctcagtcaa 
gaagaaaata 



49080 
49140 
49200 
49260 
49320 
49380 
49440 
49500 
49560 
49620 
49680 
49740 
49800 
49860 
49920 
49980 
50040 
50100 
50160 
50220 
50280 
50340 
50400 
50460 
50520 
50580 
50640 
50700 
50760 
50820 
50880 
50940 
51000 
51060 
51120 
51180 
51240 
51300 
51360 
51420 
51480 
51540 
51600 
51660 
51720 
51780 
51840 
51900 
51960 
52020 
52080 
52140 
52200 
52260 
52320 
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dtgaaaactt gtgggttgca gttaaagttg tgcttcgagg ttatagctgt aaatctgtgg 523 8 0 

ctctccacct tagctgcacc tagaatcact tgcggagctt ttgatgtcca agccactctt 5244 0 

cagaccattt gactcagaac ctctaggggt gcaccccaaa catttgaaaa ctctccaggt 52500 

gattccaatg tgcagctaag gttgagaatc actgctttaa aagcacatafc tagaaaagaa 525 60 

aaaacattcc tctcaataac tcagaactgc taattaaacc taaagaagta gaagaaagaa 5262 0 

aaatataaat aagagcaaaa attaatgatc taagggaaga agtacagtag agaggatctt 5268 0 

aaaaactgtc agttggatct ttgaaaagag taagaataat attgacaaac ccctggcaag 52740 

attaaccaag gaaaaaagag agaaggtata tggtatatgt agccaatgtc agaaataaaa 528 0 0 

aagggggcat tcctccaaat cccgcagacg tttttttcct ccagaagggt atttgatgaa 5286 0 

caactttatg tgaatgaatt tgaaaatgaa acagatggaa tgggaccatt aaaagaaaaa 5292 0 

tatgcaactt gccaaaactg acacagaaga agtaaggagc ctgaatagtt ctgtatccat 52 98 0 

ttttaaaaca acactaattt aagttttgtc atagtgaaaa ctctaggccc caggtggctt 53 04 0 

cattggtaac ataacctttt taggccgggt gcagtggctc acgcctgtaa tcccagcact 5310 0 

ttgggaggcc gaggtgggtg gatcacgagg tcaggagatc gagaccatcc tggctaacac 53160 

ggtgaaaccc catctctact aaaaatacaa aaaattagcc aggcgcggtg gtgggcgcct 5322 0 

ataatcccag ctactcggga ggctgagaca ggagaatggc gtgaacctgg gaggcagagc 532 8 0 

ttgctgtgag ccgagatcgc gccactgcac tccaacctgg gcaacagtgc gagactccat 5334 0 

ctcaaaaaac aaaaaggaga atactgtgat catcttaatc aatggaggag ataaagcatt 5340 0 

tgataaaatt taatacccat tcatttttaa aaaaatttct tagaaaacta gaaagagaaa 5346 0 

cttctggaac actgataatg gaacaacata aaaagtcaac agcaaaccac tttcttaatg 5352 0 

gtagaatgtt agaagtttga gtttgggaac aagacagaga tccactttcc acagtttcaa 5358 0 

ttactcaggg tcaaccgtgg tccaaaaaca ttaagtggaa aattccagaa ataaacaatt 5364 0 

tataagtctt aaattgtact ccattctgag tagtgtgatg aaatgtcgtg ccatcctgct 5370 0 

tcgtcccacc tgggatagga atcatccctt ggtccagtgt atccacgccc ctatgctacc 5376 0 

cacccactag tgtcttggtt atcagatcta aaaatcatag tatatgtagg gttcagtact 53 82 0 

atccatggtt tcacgcatcc actgggaggc tgggaafrgta accccacaga ttaagcagga 53 88 0 

cactatattg ccaaatccaa aggatacttt ttggttctcc tcttcttgac ttctcggtga 53 940 

tatcaaccaa ctgactgctt catgagcgag actcttccag ccgtctctga cccatgctgc 54000 

cctactgctg cttcttagcc tgcctattat gcaatctttt ctattcagcc tctcactgtt 54060 

gattttttcc cagaccttga ccctggccgt cttctcactc tgtgttctct gactggactc 5412 0 

tgccttcccc atctgcagca cgacatgccc agaagtgacc aattccatcc ttcctcaccg 54180 

gtgttcccca cctcactaac tggcaacatg gctctgctgg ttccaagcca cttctcttgc 5424 0 

tttcactcag caggcactcc cttaccaggc cccagagctg tggccgtcag tgcactctcc 543 00 

acctccactc cccaccccac caaccctagc atcacagtct cgcctggacc ttttcacttg 54360 

actttggccc ctccagaatc cacaaaaagt ccagagaaga cctttaaaga tcatgtatct 54420 

agccccttac cttcctgttg tatctaaaat aaaattcagc gtcctgggtg aactggcgtc 5448 0 

cgcttttcca tattcagctt gcactcttct cttccacatt tctgctacca ggaccatctc 54540 

ctatgtcttc aaaagcacaa gctctttccc tacactgagc atttgtacat gcttatccca 5460 0 

ttttctggaa cactcttctc actacccttc attctgtata ctttcatacc ttgcttgatt 54660 

tttccagaca acaacaaatt ctccaattct ctgacaccaa ctcagtgtct cacagttcag 5472 0 

ttccattcag acactaccca tagttggagc aaatcctgca ggttaggggc tcagtcccac 5478 0 

aagactgctc ccacttcaga cactaagcac aaatgggtcc ccaggctacc tcactgctac 54840 

tgggccaact acaaattcag ggcttctggt gactgctcac tccaagattt gacaatttgt 54900 

tagaacagct cacagaactc aggagagcac tatacgtaga attacagttt tattatacag 54960 

ggtacaactc aggaaccgcc aaatggaaga catgaatagg acaaggcatg ggggtgggtg 55020 

agcacagctc tttcatgccg tctctggaca agcctcctcc ctgcacgtgg atatattcac 5508 0 

tatcctggaa gctcccccaa gcctcatcat tctgagtttt tttattgagg tttcattaca 55140 

taggcatgat ggatgaaatc actttctatt ggtgactgaa ctcatctcca gcccctttcc 55200 

ccttcttgga ggttggggga tggagctgaa agttctaacc ctctaaccac ctagttgggt 552 60 

tttctggtga ccagcccccg catctgaggc tctttaggcc cctctcccca fcgagtcattc 55320 

attagcacac aaaagacact cctgtccctc tgaaaattcc aaggggtaag ctaaattgtc 553 80 

tccccttgtt cctcactcct gtagaactgt attatttttc atcgtggtat tcatcacaat 55440 

ttgaagaata tattcatatg tttacttgct taatatctgg gtcggtagcc caagcttcag 55500 

999 a gcagtg cactgtctct cgttcactcc tatacacaca cagcttcagc atctagtaca 55560 

taatctagta agatatttgt gggaagtaaa cttagactct tcagggtaag tgcagagtga 55620 



23/189 



WO 02/083077 



PCT/US02/12063 



ggaggaacca 
agggatgtag 
atggaataag 
ctggacattc 
ttcaggccca 
cttgtggggg 
actaactagc 
aaaatatttt 
tttttttttt 
tcatagccca 
tctcccaagt 
tttgtagagt 
agtcctccca 
cctggtccaa 
tcctaagcac 
gaaattcgat 
actccgggca 
atgagatgct 
ctctgcttgc 
ctggtgttgt 
tagagggaag 
ccagggtcct 
gttggcttca 
aaataacaaa 
acctctttac 
attctcatct 
ccatatatat 
caggttagaa 
ctgcatgtct 
gcttccacag 
atgccattgc 
aacctgcact 
ccttcaccct 
tgctggcctc 
tgactgctcc 
ctccaagtct 
cctgctgtgg 
gtagcactta 
ccacctgctg 
ccatgagcct 
cagtcacgtg 
gagaccagac 
ggtgtgatgg 
tcgcttgaac 
ctgggtgaca 
ggcagtgtcc 
gggggggttg 
ctgcaccctt 
atcctagcat 
agttgggtgc 
gccagcaaga 
tggaaaaata 
cagttatcta 
ctctttgcca 
attgaaatcc 



taggcaccaa 
gggctgtatt 
atggccctgc 
agtccctggg 
tggcctcttg 
ttgaacttct 
gatgataaac 
cccaggacaa 
tttttttggg 
ctgcagcctc 
agagctggga 
tgggttctca 
tctcagcctc 
aagaatttaa 
tctgaagaat 
gattcaaatg 
cagaaaagta 
tttccagcct 
ttttctgtag 
caacagatct 
caggcacaga 
cgcctctcag 
tgctcttcct 
aatcaggatt 
tggttcttcc 
atctgtattc 
ccttatgcct 
tatctgccca 
aaaatggatc 
ctctggtttg 
acctctaatg 
ggctctttcc 
cacccttctc 
ctgaggctcc 
tttttccaga 
ttgttcaaat 
agtgccactg 
cctacctttt 
gaatgtaaat 
agaacaggac 
aggcctgtaa 
tggccaacgt 
cgcacacctg 
ctgggaggca 
gagcgagact 
tgatagcaga 
gaaggcaatc 
gtacccctta 
cttagagtcc 
actgaggtgc 
gagtaccttg 
ccctgtagct 
tttttttact 
tcactgtagc 
tctgcaacta 



cctctcaggg 
gtagctccag 
tgccaagtac 
cgaccgaatg 
tgtagtaaac 
ggaaaagaat 
ttttagctaa 
gttaaccatg 
atagggtctc 
aaactcctga 
ctgcaggcat 
ctatattgcc 
ccaaggtgct 
aatggagcct 
ggctcatata 
agaaaatcaa 
aactctaggt 
caggaaggag 
cttcttgtag 
tcactgaata 
ctgtattacg 
cagcagcggc 
tgtggatttg 
atgcgtggaa 
tgttctcctc 
actctcttgg 
tccaaatgcc 
ggacatctcc 
tactgatatg 
agccaccatc 
gttttctcag 
caatgccctc 
tctgcctttg 
ccaagccatc 
cgcacttttg 
gtcaccttct 
gtactcctga 
ggcatgctct 
tccatgggac 
ctgacccaat 
tctttgccaa 
ggtgaaacct 
tagtccaagc 
gaggttgcag 
gtgtctcaaa 
gaggcccaga 
ctattgaatt 
gggaggaggc 
ctctctccag 
cattcatgaa 
atgacactgg 
agcccctttt 
tgttaagctt 
ttttcaaatc 
ttaaggccac 



aaaaaagctt 
tagacatcct 
gtacagcctg 
acactat cag 
aaggacagat 
ccggaacttt 
gaatagagtt 
tctaaaagaa 
tgttgcccag 
gctggtctca 
gcaccatcat 
caggctggtc 
gggattacca 
ataaggaaaa 
atatatggaa 
tgtaaagtga 
cctgacacag 
cctccttaaa 
ctggagcagc 
cagaagggcc 
ggtggaccca 
aggctcttga 
ctgcctcctt 
gccaagaaag 
aacttctcat 
tgaatgtggc 
atgcccagcc 
cctgggctgt 
tccagattcc 
atctcaccta 
caaacgttaa 
ataatgcttg 
tctctggcct 
ccaggcacgg 
caggcgtcca 
gagaaagaaa 
accccttgct 
gtaatttgcb 
agtaatcttt 
taaagtattg 
ggcaggcaga 
cgtctccact 
tactcaggag 
tg age t gaga 
aaaaagacaa 
gatgaaggct 
gcctgttgtc 
agtgagcctt 
ccagcccagg 
gctgacccca 
cagagagect 
acaagttaat 
attcttcctt 
attccacacc 
tgggagttat 



gtgtcaatgg 
taaattcttg 
gtgaggaccg 
aataagtcga 
ggcattttca 
ggtttttcta 
actttcatgg 
tttatttagc 
gctggaacac 
agcagtcctc 
gcccagctaa 
ttaaacttct 
gcgtgagcca 
ggggaaaggt 
aaaagaacaa 
gttagaaaaa 
tctgttgaga 
gagggaaata 
aagtagaagg 
ctctcacctt 
agagaggttt 
ccaacttaga 
atgtttatct 
actgagaagg 
tgttagggtg 
caggtttatt 
cagacctggc 
ceggtagata 
aaagtgcttt 
gactgctggt 
atcacaacac 
cagtgcccca 
tacacacccc 
ccttgcctcg 
tgagccacct 
ggcccacaca 
cttagcatct 
tatttactta 
gctctgttta 
attaaaaaga 
tcacttgagg 
aaaaatacaa 
gctaaaagag 
tcgtgccact 
tgtagactgt 
ttccaccaag 
agggcaccca 
gagataaggg 
agggctgtgt 
tcctacaaac 
tatgtctaca 
gagtaaaaat 
ttaaccgcaa 
tggtttatac 
tttctcttaa 



ettgetttta 
gagcatgtgt 
gccacttcta 
ggaagtactt 
tcacttctcc 
gacgatgttt 
catggggaag 
tttttttttt 
agtggtgcag 
ctcacctcag 
tattttcttt 
ggcctcaagc 
ctgctcccag 
atccagataa 
aagtcggggg 
atatttccag 
ctgtgtccta 
agagactctc 
gactggggaa 
ccctgaggca 
ctgagtgaca 
agctgttgcc 
attagaataa 
ttattctcag 
gccccaggac 
gcttgaaata 
agccagcctc 
tctcagggtt 
tcacacctct 
ctccctgata 
tctgtttaaa 
catgatctgc 
actccaaaca 
ggctttgeae 
cccatacctt 
ctctgaccat 
ttttatttct 
ctgtttgtct 
tcagtgtttt 
caatgtaggc 
ttaggagttt 
aaattagctg 
gagaatagaa 
gcactccagc 
cagaagagtt 
tgacgatcct 
ttggaggagt 
cagttcagtc 
tggtgtcagc 
gcagagctca 
tgtagcactc 
gcagttatag 
gtcagctgtt 
atgagagagc 
tatttttaag 



55680 
55740 
55800 
55860 
55920 
55980 
56040 
56100 
56160 
56220 
56280 
56340 
56400 
56460 
56520 
56580 
56640 
56700 
56760 
56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 
57900 
57960 
58020 
58080 
58140 
58200 
58260 
58320 
58380 
58440 
58500 
58560 
58620 
58680 
58740 
58800 
58860 
58920 
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aaagttgttg 
tttgtaaaaa 
ttatgaaaag 
taaatatctg 
tatttttctt 
cagggctctc 
tggcttaccg 
atcagaaggg 
tgggcaacgg 
acagttgttt 
gttgggcctc 
agagcttttt 
gacagacaga 
agcatatggt 
gtgtcttacc 
cagaccagaa 
cgcgtgatta 
agcatctcaa 
aacttgagtt 
ataagcaaat 
tatcaaatgc 
gtatgtatat 
attctaagtg 
gggaactaag 
ggggaaagag 
gctcgggtga 
ccaaacacca 
aaataaataa 
gtattgcaac 
gttggcattg 
ctggggtttt 
atttagaaca 
ggatattcat 
ccacagtacc 
cactctttat 
ctgtattact 
attttcaatt 
cacagtaaac 
aggacattag 
gttcctccct 
gatggaaagc 
tgacaaaggt 
taattgcagt 
caaaaaaaga 
ccagaaatca 
tcacagtaaa 
actcatgata 
gctgacatct 
aactaatcct 
aggggactga 
cgaggcttta 
tacagtcatt 
gaagtgttgt 
caatgtagta 
aataacacag 



gcttgggttt 
tattgaatat 
ttgattgtta 
ctaacttgaa 
atttgggata 
tctgaatacc 
aagttgtctc 
agcccggagg 
tgggagctga 
cctctctttg 
cagacgtaga 
ttccttcttt 
catctccacg 
cagtagactt 
ttccgaaaag 
cccaagtcct 
ggcaccgcct 
aaatttttca 
ttatttttaa 
atttacagct 
ccatcaatca 
atatatattt 
aaataattca 
ccctgaggat 
tgggaggggg 
taggtgcacc 
cctgttcccc 
atattttaaa 
tttcacttgc 
ttgctaatgg 
tttggatcta 
gaagggaagt 
gtctgaatag 
accttttctg 
tattattttt 
gatatacttt 
taatttaaat 
cagttatcaa 
acccagggag 
gagtgcttat 
ttgaaagagt 
agcaccaaaa 
gtaatgaaag 
atatatcatg 
tttcgtcatt 
atcaaaaata 
aaacaaacaa 
gagcatttaa 
tatctcaaac 
ggctcagaaa 
actagggact 
cagctttgtg 
tactattgga 
gagatttgcc 
aagctaaaca 



gatcatttca 
acactaataa 
ctttaataat 
taaacagcat 
agccactgtg 
acatgggaac 
actcattacc 
tggcatgtgg 
ttctgggggc 
ctcctggtgg 
gaggtaagct 
ttctcttaag 
gaaggaataa 
agatgagtgc 
gaaactctgg 
gactgaggcc 
tcatgagaac 
ttattcaagg 
ataagtcttg 
gacaaattta 
atgagtggat 
atatacattt 
ggaatggaaa 
gcaaaggcat 
atgagggata 
agaatctcag 
caaaaaccta 
aataaccacc 
atcattactt 
aaatgctcac 
ctcatgcctt 
atgtaaaatc 
ggttaagaca 
acccacataa 
taaataagtg 
aaaagcctaa 
gtatatttat 
gaaaggagtt 
tttcaaggtt 
gagaaataat 
ccgaattgat 
tacaaagggc 
agtcacttaa 
tcaatgggga 
tattgaaact 
cctttttaac 
acaccagagg 
aaattccagg 
ctatgaaata 
ggtgaagtga 
tctgtcttca 
ctagatgttc 
aaggaaacaa 
tgcataaaaa 
atcaactgga 



aatgtgtttt 
ttacacacat 
taactttttc 
gactaatatt 
ggccattctg 
gatacagtct 
tcatgagccg 
gagctgtttt 
tgggataata 
ctgctgtggt 
tcagtgggta 
caagtgggtt 
tgcagccctt 
tacataaagt 
tattgacttg 
agagatccaa 
ttgtcatgcg 
aaagataaat 
atttgtttcc 
aataaataaa 
aaagaaaatg 
gcagcaacct 
accacacttc 
aagaatgata 
aaatactata 
aaatcaccat 
ttgaaataat 
attaaaataa 
tgcctttcat 
gtgtatataa 
tttctatata 
acaactcaca 
ggctccagga 
agatgatgtt 
gataacatat 
tgaattacgt 
tcacaatgga 
gaaaactttt 
gagatgagct 
tgtttctaaa 
tttatacaca 
agctgcatag 
taccagaaaa 
aaaaataaca 
caatatccat 
gctatgaaga 
ttgtatccag 
cactgagttc 
ggtattgtcc 
cttacaccac 
cagcccaagt 
tagactgtgc 
aattattatt 
tgaaggctta 
aattgtttgc 



attttactta 
ttcttctgtt 
accttcttag 
tgttataatt 
atgaaggcca 
agaggcctgg 
gcacattgtt 
cactgcctgg 
actcagtact 
ttggaagatc 
aagattaaag 
ttagctattt 
tctgcagtcc 
gtaattagtt 
gaaatgccag 
gggcgtatta 
agagggaaat 
gtgtgtgtag 
agagcttcat 
cttggttgca 
tggggtgtgt 
ggatggaatt 
atatattctc 
cagtggattt 
catcgggtac 
gaaagaactt 
aaataaataa 
ttacaacata 
ttagtcatgc 
gatctagaca 
aaaatgtatt 
agcactcagc 
tatgggcctc 
atcatagagg 
tcaaagggtt 
gatgttctga 
gtaagaaaga 
gagtgctact 
tatggagatt 
atctgtaaag 
taaatcttgg 
tgaatgtatg 
cctattagaa 
cctgaattct 
tcctaatttg 
atatctggat 
gctagttagt 
taagtggttt 
ctcgttatgt 
atagttaact 
tcgttatctt 
actaagtgaa 
ttcatataaa 
agtatatcaa 
agtgtgcatg 



aaaatatgta 
tgcctcttcc 
attttaccag 
atcttagaat 
agtggaaggt 
ttaggaagcc 
gtgacaaagc 
gcctgaggtg 
ttcctttctc 
aaacaaagtt 
aatccctgga 
agtgataatg 
atggttctgc 
ctgcaccaca 
ttttatcctt 
gcacagccac 
cagccattta 
ttaagttttt 
tccaaaagta 
aattacaaca 
atgtgcgtgt 
ggagaccgtt 
acttacaagt 
tgaggactcg 
agtgtacact 
attcatgtaa 
ataaataaat 
aaaactttgt 
ctgtggcatt 
aagaagctgt 
tatagataag 
actgttgata 
ttgttgtggg 
gaatgttgca 
ttttagataa 
aaaaaattat 
agtaagaagc 
ctctccaaga 
atgatagcct 
caaaaggaca 
taccaaaacc 
tgcagcagtc 
tatatcatgt 
tcctagtggc 
aaaacaaaac 
aaccagcatc 
tttaacaaga 
tacgtagatt 
tttatggatg 
aagaggcaga 
cgtgtaatcc 
agggcactaa 
gttattaaaa 
aaaataaatg 
agagaaagta 



58980 
59040 
59100 
59160 
59220 
59280 
59340 
59400 
59460 
59520 
59580 
59640 
59700 
59760 
59820 
59880 
59940 
60000 
60060 
60120 
60180 
60240 
60300 
60360 
60420 
60480 
60540 
60600 
60660 
60720 
60780 
60840 
60900 
60960 
61020 
61080 
61140 
61200 
61260 
61320 
61380 
61440 
61500 
61560 
61620 
61680 
61740 
61800 
61860 
61920 
61980 
62040 
62100 
62160 
62220 
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agatgcacag cctacaggaa aaagattaaa atatatatat gtgtttagta tggagttggt 6228 0 

catctaataa actatgcttt tacccgtgtg atttggaatt tctctgtaac tgtgaagctg 6234 0 

accattcaga atatatttca tcatttttaa aatgtcagcc attatttatt ccctctcaga 6240 0 

attgtagaag caatgatttg atgttatttt taaaaacaat agtagtagcg gctgacgctg 6246 0 

tttagtcctt gctacacacc agccccggtt ttaagtgctt ttacacatgt tgcctcctga 62 52 0 

accctgtctt attgtgaccc cagcacttaa gcacagaccc caaggagcat tgctgcatat 6258 0 

attagtcgag taaatggacc tacagcttta aaaaatgtaa gaatgttggt attttaagaa 62 64 0 

aatatttata aaaatgccct tctagaacag agccatatct atccaggcca agattctgtc 62700 

ttctacagta gttccaagga ggcagtttga agtaaggcct acttgtatcc caaggcataa 62760 

cctcaaagct tcagcatcat gcccagaaag atctctaata tccttaatga tatgtgaacg 6282 0 

gctgttcacc atgtgctagg cgctgtgctg tagtatctca tttgatccca gtgacaacct 6288 0 

tatgggctag gaacacattt cagagggagg acacggatgc ccagagactt caaacacctt 62 94 0 

tcttaaggtc acacagctgt agactattgg agcaggagtt cacatcaagc ctccctaaat 63 00 0 

cccgagcccc tgtttgtccc tctactctgc aaaatttcac atttgtttgt ttttagctta 63 060 

taccacctct ctgaaggaag ggttctttca gttcctatct gccgtataat gaggcatagg 6312 0 

tttttgttgt cctgtgaatc cttggtgcca gtttcagtag gaagaaagca agcctgactt 6318 0 

tagtagtaga aatacgagga gatagggcca caccatttta tgtatttgac aagtcgacgt 6324 0 

catagcatta aatccacagt acttttattt ttatctttat tttcttattt tactttaagt 633 00 

tctgggatac atgtacagaa catgtagatt tgttacatag gtatacatgt gccgtggtgg 63360 

tttgccgcac ctatcagcct gttatctagg ttttaagccc tgcatgcatt tggtatttgt 63420 

cctaatgctc tccctcccct tgctccccac ccctgacagg ccccagtgtg tgatgttccc 63480 

ttccctgtgt ccatgtgttc tcatggttca actcccactt aagagtgagg acatggagta 63540 

tttggttttc tgttcctgtg ttggtttgct gagactgatg gcttccagct tcatccgtgt 63600 

ccctgcaaag gatatgaact catttatttt atggccgcat agtattccat ggtgtatatg 63 660 

tgccatgttt tctttatcca gtctatcatt gatgggcatt tgggttggtt ctaagtcttc 63720 

gctactgtaa atagtgctgc aataaacatc gtacgttaat atgtatctct gacaggggct 63 780 

tttaaaaaat ataaggacaa tggtgttatc acatccaaca aaatttagta atccctaatg 63 840 

ttatctaata tcaagtctaa aaagtatcat tttggacttg atttgtttga atcagggagg 63 900 

atgaggtgtt tttaacatgt agcattaact aataaaaccc cttcagtact cattccatta 63 960 

atggctttgc catattattt atcagcaact tcttcgagag atgcaacaga tggccagccg 64 020 

tccctttgcc tctgtaaatg tcgccttgga aacagatgag gagcctcctg atcttattgg 64 0 80 

ggggagtata aaggtgagaa tgtgactcag aagtccctat aacttgactt tttaaaactt 64140 

aggctcctaa gtctgggaaa ccagagagag caaaagccct attccattac ctcctttttt 64200 

tcctctcttt ctttttaaca ataagctaaa acctgaagtt gctgggtact cttttgcttt 64260 

tgtttttcaa tctttgtttc atgtcgctga gcagtcttgt gttcagggga ttttaaaatc 64320 

taaatgattt gtctgctttt cttttaagta ttctgtggca tacaaaactt ttatttgaaa 64380 

agagctacat tgaacatcga atcattgttt ttttgtttgt ttgtttgttt gagacagagt 64440 

cttactgtgt cacacaggct ggagcgcagt ggcatgatct cagctcactg caatctacct 645 0 0 

cccgggttca agcgattctc ttgccccagc ctcctgagtt gctgggatta ccgatgtgcg 64560 

caaccacgcc cagctgattt ttgtattttt agtggagatg gggtttcacc atgttggcca 64 62 0 

ggctggtctc gaactcctga cttcaagcaa tccacccacc tcggcctccc aaagtgctgg 64680 

gattacaggt gtgaaccacc gcacccagcc tatttttctt tatattttgg ggaatttttt 64740 

aagcagcaaa atgataatac actgtgtatt aatactctgt ggaaaatatt tatcaccccc 64800 

ctccccaaaa tgatgaagaa aatggtaaga ctttctcctt gggcaaaatg atggaatatt 64860 

taagatacgc tggagaaata gctatgtatc ttgaataaaa catacttact taaaatgtta 6492 0 

ctgagctcac aggaagtagg taaaatctcc agggaatact ctccttccct accctaaaaa 64 98 0 

gaacaaacca taaaccaaaa ccagaaccat gatttgtcct aagtggttcc agatggtaaa 65 04 0 

cacaacgccc acctgaatag atggaaatcc tctctataga aaaacatctt caattcagcg 65100 

ctgttgaagt cccacagatc aagatgaagt aattaaagat caccagggac aagaggcaag 65160 

ccgccataaa taagagtagc agaaacaaca gatgafctgat gtggacagga ctccaaatgt 6522 0 

tagaattgaa atctaaagaa tataatataa ctgtgtatga aatgtttaaa gaaataaaag 6528 0 

atgaaatcat aaagatgaga gtcaagtatg gccaagcaga tctgaagaag aaccaaatgg 653 4 0 

aacttctaga aatgaaaaat atacttgttg aaatttttta aataaataaa ctcaatttac 654 0 0 

atgttaaaca gcatatcaga cataagagaa tttacaaagt ggaagatgaa tctgaaaaaa 65460 

gtacgcagaa tgcagctcag agagacaagg agatagaaag tacaatagat actaagaata 6552 0 
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tggaggatag aatgagaaga tccaactcat atatatttcc agaatcccag aacaaaggag 6558 0 

aggcaatatt cagaaatgat agctttccag aactgatgga aaacatgaat atacagattc 6564 0 

cagaagcaaa acatattgta agcatgatta aagaaagaaa gggaaggcag gaaggaaaga 6570 0 

aaacaagttg gatttctcat aatgaaattg tataactcca aagacaaaga gaacatctta 65760 

aaatcagcca gagaaaaagt atcacataca acgtggagaa actgaacgtt agtgcactgt 6582 0 

tgatggaagt ataaagtgat ggagccactg tggaaaacag taccactatt ccccccagaa 6588 0 

ttaaaagtag aattatatat gatctggcaa tctgggttta tacatagaag aactgaaagc 6594 0 

agagtctcaa agacatattt gaacatcagt gttcttggca gcattattca cagtagccaa 66 00 0 

aaagtggaag caggcccaag tgtccatgga tggataaatg gataagcaca acgtggtcta 6 6060 

tgcatccaat ggaatgttat ccagccttcc aaaggaagca aattctgacc catgctacaa 6612 0 

tgtgatgaac cttgaagaca tactaagtga aatactccag taacaaaaaa ataaacattg 66180 

tatgattaca cttttaagag atacacttgt ggtagtcaag ctcataaaga caggaagtag 6 6 240 

aacagtggtt cacaggggct gggagaaggg gaaaatgggg agttagtgtt taatgggtac 663 0 0 

gaagtttgag ttttacaaga tgaaaagagt tctggagatg gatagtaatg atggttgcac 663 6 0 

aacaatgtga atatacttaa taccactgaa ctgtacattt ttaaatggtc aagatggtaa 6642 0 

atgttatgtg tattttatca caatttttgt taaaaatggg aaaagaggcc gggtgtggtg 6648 0 

gctaacacct gtaatccgag cactttggga ggccgaggcg ggcggatcac ttgaggtcag 66540 

gagttcaaga ccagcctggc caacatggtg aaaccccatc tctactaaaa atacagaaat 66600 

tagccaggca tgatggcaca catctgtaat cccagctact tgggaggctg aggcatgaga 66660 

atcatttgaa cctagaaggc agagttcaca gtgagctgag attgcaccac tgtactccag 66 720 

cctgggcgac agaacaagac tctgtcgcaa aaaaaaaaaa aaaaaaaaaa aaaagggaaa 6 6 780 

agatcacctg caaaggaatg acaagctgac cgctaacttc tcagaagtca gaagagagtg 6 6840 

agagaatgtc tttgaagtac tgaaagaaac ctccccacct agaatagtgt gcccagtaag 6 690 0 

caccttccga gaacaaggat gaaataaaga tatcttcaaa ttagaataaa aattgagaat 6 6960 

ttgccaccag cagacctgca caaaagaaat gtctgaagga tgtgctaaaa tgaaggaaaa 67 020 

ttacccccag gaggatagac taaagtgtaa gaaagaatag tgagtaaata aaagtataat 6 7080 

catatataga aatctaaaca aacattatcc tggtaaaata atatttgtgg gttaaaaaat 67140 

agacaagcct aaagtattga acaacatgat atatgtcagc agcatatgac acagttagtt 6 72 00 

gctccttaaa acattttctt gattttcttc caggacacca cacacttgcc ttctagctgt 67260 

tcttgctcag tggtctttgc tcagtcctct tcatcttttg acagtggtgt gcgtcaggcc 67320 

ctgcctttgt ccccttcttg tgtcttttac acccacttct tcagtgatct tcactatcac 67380 

aggtttaagt accatgatcc tctcacctgg actgctccca tgagcctcca acacaaacat 67440 

cagttgccta ctcagtgtcc ccacttggat atcttaacat ggccatggcc aaactgagct 67500 

ttggatgttt atttctcaga cttgctcctc tgacatacct tccgcctttc agcacataac 67560 

acctccatca ccctagttgc tcaggccaaa atcgtggaat tgttcttgac tcattcttga 67620 

tgtcatgttc tcattggaca gccagtctat cagcaaatct tgactccgta ttcaaaaatg 67680 

tttcccagcc actgtgtccc atctctgcca ctattcccac ttgatctagg ccaccgctgt 67740 

ctctcacctg gattcttgca gttacttcat aactgatctc cctgcctctg ccattcaccc 67800 

atctagtcag ttccttgtgc cacagcagag tggtactgga aattattttc ttcagccaaa 67860 

ctgaagaaac tgatgacatt tgtgggtata gtcagcagaa ttctaaaggt ggccctcaag 6792 0 

atttccatcc cacgtttatt cagtcacaca ctactctagg tactgctaga gggattttgc 67980 

agatgtaatg aaacaaggga gttgacttta agatacagaa agtttctctg gctggtggca 68 040 

gaaagaggaa gccagagaga ttggaaacat gagaaggact tgacacacca tccctggttt 6810 0 

agagatggag tggccacaga agtttttttg tttttgtgag acagagtctg gctgtgtcac 68160 

ccacactgga gtgcagtggc atgatcatgg ctcactgtag cctcaaactc ttgggctcaa 68 22 0 

gggatcctcc ctcctgggtc tcccaaagtg ctgggattac aggtgtgagc cactgcgtaa 68 28 0 

gccctgaatg catttcaaag cagagtcttc cccagagtct ccaactaaga gtcccatcca 68340 

gctgacaact tgacttcagc ctgagagacc tcaggagata gctgataatt ccccagaact 68400 

catggaagac atgaacagac ctgtctctct cggcagagaa ccctacagag cttgctggga 68460 

cttctgacct ctaaagctgt gagataataa gtgggtgttg ttttaagcaa ctaagtttgc 68520 

agtagtttgt tacactaata taagctgggt gcggtaggtc acacctttaa tcccagcact 6858 0 

ttgggaggct gaggtgggag gatcgcttga gcccaggagt tcaagacctg cctggacaac 68 640 

atagtgagac cccatttcta ttctttaaaa aaaaaaagga aaggaaagaa aaagagaaga 68700 

aactaatata ggggacaaat ctcatctaaa ggctgcaaca gaagatgaaa acatattttc 68760 

cagagccaaa tgcatccaga gagtagacaa agaatctact tggtcagcca acccagcttt 68820 
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tacaagcttt tacatccagc aggcttgtct acttagtgtc taaaggacag acaaaataga 68880 

atgtttttta agtctccatg atatctgcat ctgtgtttga gccaggtatg cagaactatc 68940 

tgacaaatct accaagcact caccactgcc attcctgtga cctacagtta tgaattagaa 69000 

tttcaaatcg agccaaagta cagcatccag aagaaaccca cttggcgtgg ccccttccct 69060 

acagttcaag cagccctctg gaaggggact gcgagacctc agtggaaggt agcaagcagt 6 912 0 

accacccgtg tcactggtgc tcagagagta ctgttagctc tctcctgccc tttttcatta 6918 0 

tcttaaaaac tgcaatgtac cttttttata tacctgaaat tgatggttct tataagcaca 6 9240 

aatgaccgtt aagttgtccg tatacattaa agcttgagta tcacttaagg taattttgtt 693 00 

ttcttctgcc agactgttcc caaacccatt gcactggagc cgtgttttgg caacaaagcc 69360 

gctgtcctct ctgtgtttgt gaggctccct cgaggcctgg gtggcatccc tcctcctggg 6942 0 

cagtcaggtg agtagatgcg gtccagcgaa agacaccttc taagcatgtg agggagctaa 69480 

gcatgggata cttccctttc ctagagaaca aggttataaa ggtgataacc aacagtgcct 69540 

gcccatgcca tggcaaggat tcagtgagat aacccatttt cacctatatg cccaaaactg 6960 0 

ttttcccgag gtcatcgtcc cttcctgggg ggacgtcccg ggacctcctc accgttctgg 69660 

cctccccagc ctctcagtgc tcgtggatgc ctctgaccac accttccttt gctgtcggag 69720 

tttctggtcc ccctttgcct ctgctgctct ttccctgtct cttttgccct gtcctgcccc 6978 0 

tgaatctggg tgtgctccct tcaccctgca tgttcttcct cagttgcctt ctccactcca 6984 0 

caacttccag ccatctcccc tgggctggtg tttcttcaat tggtatttct agccctcctg 6990 0 

catttctttt gctttttcac tgttttaaat ataaggcaca aacacaacga taacataaag 6996 0 

gaaacgtctg aaaatccacg cacagttctg ctatcccaca acccaccaat tttcattgtt 70 020 

ctgtgttctt tccagccctg gccagcaatt tagttgaaat gattgtatag atctaatttt 70080 

tattctgctt ttttcatata actatcttaa acatttttat agcctactac ataatatttt 70140 

accaatgcat gtgacacttc tccatcaaat atatgaatca tgattatata aacttctatt 70200 

ttgaagacag aaaatatgga atacagtttt gaaagtatct cgcttaatta cagagtgctt 7 0260 

tggaggatgc ctcaatgtga tcattttaga caactttcag ttgagatgat tctggagtca 70320 

gggttacctt agccagagag caccctgtgt ctggtggaaa ggctgcccca ggatactggt 7 0380 

ccaggccctg agctccactg tcccttcagt gcaccctcag acccagaaga tcctccagcc 70440 

cagctttaga atggttagga tctatagtat cttcacgagg gaggcctttc gacatggctc 7 050 0 

ggccaggatt cattgtcttc atctcaccta gggattcctg tgagttgagc ccctaactcg 7 0560 

gccttgcctg tgttgaggac aggacaactc tctcgtctgt cactcaagtg cagtttgtgt 70620 

ctcctgccac cgcccagccc agcaaacagt ggctggtgtg ccctgagggc agggtggcgg 70 68 0 

ctggggcggc aaggtgacag gtgtgcagcc ctgtagcctt tcctgctcca gcctttgcct 7 0740 

gacatgcccc aaccacctga aacacctaat aaaggtctgc tcaatgcttg agacccaccc 70800 

aacagtgggg ccgtccttgc tgccccttct gtaccccctg ttttaagtgg tgcccttttc 7 0860 

taggctccca gagcattccc cactgtgtgc caggcactta cagctgtcta cccccacccc 7 092 0 

cactcccctt cacaggctca acaaggcaag taacacgctc cactaacaca actcctggca 70 980 

ggtctcagca gagctccgag cctgcgctgg ccatttccca cgtatattag gaatccagct 71040 

cccccagaac cgtgccttcc ctaatggatg tcagattact ccgttatcaa tacaggaagg 71100 

aaaatgggat tgcaagtccc caaagcaaag tgggcggcgg ggcagtgatg agcataaagg 71160 

gtagttgcta ttgatcgtgg gttgctgcgc gccagatact aagactttat gcatgccagg 71220 

tactaagact ttgtcagttc actcattgat tcttcacact agccctgtac tactattatc 71280 

cccattttac aaatgaggaa actgaggtgc agggaggtta aagaaccttg cttattccaa 71340 

agcctggccc cagagcctgc cctgcctgta tcgtggtatg gaatggaagt gactaatgct 71400 

gactgagtga taggggtgaa tctgaatctc gtcataaagc gtgctgctat ttctcggctc 71460 

cttttggagg aaggcaggag aaattgggca cttagcacac gtctacaatc tcatttatat 71520 

gttcccacca ccaagcagag cactgtgctt ggttgatgtg ctgaaaagca aactgcacct 715 8 0 

tttgtgtaaa aagtagggaa aggagaagaa tgtgtatttg tatttgcata agtataccct 71640 

caaagaatac ataaaaatta gtgaaagcgt ttacccagat ggaggggaag gcggagggag 717 00 

ggtcagggtg gaaggaagac ttttccctgt gtatgtgtac ctttttgggg gttttttgtt 717 60 

ctggtttttt ggtttgtttg ttttgcacca tgggattgaa aaataaatgt ttaaattatt 7182 0 

ttttaaaaca aaataaaagc cagctgctcc cagcatatgt tctctctgtg gttctcccaa 71880 

ggtcttgctg tggccagcgc cctggtggac atttctcagc agatgccgat agtgtacaag 71940 

gagaagtcag gagccgtgag aaaccggaag cagcagcccc ctgcacagcc tgggacctgc 72 000 

atctgatgct ggggccaggg actctcccac gcacgagcta gtgagtggca caccagagcc 72 060 

atctgcaggg aagggcgtgg cggggaaatg gctgtgcggt gcgggacgga agactggaaa 72120 
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ccctcaaagc atctgactca cctgcatgat cacaagcttt ctttgacggt ttctcccatc 7218 0 

cgtgttccag catctaacct tttacttttg cataggaaat acttgattta attacaggtc 7224 0 

cagggatgag ctgatggttg ctggaggagg ccagtgtaga gccagtgaga gaactaggaa 723 0 0 

tgacactcag gttcactgtg gaaaactgtt cttgggactg tctcaactgt gcaaaaaaca 72360 

aaagatggag tgtttacaag tagacattcg tcatcagttg ttcttgaaca tggtctttta 7242 0 

aaaactagtc agatgaatta acttgttttc atctgaagcc tgctatcttt tttaaaagat 7248 0 

gtgctattta ttcttgcacg atttaggcaa ttatctctct tccagggagt accttttttt 7254 0 

ctagttgaga attaataatg gtccatctct tttgatcata tcaagctagg atagaagggg 72 60 0 

ggctatttta aatgtcaagg tcagcagtgt tactttgaat gtaaactggt ataataggta 72660 

gttttctata gtaacttgat taatttagtc ttaatccatt tgaaactctc tcttcctttc 72720 

tctctgcctg tccctctcct tctccatctc accctccctc tctcacacat acacacacaa 72780 

acacatacac acaacactaa gtgcctagac tttaaataga tctagcaatt ggaaagttag 7284 0 

taagcctaag tttttacata attgcattcc tacattcttg taaaatttaa atagctacca 72900 

ttggcaatct gctttttttc taaaatctga tttgcagcca ggaaagaatt ttctcaccca 72 960 

aggaacattt gatctagcag cagggatgag aggaaagcag aaatgaatga actgtgaaag 73 02 0 

ctcctgtttt tattatcaaa aaggacactg tcaagaaggc gccccctgcc cccacccccg 73 080 

tgtcacccta ggcctgataa gcgatcagag gaaaggactc attcatgtca cgcttccttg 73140 

agcagaaaag agcactgaga gcacttggga cccctggatc agagagcatc tgtgtgtcct 732 0 0 

gcagcctcct ctgaacttgt ggttcattct caggctgggg tggactcaga tgccaggaaa 73 2 60 

gggacagcct cccattgtca ggcagaagct gcccaaagcc tggagaagga cttgtttgcc 733 2 0 

ctctttcccc caggaggggc tcgacccacc caccctccct ctcagaccaa ggtggtggct 733 8 0 

gtgaggaggg cagcaaatgc tgacaaggat gaaaagcaca tggaaaaaaa tggacgagga 7344 0 

gggaaaactc tgccaaatgg aaaatgacca aatttaagag ggtgggacag tcccctgctc 73 50 0 

ctctcccaga gggcactgct tggaaattgt gttttcccca tttatggtgc tctgtattct 73560 

ggcattatgc agcagcctcc cagaagctct cttctgcttc aaaacctggg atctctggca 73 620 

ttaccctatt gggatggacc gctggacagc aatgctcgag tttgtgaatt tggagagata 73 680 

ctcaaaagag ctaaaactgc agcattttac ctttaaatgc agtgcctaga gagagagtat 73740 

tgtctcttcc ccaacactaa ccccactccc atgaagaatt gcctggaaag atgttttcaa 73 8 00 

ggaatttgaa ccataaaaca ctatctgatg cacagaacac ctctactttg agactcacct 73 8 60 

ctcataaagc ttctttttca cattactgtt aaagaccaga cgttctagaa aagacccctc 73920 

ctctcatgag ctcccccatc cctgctacag aacacagcac ccatggcgcc tgcagtggac 73 980 

tggcccctta attcccacag gcccccccag caaggccaaa gggaggcccc tgggtattgt 74 040 

cctcctacaa ggaagatcct ctttgtttgt tcaaaggacc agttttccta ggccaaagaa 74100 

gtctcttccc catgttagtc ctatgccttg aaatatcatg caccatgacc cacagccatc 74160 

tggttatgtc ttattttttt cctaaaagat aatgtttatt tttaaaaagg aaggaaggag 74220 

caagtgaagt ttcattctgc tccagcggtg gggaagccgc tgaatccacc tgcttctcct 742 80 

ttgcaaccga cagcaaacag ctttctccgg cctcagggca gaaaaaggga atggcaggga 74340 

gtaagaggcg ctgggctcgg agcctgtttc caagaaggaa ttggttgtca tctggcagtg 744 00 

ttgcgcgtca caagagagcc tgtatataaa ttaaaatagt caagacaaca ctgaccttgc 74460 

acttgtacat aactatacag tagtgtccag aatgttcaga cattcggagt gtacataaaa 74520 

cagaaaaaat cttcatgtat ttttattaaa tataacaatg tctgagtttc acctaagatg 74580 

tttttgtgcc atatgctgga tatccaggtt ctcgccaggc cccgatacat gaataacaaa 74640 

cccaagaaac gcatccccat tgtgtgatgt gttcagatgc atctggcacc aattaggtat 74700 

ttcttaaaac aggactcatc tgtcagagtg cacatgaaaa atcaggcagg gaatcgaaac 74760 

gacagcgctg gaggagactc aggaagcaga ggcgtccctg ccgctgccct tggccctgca 74 82 0 

agcacatcat gaccctttct ggcagcctct tggtgctctg ggtagtgagg gatgaccagt 74880 

cttgtcctga gaaatgtttc tcttagtctt taagttcaaa gactaacctg tagcaatcag 74940 

actttccaaa agggggttct ccattttttg tagttttgtc taaattttta atgaccattt 75 0 00 

cctggaatca gtttattata ctgaaaactg ggggtgggag tagggagcta gtttgttgat 75 0 60 

aaatagttcc catttccccg tggagaattt gacataccct ggactcctgt gtgcctcctg 7512 0 

ccatccctgc acacagcctg gggagaagcc tgtgcctccc cgtgtggaga gaaggcaacc 7518 0 

ccagatcccc tgagctaacc cggaggaaag gcagtcctgg acagaagact gtcagcagaa 752 4 0 

ggaaagtact ggactacccg tgggtaagtc ctgccattca agactggaga cacctgggaa 753 0 0 

ataaaaagag cagggcactg ctggtgggaa gaggcatttt accttccagt gcaaatcctg 753 60 

ctcctttgat ttaatggggt gtactggggc caggggctga ttcacttcct tgggagatgg 7542 0 
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tgfgtgttttc atgaacatct ttgatccttc catttcattt attcatccat ccattcaaca 75480 

agtatttgct aaacactaac ttaagctaat gctagggtag tgactgagat gtaaaaatag 7554 0 

attttagaat taaaacaaaa tccaagtcct cacacccctg tcatcccagg agatctttcc 7560 0 

ttgtggtggt ttctgtgaga attggccatc ctgaggacac agccaggacg gcagaggcct 7 5 660 

cctggcctca gggcatgccc tgcctacctt ctgaaatgtt taccccattg accaaacttg 75 72 0 

gctccagcca ttgcggtggt ttctagatag ccaggcccac caagagatat tgccccttga 75 78 0 

tgagagtcaa acaccctgcc tacaaggaga tgttttgaaa tggagaggaa aattggcacc 7584 0 

tcatctttta aaggcagtaa tggaattgat tttcagtaac tgaatttgtg cacaaaacat 7590 0 

tctaaacact agtgaagcct gtttcgttga actaattaat tctggctctg gaaatgtttt 7596 0 

tgttttatag ttatttacga tttcgtttgt ttggattcaa gcttagtttg ttaatatgta 7602 0 

taatttagca tctattacac tcatgtaaat atggagtaag tattgtaaac tatttcattg 76080 

cggggattgt gggtgttata catacattta ggactgcaat tttttggtat ttttttgtat 7614 0 

tgtaaaataa cagctaattt aagcaggaac aagagaacta agggaggtct gtgcatttta 7 6200 

aacacaaatg tgaagaactt gtatataaac aaaagtaaat actataatac aaacttcctt 7 6260 

ctgaaataaa agtagatctg gtaaaaatgt ggcttttgtt ctgagtgttt cattttgatt 7 6320 

ttgcattgtt ttgcctatat ttacattttg gtcattagaa ccttaaggga aaaaaaaaag 7 6380 

cgtttaccag ttttcagact gcgtcagagg gagcacttga cagttaacgg aggtgagggt 7 644 0 

gaaaacccct ggcaggtacc tcagacagcc ctccaggcag cttgtgagcc acagaccttc 7 6500 

cacacagcct ttgtgtgcct acagggcagc tgggcccgtg ggggcaggga cagatctgag 7 6560 

cagtgaggta tggacagccg tctcactgct ccactaacca caccactgtc tccagtggtg 7 662 0 

ctgttatccc tccactttgc gctctcctca caggctcctg aatgtcctct aggttgtctc 76680 

caaactgcta aatctcaggt gtgctgacag gtctcagccg tggccagtgt tttcctcggc 7 6740 

cttgaagttt tgaggaagtt actcccattg gcaaggcctt gtttgggggc ctcctgtgtg 768 00 

tcctgccccc ttttatttgg ctcctgccct cagaatgggt cagagtcatg tatttcagtg 76860 

cagagtgcag attcccagtg ctagcagttc cttccagaag gatgaggtca ttgaaagaag 76920 

gaggaaacac aagacggtca ctcctgtagg ggaagatggc agggtactag ggccagggcc 76980 

tgaggtgaag gcaggagcct ggaggaggcc tgcctaccag agctcaggtc tggcttaggg 77040 

a ggg aaat gg ctggactagt ggctgcaaac aagttgagaa ggaccttgaa actcttccag 77100 

ggagggttat gagagccacc aacagtaaca ggatagaaac cctgtgattt ggtggtactg 77160 

tggcagagac aatgctgtgt ttaccaaacc tactgggccc acagctcatc atttcccagc 77220 

ctccctgcag ttaggttggg accaagctaa gccatggcca aggaaaggtt ggaagaagtg 77 2 80 

atgggcacca attccaaacg tggcacacaa atgctcccac aggggactct cctacggatc 77340 

acatgttaaa gactggaggg actgtgggtc cctgagtcac cccttagagg ccggccgcct 77 4 00 

gatcagtaag agctgtgctg gcctttgcat cactggaaag caaactcact ctgctgatcc 77460 

actgagattt ggggctttgt tacactagtt agcttttctt attctcacta atatacaaat 77520 

cagcactttg aagtgcagtg cacatgtaac caaaccctta agtacatggc actggctcag 77580 

cgtgtggcca ggtggcgagc agtgataaga caggtgcagc tggcaggaca actgaggtcc 77 640 

ctgccaggcg gagcaaaaca tctggtaagg ccgtttcact cgataactcg gaaaacaaac 77700 

tccatgtctg ctgacttttt gacttgaggg gaataaacaa caacaacaac aacaaatatg 77 760 

tgttgattgt tactggctgc ttttggcgag gcatcacgag aaaaagttta cttcaagaaa 77 82 0 

gaattagcca gtttacaagc agaagggaaa aagatggaaa gatgttaacc agggccttgc 77 88 0 

aattctggaa aagccaactg cttctagtct ccaaccagta agaggtgcag gtgagcaaca 77 940 

aaggccccat gagacttagt tgaacacagt gactgggcct catggtgacc atccaatgaa 78000 

gggtgtggtc ttcccactgg agcatggaaa ctgcaaagcc accagcttga gggaggggca 78 060 

cgtggttaag gaagcaaaga aaaagctgac atgagaactg tttccaggaa caaaccatga 7812 0 

atgtagtcac tgaccctgga cctgcctgga aataaataaa gatgtagtaa atttaagaaa 7 818 0 

ccattagcaa agccttaaaa aaaaaaaaaa aaaaaaaaaa aaacggtggc aattgaggaa 7 8240 

gtcttaaatc caccttcagg caggaaaatg gttgtgaaag ctgtgtccag cgagggcttt 7 8300 

acaagctcag atgtgggccc aagagcagaa ctgggcctga gcgaggactc cccccagccc 7 83 60 

aggacagggc ccagcaggat tgcaaaaaca ccgcagagca gcaattgctg catctgcacc 7 842 0 

cgctccttcc tgggtaggag ggcctaggag ggtgtcctgc ctgttctcat cactacctgc 7 848 0 

tgggtgggag gtggattaag ctgtcttgtt aattcaggat tgtggacttc aaagagccac 7 8540 

ctgagggagg ctgcatcacc tggaggtctg ggcggggtgt gacctcgtat tgtcccccta 78 60 0 

ggagagggtg cctgtgaccc atgtaggaga ggaagcacaa aactggtttt ggtgatcaga 7 8660 

atagcaaact gggccagatg cttaccacac atttcctctt cctgagtcca cagcttcgac 7872 0 
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tgcatggcct gggcccgtgg catctaccca gatgccttgg gaccagccct caccaatgat 7878 0 

gtgtcatttc aaggccattc acagtgacag tgtctcctcc actgtctcca tcaaccttga 78 840 

aggcttatgg tgaagatggt aatacaacag gattgaagga gactgggtcc ctgaatgact 7 8900 

ttgtggaaca gagcactata ggccaaaggt agtagaatta ggtatcttgt tttatgctgc 78960 

ttaagtgaat taattaattc tctagattta ttcacctagt tattcaacaa atttttattg 79020 

agcacatcag ggctagggca ttgtgtcagg atcagagatt ttgcaatgaa caaaacagaa 79080 

aactccatgt cctccagggg agttgataat aaaatgtcag ataagctctg tgaagaaagg 79140 

taaaagtagg gttaggggta gaaagtgctg gggggattgc tacttaacat agtgtggcca 792 00 

ggggaagcca ttctgctaag gtgacagctg agcagaacga tgagtggagg gggcaggaag 792 60 

gaatgtggag gtctgggaga agagcattcc aaacagggat ggcaggtgca gaggccctga 79320 

gggggcagca tgcctggtag gcctgctgga tatttctgag aatgaaaaag tccattccca 793 80 

gttgtttgcg tggagaacct aacccagtta taaacacaaa gtacttgaga gctggtttaa 79440 

tatacagcca gctttccagc agtgcaacta ctgtgtacat caagggaaaa ctgaacttcg 7950 0 

ttttccttaa aacttatcat cagctggtca tcattttgac aaattctgtc aacaacagca 79560 

gtgtcattcc tggcatctgt atgggtcacg tctgaacaga cacacgccct gcagccctgc 7962 0 

aggtaccagc tgtataacaa gaactccctt ccaccctgtg tcctggaaac aagaaagcca 7968 0 

ttagaccgga agatcccgat ggctatctca aatgtgctgg atggagttgc cagggcccac 79740 

tggcatgccc tgtaagcctt tccttccacg tttggttcct gccccttgaa gactccattt 7980 0 

ctgagtttgt gtgtgtttta ctttctagtg tgtgtcctca tcttaatttt tctctctctc 79860 

ttctgccttg aactgaaggt tcgcttgggt gtggagagac aggcccccag cagagcagct 7 9920 

tcccgagaca tcctccgatc cagggcttcc cagcagcccg gcaaggcagg gctgtgcctt 7998 0 

tctgcttcag ctcacaagca tgccaggctc actggcaagc tgctgtctgg ttgagggact 80040 

gctcctaaag ccctgcacag cccctgtcct cctggccctc tggaaattcc acccccgtgt 8 010 0 

ccacatttca tgcaaaaatg agctggttct gtgagcatgg cccggcctga ctcgcttagt 80160 

gggcggtaag tggtttccac ttcaaccttg cacctaatca ccgggctcca caccaggatg 8022 0 

gacattcatg agccgtgaag tttccagtaa taaatccaca gatgcttcca gcacctgcct 8 0280 

tttcgcatca cctccactcc cagccacctg ccaggcaaca ggtaacagag acccagtcac 8034 0 

aggagggcag tgtgggggca ggactgcagt ctcccaaagc ccatgcacaa aaccgacagc 8 0400 

gccctggcag gacaaggagg ctgacattca gatgtggagg aacaaggcat gacccattcc 8 0460 

tggtcatggg ggccacagct ggactcagcc ttgaggcttg gccagactta acaccgtgta 8 052 0 

taaaccagga cctttttagg tagagtaatg gaaaccaaac tctaatgatc ttagacagtg 8 058 0 

ctattagtct cctggagctg ccagaacaaa ttaccacaac ttcagtgtct tgaaacaaga 8 064 0 

gaaactgatt ctcacagttc tggagaccag aagtctgaaa tgcaggtgtt gccagggctg 8 070 0 

tagtctctgg agactctagg ggaatctgtg cctacctcct ccagcttcca gtggctcctg 8 076 0 

acattccttg gcttgtggct gcatcacccc aatccctatc tctgtcttcc cctggtcttt 8082 0 

tgctcaaaat gtctgtgttt agtttccctg tagacacctc tgcatcactc tcataagatg 80880 

cagaggtgcg acatacaggt gttgagagcc cacttagata atccaggata agctcctctc 8 0940 

aagatctgta acttggctgg gtgcagtgtc tcacacctgt aatcccagca ctttgggagg 8100 0 

ccaaggcggg aggttcactt gaggtcagga gttggagaac agcctgggca acatggggag 8106 0 

accctgtctc tactaaaaat agccaggcgt ggtggcacac acctgtggtc ccagctactc 8112 0 

aggaggctga ggtaggagga ttgcttgagc ctgggagttt gaggctgcag tgggctatga 81180 

ctgcaccact gaattccagc ttgggtgaca gagtgagact gtctcaaaaa aaaaaaaaac 8124 0 

ataaaacata acttaaatca catctcttgc cacagaaagt aatactcttt tgcctacata 813 00 

taaggtaata tttacaggat ccaggggtta ggatgtggac atatctttgg gaccactgac 81360 

agccatgaag caatctcata attttcaaat aggttctgtc ctttttatct ttccagtctt 81420 

ttggaaagca tatgcctata ttttcaatcc acaattctat ttttatttga ggtcatttca 81480 

tttctggttt ttatttttta ttgagacagg gtctcactct gtcacaggct ggaatacact 81540 

agcacaatca tggctcactg cagccaactt ctgggctgaa gtgatcctcc agcctcagcc 8160 0 

tcctgagtag ctggaactac agacacacat caccatgcct ggctgattca ttttttaatt 81660 

ttttctagag acaggctcfca tgttgcccag gctggtctca aactcctggc cccatgcaaa 81720 

cctcccgctt cggcctccca atgtgctggg attataggag taagccgcct tacccagcct 81780 

ccagttttat tgtgttttgt tttgttttgt ttgagacaga gtcatgctct gtcaccagge 8184 0 

tggsgtgcag tggcacgatc tcggctcact gcaacctctg ccttgcgggt tcaagcgatt 81900 

ctcctgcctc agcctcccga gtagctcgga ttataggcat gcaccaccac gcctggctaa 81960 

atttttgtat fctttagtaga gaccgggfctt caccatgttg gtgaacacaa agtatttgag 8 2 020 
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a'gctggttta atatacagcc agctttccag aaatgcaact actgtgttca tcaagggaaa 82 08 0 

actgaacttc gttttcctta aaacttatca ccagctggtc atcattttga caaattctgt 8214 0 

caacaacagc catgtcattc ctggtatctg tattggtcac atctgaacga cacacgccct 82200 

gcatgcagcc ctgcaggtac tggctgtata acacgcctgg caagaattcc cttccaccct 82260 

gtgtcccgga aacaagaaag tgattgtgat ccactcacct cagcctccca aagtgctggg 82320 

attacaggcg tgagccactg cactcggctt cctcccgttt tttttttttt tcaatgctta 82380 

tattttactc taattaactg agtcaaaaat tgagaatagt tgaatacact ttcatgtaag 8244 0 

gcgaatcatt tagccgatac ttaactctgc atttgggcta ccatgccgct gtggttagcg 82500 

gggcaaggtg atgagccctg tctcaacaca cacaccccgc ctctccccag cccacttaca 82560 

cgcgtccatt cccacgcagg tgtgggggcc ttagaggatt ccctcttctt cgtaaagtga 8262 0 

gaatgggctg gactcggctt cactgcccaa caactccttt ttttcctttg ggaaactgtc 8268 0 

cttcccattc catataacct tcatgggctg agatcactca ccgagctaca ggaagaggcc 8274 0 

cattactatg gccccatcag ggttctgcct gggacagtcc ataaatgctg gaagagagag 8280 0 

gtgcctttcc ctactgaggc tgctaaaagg agacactgca aactggggct gctggtggca 82860 

atcttacact ctcagtgaaa gcctgcctgg ctgcagggga aaccaatgca cagaccagca 8292 0 

ggggcaacat catttgaacc cctggagaca gctgtgcctg aagcccacat ggctcaggca 8298 0 

catgctcaag ccactctgat ttgcttgtta cagtcaagag agtccttggc cgggcacagt 8 3 040 

ggctcatgcc tgtaatccca gcactttggg aggccgaggc aggcggatca cctgaggtca 8310 0 

ggagttcaag accagcctga ccaacatggt gaaaccccgt ttctactaaa aatacaaaaa 83160 

ttagccgggc atggtggcat gtccctgtaa tcccagttgt tagggaggct gaggtgggag 8322 0 

aatcgcttga acctgggagg tggaggttgc agtgagccaa gattgcacca ctgcactcca 83280 

ggctagctaa caaagcgagg ctctgactca aaaaaagaga gtcctgactg aagctgagag 8 334 0 

gctggtggac agctgtcagc aagcagtgtt tgtgagtctg atgtgagctg cgaagtccac 8340 0 

acctgatttc agagctggtg gctctttttc caatcaggac agctccaggc ctgagttttg 83 46 0 

gtgtggtctg tacctaaccg ctgtgtctta ggtcaggatc tctagaagcc aagcccaaga 83 52 0 

caggagttct caaatgccgt gatttactga gggaatgcct tcgggggaac ctgccagtga 83 58 0 

ggagagctgg aggaggcggg caggggcggg agctgagcca ggtgggggtc cctgctggag 83 64 0 

tctagcctca gcttgacccc acaggggctc tggagcagga acagcacact gcattgtccc 83 70 0 

ttgaggcagt cactggctgc agttgcctct gggagcagag taaaagtgga caggcatttc 83 76 0 

tgggagtaca attccctaga gaaggggaca gctctgagct gtgttaccag ccaccatttc 83 820 

caggggctgg gggatgcgct gcactggccc agagaggtct ctgagcaagg cccccacaac 83 880 

ctccacttca acctcctgcc aaacctcaag cagattgaga aggagccgct agagacacag 8 3 940 

gaaccctttc ctcatcagaa gtctccaagc tttggaggaa gaaggaggca ggcaggaggc 84000 

gggaggatcg tttcacccca ggggttcagg gctgcagtgg actatgatca tgccactgca 84 060 

ctactgcctg gtgacagagt gagaccctgt ctcaaaaaca aacagacaaa aaaaatcaat 8 412 0 

gctcttagca tctgctgggt ccacattggg cagtgtgtgt gacgtggtcc agctgcaagg 8418 0 

gaggctggga aatgcagttt cattatctga gtactagtgg agaaagaaga gacgaaagca 84240 

atgagaatcc ctgccatgag gaagtgagag ctgattcatt tcagcttcct aaggtgaggt 843 0 0 

gatggggccc agacaggttc tgaggccact ggaggtcaca caggccagca gggatacacc 843 60 

cagtggtaga ccctggtctc caggctccca ggacacagac cacataaaag caaggtcagt 8442 0 

ctcagcacac agatttaata agcactgtat gttatgccta cagctgcagg aaaccagggg 84480 

gaagatggcc acagggccaa tgccaggcag ggcacagggt ggggcccctc aagcagcata 84540 

tggggaacag gaaagactct gccagcatgg tcatggtgca caggtggcta caatggtggg 84600 

taatgcccag ttgggaggat gcagtgttag acggggtctt gtccgatgga ttccttaggg 84660 

ggttatatgc ttgggaggct gatacaattt cctagaattc tgggattctg gatggtctct 84720 

gacctgctct aggggatctg ggtctccaga gaggtctcag ctacaaaagt gaccctctcc 84780 

tgaccccacc ttgtgggaga ccccagtggt ctcaaggtct gtgaataaag ggacttacaa 84840 

ccaagatgcc tcctgacagg gagacggggg ctttacagtc aggccccagc ctacatccct 84900 

tgccccaggg gacggcacac agggtgtcca aacctacaaa gggcagcgga gtgaaagcac 84960 

cagggccggc ttggggggta agccagcccc ttctcaaggg gccagtaggc cacagaggcg 8502 0 

gcccaggcag gcagggtgga gtagctggct gtgctatccg aggtcgctgt cctaatcaga 8508 0 

gcagggccgg gagagccagg acaggaagta tggagagcag ggagcgtctc tccagggccc 8514 0 

tgcctgtgga ggacaccttg ggggtgggag ccaagtgcag acttgaggca aaagcctgtc 85200 

tctgccctct gcctgcaccc ccaccccagg cccttcatgc ctccacccag gcccaccagg 85260 

gacggaaggg tgtgaaggtg acagtggggg aggtgccagc tcaccctccc ctccctgcac 8532 0 
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ctacctgcag gaggctgtgc tccgggtctc cctgggggtt cagctggtcc agcaggactg 8 5380 

ggggccaccc gctggcaaag gcctgaatgg caccatctat gga-gggaggg gtccagggtg 8 5440 

gacttagctg ggaaacccta gcacagggct cagcacaggg gacaaagtga gagaccccct 85500 

gctgcaaaga ctggctcctt ggacagaagg tgtgtggtga ggaataacaa aataataagc 8 5560 

acagctgtga cagagtgact caactgtaca gtcttccatt ctcccctcta aaatggctcc 8 5 620 

cagccaggcg gcttgggagg gggcagtaag cgccgccccc actccccctc cactcccccc 8 5 680 

cgggcccctc acccaagcag cggttcctgg taaagagccc ccggaaggct tcgcagtcct 85740 

cacgccggtt cccgctggct ccgcagtcgc accagggcgc cacgcgcgcg ctcacgttgt 858 00 

ccacgtagtt aggggtgacg gcggtgcctg cggggaccct gagggcgaag tcatcggccc 85860 

acccgggcgg agatatcccc tggagaaccc ccgccctcgc ccggatcccg gccgcgcgta 85920 

cccacgaggc ccgc'gtaggc gcgcaggcag cgggcgccct ggtccagcag gcagccgtcg 85980 

ggggcgctgg gcgctggggt gcacgagacc tgaaaggcca ggaggcgagg cctgcgcggc 8 6 040 

gggagggcgg tgagccggag agaggccccg tccccaccct cgccacggcc ccgccgccgc 8 6100 

ccgcgcgcac ctgcagaccc ggctgcgctc gcagaagttt aagggctcaa ggcaggaggg 8 6160 

cggcgcgggg ccgggccccg aaaaggcgca ggagggcacg aaggtctggc gccgacgctc 86220 

ggcgcacgcg gggcccgcgc acgggcagaa gagcagtgcg tgggtgagcg cgggcggccc 86280 

gcgggcgaag aagcggcgca gggcccggcg gcagcgggcg cggggacagc ccccctgcgc 86340 

agcccggccc aggcactgcg ccacatactc ggagcgcaaa cgctggcacc gcgcgtccgc 86400 

cgtgcaggct tcggccgcgt ccacacatcg gttccctccg accgagctcg ccgaccctgg 86460 

gaaaggcgcg gggaggctgc aggtctcagt gcgccccgca tgcacacttc acccagccca 8652 0 

tccacgcccc tggaaggtgg gggacactga aaacccgaga aagcatagag tgtggcccaa 86580 

aggcggggcc gactggcact gatcggctct ccgtcgtggg gatgggcagc ggcgggctct 86640 

ggagggctct gcaacacgtc agggatggag agggagcatg gaggctgagc caggccttcc 86700 

ccaaaagctg aagcccctcc ttcccaagca tcgccaccat tcccagaacg gccacacacg 86760 

tccaaagacg ggtccagggg ctcctcctac actcaatggc actacctgac gggggaacaa 86820 

tgaggctttc tggggatctg gccaatggag actttaaaac cagagatggc atagaagaca 86880 

gtggctgctc ctagtctctt aatgacaggc tgtgacgcca tacctttcct cctctacccc 86940 

ctccacctcg ccgcttatcc tccaccatgg agaccctttg ggtgggtcta ttgatttctg 87000 

tttctccctc cttcctgacc tcctgctcct cctcctgccc cctgcatggg tgtctcccct 87060 

ctccagccct tcctgatgct cccttgagtc gaggtcatcc tgccccatct gcactggctt 87120 

cctccagccc ctccaccctt catcagccag gggagagcgg atttatgcaa gagctctggc 87180 

cactgaagaa cccactgagt cctggctgcc agggtgtgga ttcagagaag tccctgacga 87240 

cacaccgccg ggctttgtcc tgagccacag gacccaaggg ttcatcagtg gttttagtga 87300 

ccaggcccca tgataggtga caccctgctg gagacagaga aggtgtctgg caccccagtc 8 73 60 

acacaccagg ggttgggtag acagagggtg agctctgccc acatgcccac aggctgtgtc 87420 

taaatagagc caggataact ggaagttggg acttggcagc cccttcccag gatttgagtc 8 7480 

acaaaaacca ggctaatcga atccccccca gcctccacaa gttctccagt ctggagcatc 87540 

tcattcagac tagaagcctt tctggtgccc agcctgagtg gacgtgcaca gcctggcccc 87600 

tcatccatct ccctgtgtgt ttgtgcacag gggctacggg gtggtatccc cccccagggc 87660 

tgggagcagg gtgcctctct ggcaatggct gtgtctttca ggctgtggat tggaccgcgt 87720 

ggtgagagtg attcctgctg tgacttcagc acctgcctct gcaggaagga tgaacatagc 87780 

tcgagggctt agaatggtcc ctgagtggag gggatagatg gcagcggggt gagtcctccc 87 840 

ttcctcccct ccccaaaggg cacactttgc tttcctctca cagccttgcc aggcacatcc 87900 

tcaggtctca gctgcacaca ccagcacctg atgttgccta agggtttgaa cccacctctg 8 796 0 

cctgttgctt acctgtcacc cttgcccaga acacccttac ccagtctcca ccctcacagc 88020 

agctcatctc agctgcaaga tctcctccac cttaaacctt taccagattc cactttccac 88080 

actacactcg gtaaagtgac ccctccctcc gcgacatcca tgcagggacg gagcagtcca 8814 0 

agaaggtgga acaacagatg aacaacaccg aagattcacc aaggaatctg tgtgatgggt 8 8200 

aggagggctg gggctgaggg gagggtgaga gagttgggag atggcagaca ataggcaaaa 88260 

gttcctgggg cacgggtgcc aaggctggac tcgggagggg gatggcggtg gtggcttgac 88320 

ggggctggga ggtgaggtgc tcagcacata gtggggtcaa ttgccagggg tccccaagcc 88 3 80 

gtgcgcacag gcctctcact ggcaaccctt cctgtggttc agcttttcct ggaatatctt 8844 0 

gccctcccac cttgtcactg aggagcaggg acagagaggc cagggttcgg ggtccagggg 88500 

aagagagggg cgctcaccca gtaacagcag cagcagcagc gcaggcccca ggcagcggac 88560 

catgctggac cttcaacaga gaaggatggc tggcagagtc ctagtctgat aggcgccccc 88620 
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tfccctagtgc 
gggaggagcc 
cgtcaggtaa 
cacactagcc 
ctctcccatg 
attctatttt 
aggctggtct 
ggatgacagg 
ttagacttgc 
gcg tgatggt 
gtacaatcct 
ctttttctgt 
caatctggga 
gccagttatt 
agacacagca 
aatctcatcc 
gatttaaaaa 
caaacaaaca 
ggagtgcagt 
ttctgcctca 
ttttgtattt 
aacctcaagt 
ctgcgcttgg 
aaggagcaaa 
agaagtccct 
gcttgttgac 
tgactaccta 
tttttaatga 
caaaatgtaa 
ggaaaggggt 
ttgagacgag 
gctactccat 
tggatgatct 
taacttcctg 
gtggggaccg 
ttctatattg 
ggctggagtg 
attctcctgc 
tagtttttgt 
ttctgacctc 
gctaccgcgc 
cctatctcat 
cagccttatt 
cacatcctct 
ccaagctctg 
aagggagggg 
ggaagcccag 
gcgtcacagc 
gtacctcctg 
tgcctcctgc 
ccagttctgc 
ggcctcccct 
atggagaagg 
ggcctcccca 
ggtaagagcg 



ctctggagca 
tttgctgcag 
acctgggagc 
ctgccacgca 
ctaaactagg 
aaacttaaaa 
tgaactcctg 
tgtgagccac 
agaaaagttg 
ccatcttact 
gttaacgaaa 
tccaagatcc 
gttccttcat 
ttgcagaatg 
ttgttttgac 
aacctgggca 
gtgaccagaa 
aacaaacaaa 
ggcacgatct 
gcctcccagg 
ttagtagaga 
gacccgccca 
cctcattagc 
ggatcagttt 
gctgaagata 
tgtacggtca 
tcctgtctca 
atcagaagac 
accaagatgc 
ccagattcag 
tccataaagt 
agagagcgca 
gctaaacagg 
atactgccat 
accagaggtc 
caagctgatt 
cagtgacatg 
ctcagcctcc 
atttttagta 
aggtgatccg 
ctggcctact 
cctgtgacgc 
ttgctcagcc 
tgtgttttgg 
accacacagg 
cagagagaga 
ttcctcccgg 
tgagctgggc 
ctcagcctca 
ttttctgctg 
agcacagcca 
cttagagccc 
ggtggagggg 
gggtcccttg 

ggggaggtat 



ggagcacagt 
caactccccg 
cacgggattc 
cacccagcaa 
aacgctatga 
aaattttttt 
ggctcaaacc 
catgcctggc 
gaaaatgtaa 
taatcatagt 
ctgtagactg 
attctaggat 
ctttccttat 
tttctcttga 
fcaaccaaaaa 
acagagtaag 
aagcccccac 
caaacaaaaa 
tggctcactg 
tagctgggat 
caaggtttca 
cctctgcctc 
atcttaaatc 
gaggacaggt 
gctttgcttg 
ccacagttgc 
gtacatctgt 
tgtgatgtgc 
ccctgcagtg 
accccaggag 
gaaagcacat 
gccctgaggg 
ggtggattgt 
ggcatctgta 
actctcatca 
tttttggttg 
atctcagctc 
caagtagctg 
gagatgaggt 
cccacctcag 
gcaaactatt 
agaatgctgt 
cctattcagg 
cgtgtgggag 
gcaggtacac 
agaaggtgtg 
ggcagcccaa 
tcgctcgcta 
gggtccatgc 
ttggccccat 
ggtgatcctg 
tctttccttc 
gtgctgggca 
tagctggagt 
attaacgtgt 



gaaaacccag 
ccctcattaa 
ggcgcctgaa 
cacacagaca 
aacagcattt 
gaatagagtt 
atcctcccac 
ctattttaaa 
gcattttcct 
ccaatcatta 
ttttgcattt 
cccacatcac 
ctttaccctt 
gttgfcctgat 
gttattctat 
gccctgtctc 
tatgatttgt 
cgatgtcttg 
caacctccgc 
tacagacgcc 
ccatgttggc 
ccaaagtgct 
tccacacagg 
aaaataaaaa 
aatgagctca 
tgctgcgcgc 
ctgtggtttg 
tttccgctgt 
gttgtgcttc 
agggttcttg 
ttattaagaa 
ctgctggttg 
tcatgtctcc 
acctgtcatg 
ccatcttggt 
gtttggtttt 
actgcaacct 
ggattacagg 
ttcaccttgt 
cctcccaaag 
ttatcagcaa 
aactgtctgg 
atggagttgc 
gaaagagggg 
tctcccacct 
acagatggcc 
gtagctggta 
cccccagctc 
ctaccctcct 
gccagctcct 
ggctccagac 
ccacgtggcc 
gtcttgggga 
cccgcggggt 
atttttagag 



ctgfcctgcgc 
gtctccaccc 
tgtgcctaga 
accacccagg 
ttcttttcct 
ggggtctcgc 
ctcggcctct 
cttcttattg 
tacacccttc 
cagctaggaa 
ctcatttttc 
agttgtctcc 
gacacttttg 
gttttattct 
aagtaaatat 
aaataaagtc 
cttgactttt 
ctctctctct 
ctcccaggtt 
cgccaccatg 
caggctggtc 
ggaattacag 
ggtgtgttcc 
tgcgcttgct 
attgcaatgc 
ctagaacatg 
tggtggtcca 
gctaaccatg 
actctacgac 
gatctcgtgc 
agtaaaagaa 
cccattttta 
cctttttaga 
gtgctggtgg 
ttgggtgggt 
tgagacggag 
ccacctcctg 
cacacaccac 
tggccaggct 
tgctgggatt 
ggtctttatg 
aaacgcagcc 
tctggttcac 
tgagggaagg 
gtctgtgggt 
gcaggccaca 
gttgggtggc 
agcgtccact 
gcttcccagt 
ttctgctgag 
aggcctctcc 
tccccagggt 
ttagccaaga 
ctagacaccc 
tctctctttt 



tgatacccct 
caggctgcac 
cggaccccaa 
gagacctggt 
tttttatata 
tctattgcgc 
caaagtgcta 
ttgactaatt 
acccagcttc 
actaaccttg 
cactaatgtc 
ttattctcct 
aagaatcctg 
gatcagaatg 
tgaggttaaa 
tcaacactaa 
ttttaaaaaa 
ccctcaggct 
caagcgattc 
cccggctaat 
tcgaacttct 
acgtgagcaa 
ttactgttat 
gcctagaggg 
cagtgctgag 
gtcactttct 
tttcctaatt 
gccgctgaag 
atctgttacc 
aagaaagaat 
taaaagaatg 
tggttgtt'tc 
ccatatatgg 
gagtgtagca 
tttagccagc 
tctcgctcca 
ggttcaagcg 
catgcctggc 
ggtctcgaac 
acaggcgtga 
acctgtatct 
cagtaggtct 
aggcctctga 
aaactcaaaa 
gccacaagtc 
gaatgtcaga 
caaacagagg 
ctgcccctca 
cacttctgcg 
cttctcttct 
cccagtctgg 
tcgccactga 
gggcagagtt 
cctcctgaag 
tttttttttt 



88680 
88740 
88800 
88860 
88920 
88980 
89040 
89100 
89160 
89220 
89280 
89340 
89400 
89460 
89520 
89580 
89640 
89700 
89760 
89820 
89880 
89940 
90000 
90060 
90120 
90180 
90240 
90300 
90360 
90420 
90480 
90540 
90600 
90660 
90720 
90780 
90840 
90900 
90960 
91020 
91080 
91140 
91200 
91260 
91320 
91380 
91440 
91500 
91560 
91620 
91680 
91740 
91800 
'91860 
91920 
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ttttttgaga 
ctcactgcaa 
ctgggattat 
ggtttcacca 
tggcctccca 
attttatgat 
ttcctggaga 
tacaccaacc 
aaccagctca 
ccaccagaat 
agtggaaacg 
ggaccaaacc 
ttatttcaac 
ctacaactga 
gagtggagcc 
ggcaacagct 
gaacgcccaa 
tcctgtgccc 
ccaccccgtc 
ctcagggcac 
atatgtcttt 
gcagtggtgc 
tcagcctcct 
tttttctgta 
tggtgatcct 
ctatagccaa 
tctgtccccc 
tcaggtgtag 
gtgggaagaa 
ctccagggga 
gcagtcatgc 
tcccaggaca 
tcaggggctg 
cagcagtgtt 
ggtcgaagct 
gattcactgg 
ctttaaatct 
cttgatctgc 
aagctgtgtg 
cagcttctcc 
gtcataaact 
gaggggtggg 
tgcctacctt 
tcatgagagt 
gcaggagacc 
tctggaagga 
ggtgcctcac 
gtgaacgccg 
aagagtccag 
gtccagccca 
accctctcag 
gagcctcatg 
gacaccaccc 
ttggatgtcc 
ccttgatgat 



cggagtcttg 
cctccacctc 
aggcacgtgc 
cattggccag 
aagtgctggg 
ggtttaacat 
tcagaaacaa 
acccccttca 
gggccaggca 
gattcaaaat 
gcactgtggg 
tagtgcctcc 
agcattggcc 
ggcaggaggg 
cgaggtgaag 
ggctgatata 
gggaggcagg 
agacacatgc 
agccaccacc 
cccatgggtt 
ttaaaattta 
caccatagct 
gagcagctag 
gatgcggtgt 
cctaccccag 
catgtctttc 
cgacacacac 
cactgggatt 
acaggaagga 
gtgtggactc 
cctccacaga 
gagccctttt 
tgtgaccttt 
ctccagccct 
ccactcgggg 
ctctgcccct 
gttcatttta 
tgagaatgag 
gcagagagcc 
ctccccagcc 
catttccata 
agaggtggga 

gggggtcagg 

tctcagggtc 
ctggggctgg 
ggcaggaggc 
tcacccaggg 
cccagggggt 
gaggttggca 
gggagttctg 
ctcccttcct 
ccctcggctg 
agcagcctgg 
gccccttcgg 
tccaagccat 



ctcttgttgc 
ccatgttcaa 
caccacaccg 
gctggtctcg 
attacaggcg 
ctcggggtgg 
ctcatgcctt 
aggaactctc 
cctacccaga 
gccaatccta 
ctgtggcctg 
ccactgtggc 
tcfcgtgtcat 
ggctttagag 
ttgcctccct 
ggccactcag 
agtgacagag 
ctggggctct 
ccaccaacca 
gaatttacat 
ttttagagac 
cactgcagtg 
gcctacaggt 
cttgctgtgt 
cttcccaaag 
ttttgacttc 
agaatggggg 
gggggtgggg 
aggtccccaa 
agtcgaacca 
gctgggtccg 
gggatggcca 
gcttctggga 
caggaacttc 
aagaaacttc 
gcagttcaag 
ggagctacct 
gaggatatgt 
catcctcact 
caggggcttc 
ataacctgag 

gggtgggcga 

cgagactgct 
tgggagaaat 
ggaggggcat 
atgagccctt 
gccagggctg 
ggtccctgtg 
cagtgatcct 
cgtttactga 
gctccctctt 
gcacctcctc 
cactctccag 
cagctatgtc 
ctttgactgt 



ccaggctgga 
acaattctcc 
ggctaatttt 
aacttctgac 
tgagccacca 
gggcttgttg 
tcatatgcaa 
acatacgaaa 
caattagagc 

CGCtcttCCC 

tgccttccac 
cctgcatggt 
cagtcacctt 
agcactctca 
gcactcagtt 
cagctcttgc 
caaagagggt 
ggctgccatc 
ccaccccgcc 
acagaaaaga 
agggtcttgc 
ttgaactcag 
acacaccacc 
tgcccaggct 
tgctgggatt 
tactttggta 
agaggctgtc 
gctcaggaac 
aattatgttt 
tagggcccca 
tggaaattgc 
gcactaccca 
ctgatggttt 
tagtgtggct 
caagctgcct 
ttcctggagt 
ctcaccagag 
tgtcccctaa 
cagtgaggac 
cagcatcctg 
cccagaaacc 
tgagaggggg 
ggcagaggct 
ggtggagggt 
aggggactca 
cccttctccc 
tccagtggct 
tgggccatct 
gagtgggtta 
gtttcttggg 
caggggagcc 
tctctagtcc 
atgctggcat 
tgctctcctt 
ccccatccca 



gtgcagtggc 
tgccgcagcc 
tgtattttta 
ctcaaatgat 
tgcctggcct 
gctggagaga 
accgaccagt 
ccagtatttc 
ccaaccccac 
ctggcctgcc 
tcgctcctga 
gggaaactgt 
cataaattaa 
cctggcccct 
ttgggatggt 
atgaagcaat 
gttcaaacta 
taggctcggc 
aaccaccacc 
ctaaccaagg 
tctgtcaccc 
gctcaagtga 
acgcccagct 
ggtgtggacc 
ataggcatga 
tcttttctta 
agattctgag 
cacctagggg 
gtttgcagag 
ggaccactag 
atgtaggaga 
gcctccactg 
attgagctgg 
ctgggttcct 
gcaggtgctg 
ggctgtcagt 
gcaggatctt 
ggactggggc 
cagtgatcca 
gtctccatga 
tgattagggg 
aggctttgaa 
tctcagggtg 
aaatgttgtg 
aggtgactgg 
ttccctctcc 
gtggggccca 
ttggggctga 
ttgcctcccc 
gcacccatct 
ttgggatcca 
taacatttcc 
cgctcagctt 
gcccctgggt 
tcccccaagg 



acgatctcag 
tcccgagtag 
gtagagatgg 
cctcccgcct 
tagattctct 
aactgcttga 
cttgagtcca 
ccctgcccta 
gccccaaacc 
ttgcctcccc 
ttctgtccct 
gagtaataac 
aattttgcaa 
tgggcttgtg 
tttgttatgc 
gggagaatgt 
ggcaacaccg 
ctgccagcca 
ccgtgtggtc 
tccaagtgta 
aggctggagt 
tcctcctgct 
aattttaaaa 
tcctggcctc 
gccaccacgc 
aatggttccc 
ctccagaacc 
agaagacagg 
gccagccagg 
cttctggcca 
cacaccagac 
gtgagggagg 
agagtgtgcc 
ggagtgggtg 
gaggtccggt 
ggccacctgt 
ggcatctgga 
cccaggctgc 
ggaaaagcca 
taaccaagag 
gcagcaaact 
tccaggtccc 
gctgctgggc 
aatatggtca 
gtgctgccca 
acctccccct 
actccatggg 
gcaacgtgat 
gcagcatggt 
gctccaagtc 
ggctccaagt 
tccaggctct 
ccaaagaacc 
gccctgctgc 
ccttgtcatt 



91980 
92040 
92100 
92160 
92220 
92280 
92340 
92400 
92460 
92520 
92580 
92640 
92700 
92760 
92820 
92880 
92940 
93000 
93060 
93120 
93180 
93240 
93300 
93360 
93420 
93480 
93540 
93600 
93660 
93720 
93780 
93840 
93900 
93960 
94020 
94080 
94140 
94200 
94260 
94320 
94380 
94440 
94500 
94560 
94620 
94680 
94740 
94800 
94860 
94920 
94980 
95040 
95100 
95160 
95220 
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tcbtgtgatg ttccttcaaa acattctccc ctgccctgag actcccgcct ggggatgaga 9528 0 

agcagccgcc accctctgca gcgccccctc cgtgctgaca cgccaggctc tggccacctt 95340 

gctcctctgc ccacagaccc tcaatcacaa ctcgctttgt cagggtcctc ttagctgcca 95400 

cccggggccc aggtggtgcc ctgcccctgt ctgttatgcc ctctgccccc atctctggcc 95460 

caaatcatgc catctccctt ggcttgcccg gagcactccc aagaccaggc tatgtcagac 9552 0 

atggccacag agtgcctgcc ctgcctaggg ccctggtgca gggtgagtcc taggacagcc 9558 0 

atgcttagta ttatgtgact ccccactccg ccaccaccca ggtcacagag aactgggtta 95640 

aggcagggcc ctggcacagg ggcagccagc accgcagctg accagtggta tggagtgaaa 9570 0 

agatgtgctg ggcccagcat ttgggaactt caagggggtg acagaggtga tttgtgcaga 95760 

ggaagtggca aagggccgaa aactggtgag acagaggctg gacaggcctc cgggggcagc 9582 0 

atggtacagg gactgcaatc tgagccaggg aaaaacaggg cgaagtcaag ggtgaggcag 9 5880 

ccagctggtg ggagaagcag gagagtggac aagaggagct gtactgggag gtagagggcc 95940 

atgccttgcg gtgctggtgg tggggcaggg atccaccccc tcgcttgact gggaggccac 9 6000 

tggaacctcc tgttcaaagc tacttctttc catggcctct ggggctgctc tctgcacctg 96060 

ggggcaaggc tgagggcctg ccccagctcc cacagcccca gcagagctct gaggagggga 9612 0 

accgcaggag taggctcagg aagcaggcgc tcggagccta cccactgcac gcagggtccc 9618 0 

ttctgcagcc ccagctgcat cgctgcagat gggctcctgg gagtcggtag caacaccagg 9624 0 

ccaggccggc ccctgggagc agaggcagca ggacgctgag gagcatggcc agcaggaagg 963 0 0 

tgtcatggtc tgcggggatt gggggaaggg gcgctgagtc ctgagcaggt gcaccacccc 963 6 0 

agctcctgcc cacatgcccc ccactggcat actttcagcc tgcacagggc cactgtccat 9642 0 

gctgccacca aagcctggct tgtcacagaa gggtggagcc cagcctggag cacagtggca 96480 

gttatggttg ctattgcaaa cctgcagaga agagaagagg agggtcacgt aggattagga 9654 0 

accccaaggt cacccccact cctcgggcta tcaccccgtg gctgtggcag gcagtcaggc 9660 0 

agcgctgaag ctcctggaag gcattcttcc tgcagcgcct gctctggcac acctacgggc 96660 

agtgcaccag gcagtgaggg ggacactggc ctgcgggatt caaacggcaa ggaggggtcg 9672 0 

ggtgggcaga gctcaccatt ctaggtccac actgggtgcc tggctctacc aggcccaggc 96780 

caagcaggtc cagctgggca ctggggagtg ccaaggctcc ccgacaagtc acttcctggc 96840 

catctaggtg aacggtagag tccactggca ccatgtgcgg tgcgagcagg ctgggctttc 9690 0 

caccctggca ctgcagcttc ccacacaggg catccctggg gaggaagtag aggggggtca 96960 

acagctgcag tacccccctt ccccaaaccc actccatagc ttctgctccc tccactcagc 97020 

tccactccct acctccctgc acagggcagg aagtggccct cgctgtcctg gccgcagttt 9708 0 

ccatgagcat ctcccgcaga gttcaccacc tggaaacagg cctcgggagc tgggtgggag 97140 

cctgaggaag catgggccag gctgggggca gctcggagag gggctgcgct cagcgggcct 97200 

cagtttcccc tgcccatctt ccccacagta gaaaactggc cccaccagag gatggggggc 972 60 

agggtgcaag ggtgctcgtg tcctctcacc aggcccccag agctgctggc actgctgctc 97320 

cagcgtggga catgcgccat cccagcagta gccactgccc ctggcacagg gtgagccgtc 973 80 

cagtaggtaa acgtctgggg gacagtggga ggaggtgccc gtgcaaaact cagggaggtc 97440 

acagtcaccc atggcctggc ggcacagcgc tccagccggc ttcagctgcg caggtgacgg 9750 0 

gtggtgggga aggcagagag aggccacgtg cagtgagagg tccatgccga gagcgcggct 97560 

cggagctggg ggagccaggc ctacccaagc ccagcaccca acgggggaac ctgagggcac 9762 0 

caattaacta aggccaacag gccggctccc aagctccccg aaaccctcac cctgaacctt 97680 

ccatgccctc accaggcagc gcacgcagca gtccccgtgg gcgcactggg cccccgggcg 97 740 

cagcgagcag ttgtgagcaa agcagcagag gtcgcggcac tcctgggacc agaaaggcaa 97 80 0 

gaagggccca ggtgagggcg cagcgcccca gacctgagcg gagagggcaa gtgggggccg 97 860 

ggcgagccga cttaacctgg ccagggccgc agtcacactc ctcgcccgct tccacgaagc 97 92 0 

cgttcccgca gagcgccggc ggcaccggga gtccggggtc cggggcattg gagaggcaag 97 980 

cgccgccccc cttgcggaag aaggcgcgca gctggcggcg gctgcaggcg ctgaacacgc 98 040 

gcggaaacgg gtgcctaccg gcacggggag ggcattgggc atggagggac agtcccccaa 9810 0 

cccccgcgct tctctgatcc ccacccctgg gcttggctac agccgccaga cgcgcagagc 98160 

ccagagaggg gaagtaaccc gcgcaaagtc acacaacaag cgggacaggg gacgatgcgg 98220 

ccccaatagt gagcagcccg ggacccaagg tggaatcgcg acccgacggt gctcctcccg 9828 0 

gtgtaggagt aacctcgcca ggttactcgg aaaataatct tcataccgtt gagaatccac 983 40 

tttgcctgag cttcttccct ttaagcctca taaaccaccc tgaagcggac actatgatca 98400 

ttatccccat tttacagaag aggaaactga gggacgacca aagaaacgca gcggaggaag 98460 

tccccaggac tagccgcccc gccgcagccc cgacccccca cccgcgtacc cggtggccgc 9852 0 
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agccatgacg 
gctgaggccg 
ctccgagtgg 
tctccctcgc 
cacatgccct 
gcggagcgcg 
tccgggtcag 
gcccccggcg 
tgcggtcccg 
tgagaagctg 
tccttcgtgg 
cgccaccccc 
ttccaggaga 
cagaaatccc 
gggtctctcc 
ctgcgcgctt 
ggcttgagcc 
cttgtttctt 
tctgatttgg 
ctgctctggg 
ccacaggttc 
tccttggagc 
atcaggccac 
aggcccagcc 
ggagtcgggg 
ccagccccaa 
gctggctcca 
tacctctggc 
caccactggc 
gctgagaggg 
tctcccagcc 
gtgtcttagg 
agaggaagga 
gaggacgccg 
caggcagcag 
gacttggagt 
gttgggcatc 
cagacttcta 
cctccagaga 
acacccactc 
ggatggatca 
cttggcctca 
tccaccacct 
aggtaggtga 
acagtggaac 
gtcaagtgct 
ataatctggg 
aggcaccttc 
gatggtccag 
ccaaattcat 
aaactgctgc 
tttttctttt 
gagtggtgca 
tctcccgtct 
ttttttgtat 



cagcctccgg 
aggctgtggc 
tcctgggggg 
ccccgcccgc 
cgacgggcgc 
gcgtgaccag 
aggcacccac 
ccactgcagg 
ctcggtccac 
agggcgaggc 
ggcgcccttc 
atcaccgctc 
cgctgtttgg 
ggtggcttga 
tcaccagggt 
ctcgcctgcc 
ctgaccccca 
cagacttcaa 
agggctatag 
gaccaccagg 
ctttccaggt 
cccggggtgg 
tagggtgcag 
tcctctccca 
aagcccctta 
atctcagcca 
accgcccctt 
ggtgcatccc 
tgcccatctg 
ggtgttacag 
ttcctcccta 
gaagggacag 
gccttggtga 
gggaggagtg 
gatccacagg 
tggggaaatg 
tggtcgaccc 
gaccccacat 
aaacagaatc 
cggtaatgat 
atcataagtc 
ttacctccag 
gccccagctc 
tgtgaccacc 
tagagtttgt 
cagaacagcg 
tggttttaag 
ctgagcacag 
gacactgagc 
tcccccctgc 
ccccaagtgt 
tctttttttt 
gtggtgtgat 
cagcccccta 
ttttagtaga 



actcggccgc 
cgatctcatg 
ccgtgggagg 
ggggctcacc 
caggcccact 
gcggggc c gg 
gtgagcagct 
aaggcccaga 
acctccaggc 
ggggctgaag 
ctcttcccca 
tctccccgcc 
tgtggttcaa 
ggggctgagc 
gtggtctgcc 
ctgcggaggt 
accccagctc 
taaaaatact 
atacagcatt 
aaggctggtc 
gagcagctgc 
ccagggacgc 

aggggtagga 

gagctcacct 
ctcgcccttg 
gggctggagc 
aggaatgcaa 
agagcccatg 
ggccgtagtg 
ggaacactga 
aatgctaatg 
gagcaatggt 
tggcttagtt 
ggaatagggg 
atcgggaggg 
cggtgacctc 
tcttgccccc 
acctcccacc 
gccacctgcc 
tccatcttca 
aatctgtctt 
accacccgct 
tgccccctac 
ccattgaaag 
tccctgtcca 
cctgcggtca 
agcctgcgct 
gtctgcctgt 
accaaaatgg 
tcccctccct 
agccactgca 
ctcttttttt 
cttggctcac 
agtagctggg 
gacggggttt 



agcctccacg 
ggccatggtg 
gcggtcactg 
gtgctcacgc 
gtggcgccct 
gaggtgaggc 
gcgcggagtc 
gcgtggcgtt 
cggtcagcgc 
ccgggacagg 
aaccccacca 
gcccccaacc 
gtttcggtgc 
tggccccatc 
acaatgtaca 
gcaaatgggg 
ccagaaggaa 
gggactcgag 
cccactccca 
atgcccgctt 
tccatccgaa 
agataatagc 
gcgggtgtga 
catcccagag 
gtagtggcaa 
aagaggggca 
ggaggagtag 
gaagcatctc 
ggtttctatg 
attcagcttc 
gagcagcttt 
gacttgctca 
gtgggagatg 
aagagtttgt 
gaggagtcag 
ccccagttcc 
acctgcccta 
tgtccttcag 
cacgctgctt 
ggctccatct 
ctttaaagaa 
aatgatggct 
ccctgcccca 
ggtagggacg 
tccgactcca 
agcactcagt 
ccagcccgga 
cccttcccca 
ccttctgtga 
gtagatggag 
tctttttctt 
gagacggagt 
cacaacctct 
attacaggcg 
caccatgttg 



cagcagccgt 
gctgcggcgc 
cggccgtaga 
ctcccgagct 
ggaaggcgcg 
cgccccaccc 
gtggggccgc 
ggcgtcctgc 
cacctgaatg 
gcgccccatc 
gcacctgcct 
tggtccacgt 
cgagtcaaga 
cctgaccccg 
gttccaggta 
accctgagtg 
gtttaacatg 
gcctgtgaat 
tccgatcgat 
tgttcccagg 
agatctcgtg 
tggcattcct 
gggagctctt 
caggtgcaga 
tgatcctagg 
agagggaggg 
gggtaggaat 
accgtgtggt 
tatcctgggg 
ctcctgcctc 
atgagtgaga 
gatcagaaac 
tgaatatggg 
ggtt ccc agg 
gagacactgc 
cctgcctgct 
gatccttgaa 
tgattgatgc 
ccaccctccc 
caacaggatc 
aatccttaac 
gcttcccccc 
cacacacccg 
tcgggaaaat 
cgagggagaa 
aggtgatata 
cacccacccc 
gctcaaaatc 
tctggccctc 
accttccagg 
ttttcttttc 
ctcactcttt 
acctccgggg 
ccgccaccac 
gcaaggctgg 



cggggtcgtg 
cgatggggag 
gcctcctgtc 
ctcggcgcgg 
gcccctgggg 
gggacccgcg 
tgcgcccaca 
gtgacgcggc 
tccagagtcc 
gcgccggtgg 
gtcctgccgc 
agttggcgac 
actgggaagg 
ccaacccctg 
cttccgggtc 
gaagctgctg 
ttttctggaa 
tcctgtctct 
gcccctgacc 
atccctgtgg 
gg 1 1 gagaag 
gctgagggtg 
tccccatccc 
ggactaccca 
gaggaagggg 
tgtggtaggg 

ggtggggggg 
tgggggccag 
ccagcagcct 
ctccaggatg 
cactcacagt 
tcttggggct 
aagcaccagg 
ggacctgcag 
cgaagaatgg 
gccctccttt 
atattttcct 
tcaccccctg 
tgccttctcc 
tttcccacac 
ccaacctcac 
tcccaggcat 
ccaccctagg 
atggttgggc 
taaaatacgt 
tactgataac 
acccagccca 
tttggctctg 
ctgggactct 
caggtcagac 
tttctttttt 
gcccaggccg 
ttcaagcgat 
gcctggctaa 
tctcaaactc 
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100860 
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ctgacctcag 
attcccttag 
tatcttgcaa 
ctcctggggt 
ttcagcctcc 
ctggggtcct 
actgatcccc 
agcaggagct 
tgggcaagaa 
taggtggcct 
cagcaggacc 
cccaaactca 
tgccaggcac 
ggccccgccG 
tgctgaggac 
tacctgaggc 
cctccacttc 
gcccccgtcc 
cattccagcc 
Gcagccctct 
tcccccttgt 
tgggaggctg 
tggcctgggg 

ggggagcaga 

catgaggctg 
actcgaggtc 
tgccccttga 
tacctagcct 
accggtgggg 
agacagggag 
atggggaggg 
ttatgggcct 
cctctcagcc 
gctgaaaatt 
aaatcgttca 
caatttcccc 
tactgccctc 
ctggaagcgt 
gcacatgggg 
ccagaagtgg 
ctgagaggga 
ttctttttgg 
ccacaaactc 
agggagggaa 
cacgagat cc 
gcaggggcct 
gtaatgtagg 
catggcactg 
gtgcggggtg 
ggggacctga 
ccagggaggc 
ctggagaagg 
aaagagaagg 

ggggggcagg 

gaaggaaaga 



gtgagccgcc 
cctggggtgc 
atgtccacta 
tgggagggcg 
agggagccaa 
ctgcccatcc 
tcctcccagc 
cctggccttc 
agagtgtgga 
gtggagccgg 
ccccacatac 
ggatgtttgc 
accagcatcc 
tcttctttca 
attatggggg 
ttggttccaa 
caattttggc 
ctccggctcc 
aagagagcac 
ctccttcccc 
ccactcagcc 
ctgcgccttc 
ccctgtccaa 
tggtccttgc 
gaatgccact 
cctgttcctc 
tatatacccc 
ctactctttc 
gaagcagagc 
gcccagaggc 
caatctcaaa 
aggggatccc 
ttcatggcaa 
ccaaacaaag 
gaccttagtg 
aagctctctg 
ttagtttgca 
gtgaagagcg 
tggtcttaca 
attccacagg 
cagcagtgtg 
tgcttttttt 
tggacccatt 
agagggaggt 
agatgtccaa 
at aggagtag 
gagagacaaa 
accggctcct 
actggctgcc 
ggaacacagg 
actggaggcc 
agggacgccc 
ctggaaaggg 
cttgggctcg 
gcgcggcagc 



cgcctcggcc 
cggctcatct 
ggtggtgctc 
gaggtgcccc 
ccacaggcct 
agccacccgc 
cctaccccag 
agcctccagg 
gctgagatgg 
ctggggagga 
tatggagggc 
agtcctgaca 
tccctagaga 
tgcccagaag 
catcgggtag 
atgagctgct 
aaactgctcc 
gacacctgct 
actggctctc 
caccctcttg 
tggctggggg 
ggaatgttta 
gggtctcaga 
ctccctggcc 
tgctttattg 
tgcagcccct 
tcatcttgaa 
ttctgtacca 
ccctttgggg 
agagtgac cc 
gctcaggcca 
agtgagggcc 
tgctcccctg 
agggctctct 
ggtctcaggc 
ggcttccgtt 
gcacagatat 
ggattgggaa 
tctgggggac 
agccagtgag 
tccacagggt 
tttttttttt 
tctgcctggt 

ggggtggaga 

accccagtgc 
tgacttggtg 
agggctggga 
ccaggctgac 
cagggatatg 
ggcatgggac 
ttttggggca 
gtgggcatcc 
acagctcagc 
gcgatcacaa 
cttagggatt 



tcccaaagcc 
tttccctcta 
actcatcgcc 
acagcggggc 
gcgtgactct 
acagatctgt 
cctggcactc 
gccaccagcc 
tggcctccag 
agttcttggg 
gtggaggagg 
acagccactt 
ccagaggtcc 
gggcatcaaa 
ccatggtcag 
gctcatttcc 
ctcttccaaa 
tcactggacc 
agctaggtgt 
gcgtctccca 
cccagagttg 
acccccgact 
gtctttggag 
acaggcccca 
gggaaaggtc 
gctatcttta 
atgtgagtgt 
tcttggcagg 
ctctcctctt 
cagtgtgtgt 
gtgccgtgct 
ctgggttggg 
ccttccctct 

aggaggggca 

tcccagccta 
tgcccctctg 
tatggcacaa 
gaggctgggg 
atcaggagag 
gctgaaggtt 
cacatgttct 
tttttagagg 

gggggtgggg 

aagctgaaat 
cttcttctgc 
gttctgggga 
ggttcagggc 
ggtgcgccag 
tcctggagta 
aaagcagagg 
gacttcacct 
ttggatctga 
tggggacacg 

ggaagaggcc 
tttagatggg 



accgcatctt 
ggatttcttt 
agcagggagc 
tgacagcctc 
ccctgtcatc 
gtcagtccct 
actggttctt 
ccatgtctgg 
gcctcctgcc 
gagaacgtgg 
tgagaacata 
ggaagggcgt 
cagaaaggtg 
agcaggggaa 
ggcctcctca 
tatagaattc 
gttttcctgg 
cacgaagtaa 
caggaggctg 
ccctgtggga 
gagccggccc 
ccttttctcc 
ggagttcttc 
caaggcctcc 
tgcaccggga 
ctcttgccct 
ttcctgcctt 
cttcctgggg 
ggtcacagcc 
ccagccttcc 
tgaccagtgg 
agctgctggg 
tgctggattt 

ggggtgtagc 

aagagctgtg 
taaaatgagc 
acagatgggg 
cagagcgtcc 
tgaccactgc 
caggccttcg 
ggtcaacttt 
ctcctgaaaa 
gtggcccaga 
gacttccatg 
ccacttgagg 
ccccagcaaa 
ccctgtggag 
ggttgtccat 
aagacagagc 
gaggggggta 

tcaacacgcg 
ggagctatca 
ggagtcccct 
aaggccgcca 
cagcagatgc 



ggtccctgcc 
agactcagca 
taacaagccg 
agcggtcctc 
tgcaccctct 
gccccccaac 
ctccagctca 
cttcgagacc 
tgccagggag 
gctggggagt 
caaagatgtt 
tggcacagcc 
cccctcccct 
gacagagggg 
gagcctctgc 
aaatttgact 
gcctccagca 
acatggacgc 
gcttggacgg 
acaccccact 
aggagcttcc 
aaaaatgcac 
cttcgcaagt 
agcatgagct 
aaaaggccat 
cctggtaccc 
ttggagggga 
gcaggggccc 
caggccagac 
cctcctgggg 
aatgggggcc 
tctctggggg 
ggacagtagg 
caatggttta 
tgaccatgga 
atatcaaggc 
catggttatt 
tgcagaagaa 
cccccccata 
tggcagggcc 
gcaaaaggtt 
gcttcaggac 
tcatccaggg 
tgtgcgggct 
ggcaggggag 
actagaagct 
ggcggggaga 

ccaggaccca 
acagggtgag 
gaggacatcc 
tgggctcagc 
aggaggagga 
gacctttgtc 
gtgcagaggg 
ctttagggtg 
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agagatgtac 
gatgttgcct 
agggttttgg 
tccggacgga 
ccagagccaa 
tccactggcc 
gagcctgcag 
cccccctcca 
ctcagctcct 
caaacatcaa 
aggcagatga 
agggagggct 
gcacttaact 
aagaggtagg 
gtccacatgg 
aggggcagag 
gaggg a gcca 
ccctgacccc 
tggggccaga 
caccgcccag 
tgttcccggc 
ttgttccagc 
gtctttccag 
ccaaacaagg 
gaggggtgag 
ctgggggact 
gtggaggagc 
gaatccaggc 

gggggaggga 

gtccgggccc 
ccccccaaag 
ctccgcgcgc 
cctgcccctc 
ctggccagag 
gcctccagcc 
cgcccaggct 
ccttcccgcc 
cgctcggagc 
cccggaccgc 
tcggatgacc 
ggaggagaga 
ggcaaaggcc 

tggggccgat 

gttgagtcgg 
ggctgccgct 
cctttcttcc 
ctgggagacc 
ctcggtgcaa 
ctccctctgt 
tccctggttc 
gctaccaccc 
aggatggtct 
attacaggca 
tttttttttt 
tgatctgggc 



gaagagagga 
tgcgaggtcc 
taccctggcc 
gccttggtgc 
gcattactca 
ccaaccactc 
cagccctcct 
cgcagccctg 
gctaggccac 
agacccttcc 
gggccgagag 
ctgagtggat 
gagagtagag 
gtgggcccgg 
ccttgcgcgc 
ctggagctgg 
ggctggtatt 
aggatagggc 
acagaccagg 
tctgcatagc 
caggcacttg 
cccatcttaa 
agcactgcta 
caacaatata 
cagggtggag 
cttgggcatg 

a gg a gcgggt 
a 9 a gggggct 

gccatgccct 
ggggccagcc 
agtccccacg 
tgacccgagc 
agggcgcacc 
cage age agt 
catagctgtg 
ggattgegge 
cccacctccc 
ctggccaggg 
cgcccctcga 
ggcccgagcc 
accgctggga 
gcctctcgct 
ctctgtcccc 
gtcatccacc 
gtcttggggg 
ctggcggtgg 
ctttcggcct 
atttggatga 
cgccaggctg 
aagegattet 
ccagctaact 
caatctcctg 
tgagccaccg 
tttttgagac 
tcactgcaac 



cacttgtgcc 
accttgccca 
cagaaggccc 
tccctggccg 

gagggegect 

tcacaggccc 
tcccccggta 
gcctcagtcc 
cccaactcct 
tccaagaagc 
gagtctgeca 
gtgggggagg 
aaccaagctt 
gcacaggtgt 
acgtgctggc 
agctggagct 
gggcacatag 
tcagagggga 
cccctgtacc 
acagcccagc 
ctgtgcagga 
catctttgtt 
aacagacacc 
aacaacatac 
gccacagcta 
ggggcttcct 
cacacggtgg 
aeggctatgg 
ctgctttgcc 
caagcacact 
ccctgaccta 
tctgagcaga 
ccgcccgcgt 
agtagcagca 
agctcctcgg 
cgacccgtgc 
accgcccgcc 
gccccgacgg 

gggggtggag 

gtccctcccc 
aggttcttgg 
gctgaagttg 
tccttgctac 
tgcactgggt 
ettcattget 
ccagagggca 
cagtaacatc 
gatgttaact 
gagtgcagcg 
cctgcctcag 
tttgtatttt 
atetegtgat 
cgcccggccg 
agagtttctc 
ctctgcctcc 



ccccccatca 
gagctccctc 
ctcctcatct 
ggtgtgccta 
ggaatgttcc 
gccctgcagg 
cccagtccca 
tgctgggctg 
tttttctcct 
cctccttgat 
ccttggaaag 
ggaaggaggg 
tgctgctcct 
ggtgtgggtg 
cacgggcacc 
ccggtgaccg 
acgcccctct 
gggagcagtg 
tgtttggtcc 
cccgcatgcc 
ctggctaatc 
tggaggggtt 
ttctatctgg 
acactgccct 
gttctgeage 
agaggatgga 
ectgeggatg 
gctgggggct 
agcggagtgg 
tgagcgtccc 
ctggccgtat 
acccatcccc 
cctcaccttg 
geagcaaegg 
cctctaggca 
ccggtgcagc 
ctcgctcccg 
cgcgcgccat 
ctgggcggag 
gteggtctea 
aggaccgegg 
gccccaggag 
ccgtcctgcc 
gcccccaagg 
gggcaaaggc 
gaattgeata 
tgtttcatgt 
tttttttttt 
gegegatett 
cctcccgagt 
tagcagagac 
ccacccgcct 
gagatgttaa 
tcttgttacc 
cagattcaag 



tctgagaaaa 

ggggactctg 

cttcaagggg 

a gggggctct 

cctggaatgc 
agccaggccc 
gctcccagaa 
atggctgcct 
ccaccttctc 
tggatgagtg 
gaggctagag 
gaggtctctc 
aggectctaa 
cagtgtggtg 
ctgaccccaa 
ggtgaatggg 
cccaggggtc 
gaccgcctgg 
ccacacagtg 
ccctccctgg 
ctcccacccg 
accccgagga 

agaggecett 

gctgccctgg 
ctgagagcaa 
gccccgctga 
gaagctggtt 

gggggctggg 

cagccgggca 
tgggcaggtc 
ggtgccgggg 
gccaccacca 
aagcaccccg 
ggtcccccga 
gcggctcgca 
ctcaggccgc 
cctcccctcc 

gggggagccg 

gagggaatcc 

ga-gggectet 

cgtggtggga 
cgcgatcttc 
ccgagggtgc 
ataggaaggt 
gatgeagcag 
aaagctgcag 
tttaaacttt 
tttttttttt 
ggctcactgc 
agctgggact 
gaggtttcac 
cggcctctca 
cttttaagca 
cagactggag 
tgattcttct 



acaacagcca 
tcctggtggc 
aggggacget 
aggaggaatc 
tcccagcccc 
caggcaccca 
cagacagcct 
gtggaagtga 
tcccagacta 
aattgecate 
gggccagtgc 
ageccagaga 
gggtttgggg 
tgtgggtgct 
tgagggagag 
ggtggaaccc 
ccatcacctc 
ggccctcccc 
ctgtggaagc 
ttgccctccc 
ettgeagagg 
gaeagctgea 
ctctatctca 
gaggagggac 
agcagggact 
gtcctaaggg 
gtgagagega 
ctgtccccga 
gtgtgggcaa 
ccacggagac 
ccgtgagacc 
ccgcgcctag 
gcgcctggca 
gctctccggg 
actccggctc 
cgccttcgga 
ccgccaaccc 
ggtcgccact 
gtgcggcccc 
actcctgaga 
tgaggcggtg 
cgtggtctcc 
ectggeggag 
tcaggcaacc 
aeggagacaa 
actcccaggc 
tgttttccta 
gagatggagt 
aacctccgac 
acaggcgcgc 
cattttggee 
aagcgctggg 
aatctttttt 
tgcaatggca 
gcctcagcct 
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accgagtagc 
agagatgggg 
gcccgcctcg 
ttttaagtaa 
tcgagcagtg 
gggaaaagtc 
tttcataggc 
ttattaacgt 
gatatcagtg 
tcactctgtc 
tcctgggctc 
ggcaccaggc 
ccaggctggt 
gcactttggg 
ccaacatggt 
gcgcctgtag 
tggaggttgc 
actccgtctc 
tgcctcagct 
gagattatta 
tacacacaat 
ataactacat 
tgcccttcca 
tgtctataga 
actcttctgt 
tttgcatatc 
ccacagtttc 
attggaacta 
atagcaattg 
ctggagaacg 
gagcaccagt 
caccccagag 
ctggcttcag 
gcttactggt 
taatgtcctg 
acacttctcc 
ccccaagctc 
gtcagtgcta 
tcatccttac 
aggaattttc 
ggagatgagg 
ccgggggcgc 
ctccagaggg 
agaagtgctt 
cgcagaacag 
gcacctggtg 
gggccccaaa 
tcccttggca 
gatattgtca 
gtttaaaaac 
gggggctgca 
gggtgtgttc 
atgttacact 
acacaactaa 
tgcagaagca 



tggcattaca 
tttctccatg 
gcctcccaaa 
atctatttgt 
agagaaacct 
tccgtgctct 
ttttagatta 
ttttctgcac 
cgtttagagg 
acccacgcta 
aaggtatcct 
ccagctaaaa 
ctcaaactcc 
aggccgaggc 
gaaaccccat 
ttccagctac 
agcgagccga 
aaaaaataaa 
tctcaaatcg 
cttattatta 
aaaatgcact 
aagcatcact 
gccaaacttc 
taagttttgc 
gtctggcttc 
aatagtttgt 
tctactcacc 
tatttgcaca 
gctcacgtga 
aggaaagcca 
atggaggtgg 
cagcccctca 
tgtggatgtc 
attcttacct 
tcattcagag 
ctgcaacacg 
ctgagctcca 
ataactgact 
acagttgaca 
tgccgaagtc 
ccatcaaacc 
cfcgacagcac 
gcatcaggaa 
ggcatgggtg 
gatgtctgag 
acgacaggct 
gccaacgctt 
aggtgggtgg 
aacagttttc 
aggcaaatca 
actggtacag 
gcttttggaa 
ttaattgaat 
cccctctgag 
ggtgaagact 



ggcattcgcc 
ttggtcaggc 
gcgctggaat 
ttttgagaat 
acattcacat 
tagccctcct 
ttttttcaca 
tttgattttc 
gtccttgttg 
gaggacagtg 
cccacctcag 
tttttaattt 
tggtcttggc 
gggcagatca 
ctctactaaa 
ttgggaggct 
gatcatgcca 
aataaaaata 
ctggaattac 
ctactttgga 
tattttaagt 
ataatgcaga 
cccactccca 
ctgttctaga 
cttccctctt 
tccttcgtat 
agtgcaccat 
gagagagaga 
ttatggaggc 
gtggtgtgat 
ctcgagctca 
gcgacacctc 
aggtacagaa 
cgaactggcc 
gctgtccctt 
tttcagaagg 
tgtggtctgg 
gtagagggga 
cctcccaaac 
tctgagaaag 
aaagctgagg 
ctgggaaaga 
atgtcctggg 
acacccacca 
gga-99 cactc 
gatgcttggg 
catccctgac 
aatttcaggt 
caaagtcatt 
aatctatggt 
agtgtaaggg 
aagtccttga 
gtacaaaaag 
agcctccaat 
cctaagccgg 



accacgcctg 
tggtctcgaa 
tacaggcgtg 
ttggaatgta 
atctcaaaat 
caccaacagg 
ctcaagacaa 
tttttttttt 
ttcttatgat 
gcctgatcat 
cctcctgagt 
ttctgtagac 
caggcgcagt 
ctggaggtca 
aatacaaaaa 
gaggcaggaa 
ttgcactcca 
aaaagaactc 
aggagtgagt 
ttttcaaatc 
ggccagtaag 
cacattccct 
accccagaca 
atttcatata 
tccgatgttt 
tgctgaatag 
agggttattt 
gaggaagaaa 
caaaaagttc 
tcagtttgag 
gaacaagttg 
ttcagtaaag 
ggcagctcga 
aactcctact 
acccaaagcc 
ccctgtagtc 
gaatatgtgt 
cacagtcgaa 
ccagatgagc 
ggtttattta 
aagagggatc 
tggcttctcc 
aaccaggcaa 
tctgtaaaca 
cgactccagc 
tgccccagaa 
agcctggtgc 
tcatagggca 
gtacacactc 
gttcgaagtc 
gggactgtag 
gttgcattta 
tctcaggagg 
ccaagatgga 
atattcccaa 



gctaattttg 
ctcccgacct 
agacaccgca 
gtaatttggt 
caaaaagtac 
aaaccaatat 
tacagacata 
tggtcgctta 
tattatttag 
gcctcattgc 
agctggaact 

agggggtctc 

gtctcatgcc 
ggagttcaag 
ttagccgggc 
aatcgcttga 
gcctgggcaa 
ctgatcttaa 
caccacagct 
aacttcatta 
atgagtttcg 
cactcacaga 
gccactgatc 
aatggaatca 
ttgagattca 
tgttcggtgg 
tccagttagg 
gagggagaga 
ccgaatctgc 
ttcaaaggcc 
gggacaggaa 
caaggctgaa 
ggagctactc 
taaactgcag 
aggttagcat 
gtccacttcc 
gttgctcact 
aagccacatg 
tgtgtccaag 
cattttgagg 
ctaggatgca 
actggcttgt 
accagtgagc 
cgacttctcc 
cttcagagat 
aaggtcatgt 
atttagaggg 
agggtatttt 
tgtgattcta 
aagacagtag 
gatggtctat 
taatgtgtga 
cctcagacca 
catatgtccc 
atccccccag 



tatttttagt 
caggtgatct 
cccagcctac 
tagtgaaagt 
agaaagcata 
gattagtttc 
tttttttctc 
atacacctta 
agacagggtc 
agccttgaaa 
acaggcacac 
actttgtttc 
tgtaatccca 
accagtctgg 
atggtggtga 
actcagaagg 
caagagcgaa 
gtgatcctcc 
gtccagctac 
aggtataatt 
ataagtgtat 
aagagccctg 
tgttgttctc 
tgcggcatgc 
tttacactat 
tttgagggaa 
ggctcttata 
gatatttatt 
catctgcaag 
tgagaaccag 
agcagagcag 
cacagagggg 
tggcgttctt 
gccatggctt 
cccctgactg 
ctgtctctct 
tcctagcaca 
gggatcagag 
tgcaggtcag 
ttgcagggga 
ccgagcagct 

tggcgtcacc 

attaaccctt 
caaggagtga 
cgccagggtg 
gtgtgaatgg 
gaactttttg 
agctttaata 
cttatgtaaa 
ttacccttgt 
ttcttgatct 
acttttctgt 
ctggaagcgg 
cttggaagta 
tagccgcagc 



108480 
108540 
108600 
108660 
108720 
108780 
108840 
108900 
108960 
109020 
109080 
109140 
109200 
109260 
109320 
109380 
109440 
109500 
109560 
109620 
109680 
109740 
109800 
109860 
109920 
109980 
110040 
110100 
110160 
110220 
110280 
110340 
110400 
110460 
110520 
110580 
110640 
110700 
110760 
110820 
110880 
110940 
111000 
111060 
111120 
111180 
111240 
111300 
111360 
111420 
111480 
111540 
111600 
111660 
111720 
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ttcagcagct 
tggctgcatt 
cctccccact 
gtgccatgca 
ggagcagaaa 
cttgcagccc 
actcctctgt 
ataggatgga 
ggagcatgaa 
cacccggccc 
tctgtttttc 
aggctgcagg 
aggggcacgt 
aggaaaagga 
atgctgacaa 
tccctggggg 
tctggcacaa 
aacaccaaac 
gcccacaacc 
aacatggcct 
tgggagtcct 
tcccagtctc 
agtggaaaag 
cctccactct 
tcacaatcaa 
caacatcata 
atgttgaagt 
acagaatgcc 
gtgactttcg 
aagggaggag 
gggtcaacac 
gggccaggtc 
aacactgcct 
ccggagggca 
tgtggctgca 
tgggagccag 
ggagtgtgca 
gaagggacac 
gcttgggagt 
ggaaatggag 
cagctgacac 
gctcctggtt 
atgacagcag 
gtgtgtgttg 
cgcccatgct 
cctctcccct 
accacaagag 
cttgctttag 
tagggacaat 
gtgaacttaa 
tgccttaccc 
cccccaggcc 
gctggaactg 
gctgcaggcc 
ttgctttatc 



gcttatggtc 
tgacttgctc 
gtcagcaatg 
cggggctggt 
gggacacaaa 
ctggccccct 
tctagcccag 
gttttattgg 
aatgaatgag 
ccaactgcct 
aacccaaatc 
gggagtcact 
cattccttct 
caggagctca 
catctgtgtc 

ta gggggggt 

ggatgtctgc 
cttttttatt 
agaatggcca 
gctggtggca 
ttggaagtcc 
ccttgtagct 
tcctgtgcag 
tccttcttct 
aggcatcatt 
aaccatcgta 
cactgtattt 
ttggtgagcc 
aggagaagga 
atccagagaa 
cctcctgggc 
ataaaaagtc 
cattcacatt 
ggcaacacca 
tcaggatcaa 
ggcaggcccc 
tggctggaag 
accacagggg 
ggggaagagg 
gggatgctgc 
agggattccc 
ccctgcacac 
tctccagatg 
aatggatacc 
ctgcttacaa 
ccccacctcc 
cgtgggaacc 
cacagccagg 
tctagaaggg 
taggaggtcc 
caatcttctg 
caacagcagg 
atgcaggcgg 
cccagtgctt 
gctgaagctc 



ctccctacac 
tctccctgtc 
tagacaagac 
cattgtgggt 
tcacttcagc 
cctacagatg 
ctgtttgggt 
gaggacattc 
cctggaatga 
cccagcgtgt 
ctagagcaga 
agcccactca 

tggtaactca 

cacagtcatg 
tgccctccac 
ggtggtggga 
ttaggcaact 
tcctcagtct 
ggggcacacc 
ttcccaacaa 
aattccaaat 
tgcagtagcc 
cttctgagtt 
ggctggaggg 
ttagggatag 
acagccagac 
ggtctcttgt 
tctgaggatg 
tgtgaaaggg 
gagagagcga 
agacctctca 
agatgtcaca 
cataggctgg 
ctggtcagcc 
tgagcttccc 
caggccccca 
tgtgacttgg 
aggagaaact 
cctcagaaca 
caagtccaga 
tctgccccga 
tctccccaac 
ccagttggca 
tgcgtggtct 
acacgcctcc 
tgcccccatt 
tcatggagag 
gcagagagga 
atttccaatt 
taaaggacct 
gcccactccc 
agcaggaggc 
tgcagggctg 
ccttccaagg 
tccacacaag 



cctctcttcc 
ccacctctgg 
aaacgcttag 
ggcaaatgtg 
gtaaggtaat 
gtgtctaaga 
tgcaagaaag 
tggacgggct 
gggagggtgc 
tagctcctgt 
gggctcttcg 
agagctctct 
ttctctttaa 
aacagagatg 
tcttgtaagt 
atgagggcat 
ccactgcctg 
tagtgtgaga 
ctggccctca 
tgtcaaaagt 
gtcctctgga 
atgtgacaat 
gcgtccacca 
catgtggaca 
agaaaaggaa 
tagcgtggga 
tagagcaggc 
caggatgctg 
cagggcttcc 
gatcagctca 
ccacccattt 
gagctgtttt 
agtcatcaca 
caggggtggg 
acagagaaaa 
gcctcaggtg 
agccctggcc 
gactttcact 
aaggtggggg 
aagtctcccc 
gcagaacagg 
ctccccttgg 
ctgtgggtta 
ctttctgaaa 
ttctgcaagg 
cctctcccgc 
tgggtgcttc 
ggaaagaagg 
tggaggggca 
gggggaattt 

agctccagac 
ccaccagcag 
gaacacagag 
ggacccaccc 
ggtgccccga 



ccagacagcc 
atttagtcca 
ttcacgtgcc 
agcaacacac 
ttgtgataaa 
ataaacccca 
agactcactc 
cccagcaaga 
ggcctcggct 
ggcaactccc 
ctcctcacac 
cctgttggtc 
cagcccgaca 
catctggagc 
gccagtaagg 
ctcttctttc 
gcatgctctc 
gtgatcacct 
gtagatgggc 
ctcagggatg 
ctcaaaataa 
actccagcca 
tgggcatggg 
aagcggggcc 
gatagaagga 
gagaaaaata 
tgactgatac 
caatccaacc 
tttgagggag 
gtgctcttca 
ttgggggggc 
cgtagaggcg 
gagtctggcc 
gcacaggttg 
ccctggtcac 
gggtcccaag 
taccaggtga 
tctgcccagg 
agtcagaaca 
tagtcctcta 
gctctcctat 
tcccaggcac 
gctgggtatt 
agccatctcc 
cc cggagtgg 
tccccacagg 
ctctgtcttc 
aaactacagc 
gtctgggaag 
ggggagaaga 
ccacctccag 
tcccaggacc 
cgggactcag 
aagatgacac 
gcaatcagtt 



cccaaacatc 
tgttctccac 
cacctactgc 
gaagcctcaa 
tgtcatgtaa 
ctaacatgtg 
cagttgcgtc 
gtctggcaat 
acacaaagtt 
cacttctctc 
catctccacc 
cctgattcgc 
cagggtctcc 
aggaataagg 
tagtacccag 

a ggggccaaa 

tccctaggaa 
cttccaggaa 
actgagacaa 
agctcacagt 
aaactacatt 
atgggaagtg 
tgtgcttcta 
atggtgagcc 
tcctgagccc 
aaattctatc 
tctgttgaat 
aaagtccagg 
aaatggagag 
gtgtcctcca 
aagaggcttg 
ggcaggactc 
actttctcct 
aggtctcaca 

aggaggcctc 
aggattggtg 
atcagctgca 
tgacgggcag 
tacgtgtctt 
agccagccgc 
ctcctgaggg 
catcctctaa 
atatctgccc 
ac.cgaattct 
actcagagag 
ctggctccag 
ctccctagcc 
aaggaggatt 
taggcagcct 
cgaggtgggc 
gtgtagcagg 
cagagcagct 
agcagccaca 
cttctaactt 
tagagtcgac 



111780 
111840 
111900 
111960 
112020 
112080 
112140 
112200 
112260 
112320 
112380 
112440 
112500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
113700 
113760 
113820 
113880 
113940 
114000 
114060 
114120 
114180 
114240 
114300 
114360 
114420 
114480 
114540 
114600 
114660 
114720 
114780 
114840 
114900 
114960 
115020 
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aggcaaatca 
ggcttttgtg 
taggtggagg 
tcgctgggcc 
atgtgtggct 
tggagagtca 
ccaccctcag 
aagggagtgt 
agtgactttg 
tgactcagtt 
aagaggcctt 
agcctgctgg 
tgggcaactg 
cacagaattg 
agcaatagct 
gatgagggct 
cttcttttct 
ggctggagtg 
agtctcctgc 
taattcttgt 
tcctgacctc 
gcccctgcac 
cccattttac 
aatatccgtg 
gtgagatgtg 
tgactggagc 
cttgagcacg 
ggtcattcca 
ggcagctgag 
gtgcacctgc 
gcctctgggt 
ggtatagggc 
cagattaggc 
ccagcccctc 
ccactcacct 
tgtgcaaaca 
tcgggccacc 
cagcaccttg 
ctgtatcaca 

aggggcatct 

tcattgccta 
agaagtgaca 
tgtctctttc 
agtcctttgc 
gccctggcca 
ggtgccctgg 
caggaatgag 
atagagtgtg 
aggcacggca 
agccgggccc 
gccttaccct 
gagcagctgt 
ttggggacag 
tgggcgggcg 
tccaggaggt 



tgctgcagca 
cagagtggcc 
tagaggcaga 
tcagcgcctc 
ctgcaggagc 
ggctggcgag 
gctccacgaa 
ggtgattgca 
ctgctcctcc 
tgaccaattg 
gtagcttctg 
gcacatgtgg 
cagccacatg 
agcagataac 
gactgataca 
tatcgtgtgc 
tttctttttt 
cgttgtggtg 
ctcagcctcc 
atttttagta 
aagtgatccg 
cagccattta 
aaggtcaccc 
aagccttcac 
gggagactaa 
agaggcagca 
agccacgtgg 
gaaccatgca 
gttcagggca 

gggcggagga 

gggacctctg 
ttfcccaaggt 
ttctcaaggc 
tcgtggactt 
tctacctgca 
taggtgtcat 
tggacatggc 
ccatcatgag 
gccatggatc 
gtgggcaggc 
gctgcctggg 
ccttcccagg 
tctagcacct 
ccactcccac 
aggaccctct 
gcatcctggg 
agtgaggcag 
ggtgcgtggg 
gcctccggag 
agccggacac 
ccagccgcag 
agaaaccctc 
aggctgctgt 

ggcggctgtc 

aggtcaggcg 



ggagggatac 
tgggagatgg 
gcccaggaca 
gatgtccacc 
accctggggc 
cggctcgctg 
gaccatcatg 
gaaaatgacc 
cattaagagg 
aagaaagaag 
ctctacttgt 
cccaactggc 
aggtgtgaac 
atgacttttg 
gtgtaggcct 
cagacacact 
tttttttttt 
atcttggctc 
tgaatagctg 
gagacggagt 
cccacctcag 
tttcttatac 
aaccagacag 
tcttggcaga 
ggcggaggag 
gccccagtgg 
gtgaaggcga 
aaggtggagt 
gcgccctcag 
tagagagatg 
aggggcctct 
ggaaccatgc 
agagctttct 
ggacctgaga 
accgcccgat 
cacctgcagg 
ctgtccctga 
atagggccag 
tggccctgga 
agggcacaga 
gttggtcagg 
agtcctgtgg 
atgtcctctt 
cagagcccag 
gctcacagag 
cctggcaaga 
ctggaccctc 
cagcagggtc 
ccaggatcac 
cacagtgggc 
ctccagggac 
atccctgggc 
cgaggaggcc 
cacagtgcac 
caggttgcgg 



agggaaggaa 
agccccctcc 
tttgaggcag 
ctcagggcat 
tgactggagg 
tccactcggc 

gtgggcgtct 

acaaatttct 
cagaacctat 
tgatgttgtg 
tggaacacaa 
agccaggagc 
aacagaagaa 
tttgaagcag 
tggtgaaggc 
cagcacgctt 
ttttgagaca 
actgcaacct 
ggattacagg 
ttcgccatat 
cctctcaaag 
aacttgcagg 
cgggagagcc 
cagaagagga 
ggggttcaca 
ggagctcagc 
gagtgggcac 
tgcccacagg 
gcacgtccag 
attggggatc 
gcacattgtg 
cccctccagt 
cagaccagac 
gcatcagagc 
ggtgctgatt 
cacatcttgc 
tgccaagctg 
ctcagcaggt 
gtcccggaag 
tggggacctt 
ctgaggcctc 
gcagagtgct 
tccccaactg 
aagcaccctc 
gacttgcagg 
gtaggcgcct 
ctgcagccaa 
cgcacaggtc 
ccgcacacct 
ttccatccca 
gtgttggcct 
tgtggccctc 
aagaggcgac 
agtaccaggg 
ggcgcgtctg 



gcctttgggc 
cctcacctct 
aacagatgta 

tgggggaagc 

ccaccagtcg 
aatccaggat 
tgggagggtc 
ccctccctgt 
ttcctcactc 
caacttcaga 
caatgtggga 
gtgcagtcat 
ctccccagct 
taagttttgt 
tggagtggga 
tatttctttt 
gagtctcagt 
ccacctccca 
tgtgtgcctt 
tgaccaggct 
tgctgggatt 
atgaggcgaa 
aggattccaa 
ggaagaactg 
ctcacagagc 
caggcagtgg 
aggctccgcg 
cccagggcca 
gccaggctct 
tgtaggcctt 
ggaaggtctc 
gggagctgtg 
agcccgctct 
ccctcctcct 
gagcccagca 
acctgcagcc 
tggaccccgc 
ggctcactgt 
gaggacagga 
gcttaggcac 
tggaccaatg 
c caggc aggg 
cccattcctg 
caccaccctc 
gcagcaggac 
gcatgagccc 
cgaccgttgt 
actgtgatgg 
gtgccaagga 
gtgggcttgg 
ggcccagagg 
gcagctccag 
cggcgtggct 
ccagctgccc 
cagggcatga 



ctaagagctg 
gaccccaaag 
ttccccttgg 
caggacatgg 
actgccaagg 
gccccggagg 
tgtggggagg 
acccatgcac 
cttgaatctg 
gccagacctc 
agcccaggcc 
ctgagaccac 
gagcccagcc 
tgtggtcaat 
gaccaagact 
ggttttttct 
ctghcaccca 
ggctcaagtg 
catgccgtgc 
agtcttgaac 
acaggcataa 
tgatgtattc 
cccaggactc 

aggtgggcgt 

acacggagca 
tacatcccag 
tgcagccgcc 
cccaggaggc 
gccaccacgc 

gggggctgga 

agtctgactt 
ccccctccac 

agtgggagct 

cccctctctt 
agttttgggc 
gtagggcgtt 
tgctcgtggc 
ccacagtgca 
cagcgtcctg 
cctgtactgc 
gccacttcct 
cttataggac 
ggcccactgc 
ctcctgggca 

gggagctgcg 

gcgtggccac 
ggtaccaagt 
gggcaccttc 
ggccaggatc 
cctaggccct 
gctgcgggca 
gcgcagggtg 
gagggccagc 
gccatggctc 
gaggcttaca 



115080 

115140 

115200 

115260 

115320 

115380 

115440 

115500 

115560 

115620 

115680 

115740 

115800. 

115860 

115920 

115980 

116040 

116100 

116160 

116220 

116280 

116340 

116400 

116460 

116520 

116580 

116640 

116700 

116760 

116820 

116880- 

116940 

117000 

117060 

117120 

117180 

117240 

117300 

117360 

117420 

117480 

117540 

117600 

117660 

117720 

117780 

117840 

117900 

117960 

118020 

118080 

118140 

118200 

118260 

118320 
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c*ggagtcaca 
ggccctggct 
ccagacagga 
gctactgggt 

tgtgggcttc 

tccaaggtga 
taggaggtgg 
tgctgcccat 
caggtcaggt 
acagctgggg 
gccacctgtc 
agccttgggt 
gctcctgagt 
gaccctggct 
ctgaaattcc 
aggttctcct 
actattcccg 
acacacacac 
agtcagctga 
catcctccag 
gcagcctggg 
gatccccgtg 
ggcccagtcg 
ctacacaggg 
catgtgacac 
aagtccttac 
ggggcctggg 
agcgtggccg 
ccctctggga 
tcaggtgatg 
gctctccacc 
cccaacccca 
ctatgccttg 
tcccaccacc 
gccttatggg 
gtgttagctc 
atcatcccag 
ggaagggaat 
agactccagg 
cccctgggtg 
aatgctaagc 
ggaaaagctc 
gaaaagatac 
cgtggtggaa 
tttcctggtt 
atccaggcta 
cgattcgcct 
gctaattttt 
gtgatctgcc 
ggcctgaggg 
acacacacac 
ataagaatac 
taatataaat 
aacaagttat 
cctagtggct 



ggcggcagcc 
ccccacccct 
ctcaccccag 
accgagttcc 
tgtcattcct 
taggtctgga 
catccctgac 
cctggtacca 
tcaccagctg 
agagggaggg 
tccatgtggg 
ggcccaccta 
ggtcccaagt 
gggggcaaat 
ctgtttagtt 
aagaaactca 
tgcccccagg 
acacacacac 
ggccgaggcc 
catagccccg 
agagctgcct 
gagcagccgc 
tccaggacct 
tggggagtgg 
agcccaggac 
aggacacgtg 
cttggcaatg 
aggtcgactc 
ctcctgtggg 
gtgacacctg 
acacctctca 
tgtgctggag 
gcccctcctg 
tggtgggatg 
cactggactt 
ctgacaaatg 
ggccattgtc 
gttagcagct 
ccacaagagc 
tgaagcagga 
aagaacacag 
ctaaaactca 
acagaaccaa 
gaaacactca 
tatttgttta 
gagtgaagta 
gccttagcct 
gtatttttag 
cgcctcagcc 
aggattttta 
acacacacac 
acataatgta 
atacatatgt 
ttcagaaagt 
gctgcagtta 



tgcaggaggc 
atggctccca 
cgccccctac 
ccactggaca 
gtgagtccca 
gaggcggggt 
tgtgacgttg 
tgtgtaggtg 
cccctcccgc 
cacaggacac 
tgagctactc 
taaatgtggc 
agagttccac 

gggga-gacca 

ttatccactg 
agtctggagt 
cttctagaac 
acacacacac 
tggcccaggg 
tggatctcca 
tcgggtcccc 
agctgggccg 
gtgtccatca 
tcaggaccca 
tgggggactg 
gaggctgatg 
ataggcccca 
ctggaggggc 
tacctggcag 
gacccaggct 
cagtctggga 
ccgcagcccc 
ctccccactc 
gttttactgt 
gtgtgacctg 
ttggttcaac 
ctaactagga 
atgttatctt 
acaggacctg 
gaaccccagg 
accatgcctg 
aaccttcaat 
ggctcaaagt 
tgttcctttg 

gggagaattt 

caattttggc 
cccgagtaac 
tagagtgttg 
tcccaaagtg 
tagagactat 
acgtatgtgt 
tatttgtatg 
atgagaatgt 
gctcccacca 
ttacaggtga 



cagtttgcag 
accacccagt 
tcttaatgct 
cctgtcgggt 
ccctgcatcc 
gcccgaccag 
ggcaggggga 
agctgggccg 
acggtagccc 
cagtgagggt 
ctactgctgg 
ttagaactca 
agagtcctgg 
cagacctccc 
ggcatacttg 
ccaccatcgt 
cctgtcccac 
acacacacac 
tgttggaggc 
gacgcagtgt 
ccacaccttg 
gagggtcact 
gggtagtgag 
gccagggggg 
cattccttcc 
ggtgcagcta 
gcttgtgtca 
tggccatccc 
ctcaggacca 
cc tgcagggg 
ccatgtgcgt 
ttccagacca 
tcctgcacag 
gtcagtcata 
taaccccagt 
ccataaagta 
tctggactca 
gagcgctctc 
gggaccaggc 
ttgcagggag 
agaccacagc 
ttcttcctgc 
aaatgtccca 
accttcgcat 
ttttttttta 
tcactgcaac 
tgggattaca 
gccaggctgg 
ctgggattac 
atatatatac 
atataaataa 
tgtatatact 
ttactatgta 
agcaagccac 
aaatatctag 



gtggcattca 
ctgaggcact 
ccttgcccag 
tccggccatg 
agccagactc 

gggggcccac 

tggagtgggc 
ggtgagtggt 
cgggccacac 
cttgaggctg 
ggagcccctt 
agcttgggaa 
aaagccctga 
ctcccctgct 
taaaatttgg 
taggaagggt 
ctcctctcct 
acactgacct 
ctcacagata 
gttgggggcc 
tagggtggag 
gtccacacgg 
tgtgacgtgg 
tccctcatcc 
cccacacctg 
ggctcgtggt 
aagttatggc 
gataccaacg 
cagcctggcc 
aaaaccaaga 
gtccacctag 
ccgccagggc 
gctccatcct 
tctcgcctac 
tccaggcact 
ctgaaaaggg 
gcatgggtga 
gccaggccag 
aaagggcagc 
tgagatggag 
tagcagggtc 
gaaataggta 
aaattggtgc 
tatgggcatt 
agacagagtc 
ctccatctcc 
aatgcccacc 
tcttgaactc 
aggcatgagc 
acacacacac 
agaatatgaa 
catacataaa 
acaaaattat 
tttgtgaatt 
ctggaggcaa 



aactcaggag 
gctcctctgc 
tgctccacta 
ccgtgatccc 
acagaggacg 
accgcagcgg 
atccaggcgc 
ccacacaagg 
ctgcacattc 
tgtacagcag 
ggcctttggg 
tagcccactg 
gcccctccct 
gcactttctc 
tgtttataaa 
gggaatgggg 
cttttaaagg 
tgagcgtcga 
tagacaccct 
acagccacat 
gccacaaggc 
cacaggagga 
cgtggggcat 
agggctctgt 
gatgggacaa 
ggctgagcct 
tgcaaagcgg 
atatgaggtc 
ctcttggagc 
gcaggtgagg 
gactacacaa 
ccactcagcc 
ggcatttacc 
caaagcgtaa 
gtgcctgcat 
agggatttag 
ccgagggctt 
ccctgcatcc 
tcctcagtgg 
ggacagctgg 
acgcaactgg 
tgctaagtaa 
tgtcggtcac 
atctttcctt 
tcactctgtc 
caggttcaaa 
accacaccta 
ctgacctcaa 
catggtgccc 
acacacacac 
tatattatgc 
catatatgca 
atgatcgcaa 
gccattgcct 
agggaggcct 



118380 
118440 
118500 
118560 
118620 
118680 
118740 
118800 
118860 
118920 
118980 
119040 
119100 
119160 
119220 
119280 
119340 
119400 
119460 
119520 
119580 
119640 
119700 
119760 
119820 
119880 
119940 
120000 
120060 
120120 
120180 
120240 
120300 
120360 
120420 
120480 
120540 
120600 
120660 
120720 
120780 
120840 
120900 
120960 
121020 
121080 
121140 
121200 
121260 
121320 
121380 
121440 
121500 
121560 
121620 
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tctgctggct 
attcacagca 
tatcttcaga 
caagaacatt 
cctctaactt 

gggga-cccag 

ggcggctgtc 
ctgacagctt 
ctagctcacc 
taagccatgc 
ctccaccctc 
acgggagtgg 
tcagttctgg 
ccatttcgga 
gtcaagttgg 
gctggagagc 
gccgcccagc 
aaaagcaagc 
ctgggcagag 
gggagtcctt 
gaaggagcag 
cacagagtgg 
cctgccccca 
tggaggcgtt 
aagggttgtg 
aacagccccc 
ccttgtggag 
ctcgccgtcc 
gtcgaggggg 
cccacaagcc 
ccaagcagct 
caagctaggc 
aagaaggagc 
cataagtccc 
tctgctccca 
agcttatctg 
ttcctgaagg 
gatctcactc 
gtggctccgg 
gagcctcccc 
ttggtgtcta 
tgtcccaggg 
ctgggacaga 
ggccccagcc 
tgtgacccag 
gaggaggtgg 
agagcacagt 
ggcagtggta 
gaccctcgtg 
ttaggccgct 
tgatgaggag 
ccccacatgc 
ggtagctctg 
attggtagta 
caacacctgg 



ggctggaaac 
cccagcccaa 
agactgacac 
tgtggcctca 
cagggagttg 
gctggtggcc 
cacagtgcag 
tggacggtcc 
acattacaac 
tctttcctcc 
caagtcccca 
agagctgggc 
ccacgggcag 
accaggagaa 
cttctgtgcc 
aggcggcaca 
ctggaaggag 
tagctcacag 
agggttgcag 
cctcagagac 
ctccgctctt 
ttttgcctga 
ggccacccat 
gcccagggca 
aatctccaca 
acaggtgctg 
cagctgcagc 
agccccggcc 
gtctgcaggg 
tggctaggct 
gtgcagactg 
tcatgctccc 
accattttct 
atctgctgct 
ccctgcaccc 
tgatctccaa 
cttctggaga 
agacaccaga 
cacacatgtg 
gcttggcgga 
tgggagccca 
ctcacccatg 
ggagcccaca 
tcatttccta 
ccctcccgat 
ggtggctggt 
gagaacagct 
tgagccggcg 
aagcagggct 
tctgcagctc 
acgggcggct 
tgccctatat 
ggctttgtgg 
gatggctgca 
cttggctggg 



tgcccttacc 
agtctgacca 
tcacctcgga 
cagcggtagc 
gcctcagcct 
aggaggtgct 
atgaacagag 
gggggatctg 
tgccacacta 
ccaaacccca 
ggctgcccat 
accagcctca 
cagtgtcacg 
gttagcaggg 
ctcctgaagt 
gcttactgac 
caggaaggag 
ggcagcctag 
tacagggaag 
cagctgtgtt 
cctgaatgtg 
ttagatcctc 
tccctgcctg 
ttgctggcct 
cgcagcaagt 
cagccacccc 
ctggcggggg 
ccagcggcat 
aggaagaaca 
cccagaatac 
tgcactgcac 
caagctaagc 
agctcagaca 
gagcagggtg 
aatgccttac 
gccagctcct 
cagttggcca 
aaactttgca 
tcaggggaag 
tgcagagggg 
gcacagcctg 
aggacaaagc 
gggctggcat 
aacttcagcc 
atagaccttg 
gggtcccggc 
ggcgaagagg 
tcagtgctgg 
ccattcaggt 
agtgtcactg 
gcggggagag 
agaaagcctt 
tgcatcagga 
gcatggttga 
catggtagct 



aggactgaaa 
gagtagtccc 
cctggaagaa 
tgccagagtc 
tagcctggaa 
ccccatggaa 
ccatgtggcc 
caggaacaga 
tgtcccccac 
gcccggcccc 
gccagtcacc 
gtcaggatgc 
gtctccaggg 
ctgccagcag 
gtgtgtgatg 
cacccccggc 
gccccctggg 
gagaagtggg 
gaggacaagg 
gcctatcccc 
gaggagggca 
ctcggagcag 
actcaccctg 
cacagaggta 
tgggggcttt 
ctgatggcag 
ggtcgctgtc 
caaggtccag, 
tgggcactca 
actggacaaa 
aagggcagcc 
tgtgccctga 
aaggtgctgt 
cccacctagc 
cacggagcct 
gtgatactag 
aaagagagca 
gaggcagctg 
cctctggaat 
ctgtctgtcc 
tggggcatgt 
atggatgtgg 
tggagggtaa 
ccacgtggaa 
gggcagggca 
aggaggctgc 
ggccactggc 
aggccgcggg 
accaggagaa 
cctgtccttc 
gaagaggctg 
ccagggttct 
gggtttgcce 
agattcagag 
catgcctgta 



aagacatatt 
cagggcatca 
gagtgaggtg 
cccaaggccc 
ccgaccatgg 
caaggccagc 
ctggcccatg 
gggagctgag 
ctcctgccac 
tgcttctccc 
cacagaggac 
ggcaggcgta 
gaccctgggc 
cttcccggct 
gtgcaatggc 
ccccaggagc 
gtgcccacag 
ccctggggac 
cagcccaact 
ggcctgaaca 
gcgaagggcc 
aacagtccag 
gccattgaag 
caagccctcc 
ggtgaccttc 
ggaagtggcc 
cacacggcac 
cctggtggtg 
tcccacggat 
ggcagatccc 
agaccagggt 
tgcctactga 
atgggctggc 
aatccctggg 
cccctcaaga 
cgaatgcata 
atcagccaag 
ccctctgaca 
caggcccttg 
ctgtgctgag 
gcacaaatag 
tgcccatgct 
agacatgggg 
agacccactg 
ggacagggct 
aacaggtggt 
actgtggccg 
gagcaggagg 
gcgggcatca 
caccacctcg 
ggaagggtcc 
cctttcccca 
tttaatgcca 
ccagagccca 
atcttagcac 



tctgcacaaa 
gagtctacca 
ttggatgatc 
agttctcgga 
gatgggacct 
aaggccaggg 
tctaggaggg 
gccacccagc 
acacacagcc 
cagaccaccc 
actcaggagc 
aagggcagca 
ccacagcacc 
cacgttgcaa 
caggacagtg 
caggggtcca 
ggttggggag 
tgtgggggcc 
gtcccagctg 
gaga t cat gg 
cccactgctg 
agctccagcc 
gtggctgagg 
tcttccagca 
atgcgtgggg 
acaacacggt 
aaaaggaggc 
aatgttggtt 
gctccagggc 
gaaggctgcc 
gggtggaact 
gtcttcccag 
tgccaccttg 
ggtcatgctg 
tctgccctgc 
gaatggcctc 
gcccccatgt 
gttccgggac 
gtcttacacg 
gggctttgcc 
aggagtgtgg 
gagcttgagg 
aggtagagaa 
tggccaggtc 

ggggggccta 

ggctgctcac 
tcccgggccc 
ctgctgccgg 
ggtgtggggc 
gctgccgggc 
ctcctctcaa 
cactactgta 
aatcagatct 
ggcttatgtc 
tttgggagac 



121680 
121740 
121800 
121860 
121920 
121980 
122040 
122100 
122160 
122220 
122280 
122340 
122400 
122460 
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122580 
122640 
122700 
122760 
122820 
122880 
122940 
123000 
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123120 
123180 
123240 
123300 
123360 
123420 
123480 
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123600 
123660 
123720 
123780 
123840 
123900 
123960 
124020 
124080 
124140 
124200 
124260 
124320 
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124500 
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124740 
124800 
124860 
124920 
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tgaggcagaa 
ctccatctca 
agtgcagtgg 
ctgcctcagc 
tacatgtagt 
ctggtgatcc 
gccagctttt 
ggctggagtg 
attatcctgc 
taatttttgt 
tcctgacctc 
gccaccatgc 
tagtctcagc 
tgcagtgagc 
tcaaaaaaca 
gaaggacagt 
tttttttttt 
tatcttggcc 
ccaggtagct 
agagatggga 
ccacctcggc 
accctacttt 
tttcataatt 
ggtgcaaaag 
tctgccgttg 
cctgcggttg 
ctccctccac 

ggaggttgca 

ctccaggtct 
gctgtgatca 
tgtggccagg 
ctccaggaag 
cctgtcaccc 
ccgctttgtc 
tgaatacact 
caaggccacc 
agacctggtg 
acaagctgac 
cagagggcat 
aacagcaggc 
cagtctcaga 
attgtattct 
atgtgaatat 
acctgtaatc 
aagaccatcc 
gcaggatggt 
cttgagcccc 
ggcaacacag 
cctgatcatg 
aagaggcctc 
tatgcatatg 
tgtgcataca 
tgtatgtgcg 
gttaataaat 
ttttcataag 



ggatcgtttg 
aacaattttt 
tgtgatcctg 
ctcccaagta 
agagataggg 
acccgcctca 
tttttttttt 
cagtggcgca 
ctcagcctcc 
atttttagtg 
aggtgatctg 
ctggccacaa 
tatttgggag 
catgaacata 
aaaagaaaaa 
gcacataccg 
ttttgagaca 
cactacaact 
gggactacag 
tttcaccatg 
ctcccaaagt 
tctatttggc 
aaaaaacaat 
gaatgatgtg 
ggtcactgca 
agtccactag 
tctccctggc 
tttccaaggg 
cggatctcca 
ctgtccccgg 
ggctcactga 
gctgtcagga 
tcagacaagg 
ccacactgcc 
tttgtccttg 
ctgcaagtgg 
agggacagca 
acagagtgcc 
ggtctcaggc 
ctgcactctg 
ctaacggaga 
ggaccagaaa 
agattatagc 
ccagtacttt 
tggccaacat 

gggtgcctgt 

agaggcggac 
tgagactcca 
atcattgcat 
atgtctgcaa 
tatatagagt 
cacaagtgca 
tgtgtgtgtc 
gggaaatcta 
tttaaacttc 



aggccaggag 
ttttttttga 
gctcactgca 
gctgggatta 
tttcaccatg 
gcctcccaaa 
tttttttttt 
atctgagctc 
ctagtagctg 
gagatggggt 
cctgcctcgg 
tttttttttt 
gctgagatgg 
ccattgcact 
cctggcttga 
aaagttggaa 
gagtctcact 
tccacctcct 
gaactcacca 
ttggccagga 
gctgggatta 
ttcccacatc 
gaatgcttct 
acctgttgtg 
cagcccctgt 
acttctagaa 
tgccagtgcc 
agttggtggc 
ggcgcaggga 
aggtggaggc 
ccacagagca 
ctggagtgag 
gcattccctg 
ctgcgagaag 
gctgggctgg 
ggataaaagc 
cctcagccag 
tcctgacagg 
cattcaagtc 
caagtttcaa 
atgaagagac 
aaaaatggct 
ttagataaca 
gggaggccga 
ggtgaaaccc 
aatcccagct 
attgcagtga 
tctcaaaaaa 
tgtggatatg 
cttcaggaat 
gtccatatat 
cctgtatata 
tgtgtagaca 
ggttaaaggt 
caaatataaa 



ttccagacca 
gactgagtct 
acctctgcct 
caggtgtgcc 
ttagccaggc 
gtgctgggat 
ttttgagacg 
gctacaacct 
ggaccacagg 
ttcaccacgt 
cctcccaaag 
ttaattagct 
gaggatggct 
ccggcctggg 
tcaattagct 
gaccgtactt 
cttgtcaccc 
gggttcaagc 
ccatgcccag 
tggtctcgat 
caggagtgag 
tgactgctag 
gagggactct 
tttccctttt 
cttttatgtg 
ggaacagact 
catcttacca 
tgagcacttg 
gttgggacca 
caggatatga 
gtgaaggatg 
aggcgggtct 
gataccctga 
ggggtgatcc 
gtacgtcact 
cccacttcaa 
gtgaccaaag 
gcagagcagc 
aggcaaatgt 
cgtcaagaat 
gcaacgaccc 
acaaaagaca 
gtcttctatc 
ggtaggtgga 
tgtctctaca 
actcaggagg 
gccaagatag 
aaaaaaaagt 
taagagaatg 
ataaafcatgt 
gtataagtgc 
tgcaagtcta 
gcgagaaaga 
atataatcat 
ataggaggga 



gcctaggcaa 
cactctgtct 
cccaggttca 
accacaccca 
agatctcgaa 
tacaggtgtg 
gagtcttgct 
ctgcctccca 
tgtgcgccac 
tggccaggct 
tgctggaatt 
gggtgtggtg 
tgagcccagg 
caacagagca 
accatgccct 
ttcttttttc 
aggctggagt 
gattctcctg 
ctaatttttg 
ctcttgacct 
ccaccgcgcc 
catagagcct 
ccaagtttag 
tcccttgact 
gcgtagccag 
ggagcaggct 
ttggcatgga 
tactccccac 
gaggtaccac 
cccccatgtg 
cccacaagtc 
gcgtggagac 
tacaacccgt 
caaagtgcct 
ctgttgtccc 
agatgagaaa 
tgagcatcat 
atgtccgcgg 
ggagtgaggg 
gacaaggaaa 
aatgcaatat 
gtattaggtc 
agccaggtgt 
tcacgtgagg 
aaaacacaaa 
ctgaggcagg 
caccattgca 
cttctatcaa 
atcttaggaa 
aggtagatac 
acatgtccac 
tatccataca 
ttaagcaaat 
tatttgtctt 
actagggagg 



catagtgaga 
cccaggctgg 
agcgattctc 
gctaatgtta 
ctcccgacct 
agccactgtg 
ccgtcagcca 
ggttcaagca 
cacacccggc 
gatctcaagc 
acaggcatga 
acatgggccg 
agtttgaggc 
acaccctatc 
caggaggagg 
tttttctttt 
gcagtggcgc 
ccttagcctc 
tatttttagt 
cgtgatccac 
cagccagaag 
gctcccagag 
ggtcagggta 
tccaggaagc 
ttagctcagt 
cctcctcagg 
agtccagggt 
tgtcagtttc 
tgaagcgtgg 
acagcaccag 
ccgcctgggt 
gagaggtggg 
gacctctgca 
gagtgcctgc 
agtcagtgtt 
actaatagag 
cagcaatggg 
ccttcccacc 
acctcctcac 
gactgaggaa 
gtgaactgtg 
cactggtaaa 
agtggctcat 
tcaggagttc 
aattagccag 
aggagaatcg 
ctccagcctg 
tgctaatttt 
tttagcggta 
ataaataaca 
atagagtgga 
tttatatgta 
gtgccaaaat 
tctctgcaac 
tcaaagcatt 



124980 
125040 
125100 
125160 
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125400 
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125580 
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126120 
126180 
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127020 
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127140 
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gcccaggtgt 
gttcccatgc 
tgtgtccagt 
agcgctcgct 
tggccaagtg 
accagctgta 
tctggttctc 
catagggagt 
tctcagccaa 
agcctcctgg 
cccttcgtgg 
catccaggaa 
tccctgaggc 
ctgaagatgt 
caggtgtaga 
gtttggaggc 
ttcacctggc 
aggaggatct 
tgcttgggga 
aaggaagtat 
tctgcccatg 
agctgatcat 
tcccaatgct 
gtaccccact 
tgggaggggg 
aagaggccaa 
tgtattcaag 
tggagtgcag 
ctcctgcctc 
tttttgtttt 
gaactccttg 
gtgagccact 
gacctgggtc 
gtctcactct 
cccccggggt 
atgccaccac 
ccaagctggt 
tgggattaca 
tgtgagacag 
ggaaagggat 
tgctgccaac 
ctctgcccag 
atttaacgaa 
ggagcagcct 
ggtttctgtc 
acttcacttg 
gcaggatccg 
cgccggtggg 
gccactgcag 
cctctgtgcc 
aggacagaga 
aaaccccagc 
ctactccagc 
tggaacagag 
aaggggtcca 



gcagagctgg 
ccaggacccc 
gccaccccac 
gccatggacg 
cagccggagg 
gcggagatca 
caggatggga 
gctggggggt 
tcagcctcaa 
agtgctctgc 
gggtattgtc 
gcactgggtt 
ctgagctcct 
ggaggctgat 
cgccagcatc 
gtaccccatc 
aggtgagtgt 
tcacacccct 
gggcagagga 
gcccagataa 
tccgtccctt 
gtgacgcttg 
gtccattgct 
gctagcttta 
ccaaaggcag 
caacacaaag 
tgttttattc 
aggcacaatc 
agcctcccga 
ttaatttttt 
cctcaagcaa 
gtatccggcc 
atctccttta 
gtcacccagg 
tcaagcaatc 
acccggctaa 
cttgaatccc 
ggcataagcc 
gaaccatcta 
aaatcccacc 
cctgaagctg 
cctggcctca 
gtccttgggg 
ggttttgcat 
aggagcaaca 
gaggtgaatc 
gccgtggtcc 
ctcaaacttc 
tctgacctgc 
ctcgagaagc 
cagcagtgct 
tgaggagaga 
cccacttggg 
gctgagactg 
gaagcagctt 



gacatgagct 
aacccttcct 
cctggcctct 
ttctgcacct 
gtatgggagt 
ctgggtgcct 
cctgctgggc 
cccagccaac 
tctctgcaca 
atctctttgt 
acctgtctca 
atgccagttg 
ccctattctc 
ggggggtgag 
gctccaggct 
cttgagccac 
gaccagctca 
gggggcatct 
tggcacactt 
aacagcagtc 
tcccgccccc 
gccagcgtcc 
cctgctctat 
gtggcctgtg 
catcaaacta 
aactgcattt 
ttattttttt 
atagctcact 
ggaactggga 
tgtagagaca 
ttctcctgcc 
tcaagtgttt 
tttgtttttg 
ctggagtgca 
ctagtgcctc 
tttttgtatt 
tggcctcaag 
accgcgcccg 
ttgttatctt 
tccccagcat 
catgggagct 
aggacaagcc 
ctcagcagtg 
caggagcaag 
gatgcgctca 
tcgctctgag 
tgccaggaca 
tggctgttga 
tcctgcaggc 
tccaccgggg 
caggacccgc 
gccctgggga 
tgaatacaag 
gctggggtta 
gggaatccct 



caaggccacc 
tggggctcca 
tacggttgac 
cacagaagta 
gggcatcctc 
cgttgggtgt 
tcacctggac 
ttccagcccc 
tcctacacca 
cctgctcacc 
tggtaaggca 
ctggccctgc 
tggccaatac 
accaaagagc 
gcctggggca 
ttaatggaac 
cctggaagga 
gcaagtcaca 
cttcttgccc 
cccttcaatc 
tgcgcctgat 
agcctacccc 
ttgggatgag 
gaagctattg 
gctctgaaaa 
ccttaaatct. 
agaaagagga 
gcagcctgaa 
ctacaggtgc 
gggtctcact 
ttggcctccc 
aatttgtgcc 
ttgttgttgt 
gtggcacaat 
agcctcctta 
tttagtagag 
tgatccaccc 
gcccatctcc 
cgtcatagac 
cctccagcac 
cctgttcccc 
ccttcgaagc 
tctctccact 
ggttccgttt 
gccacaaggg 
ccctgtgatt 
tggccatgtg 
aggtgacaga 
atacgtaggg 
aggcaatggt 
ttttgccacc 
gggggctttt 
ggagaaccag 
gattcaggac 
cccggggggc 



tccaggtaag 
ccatctggtc 
taccacgctg 
gaagccagta 
cagcaggaca 
gttgcagact 
ctcagccact 
agccccatac 
cccgcctgct 
aggatccccc 
gggctgggac 
cctgccctgt 
gcaaccttcc 
ccacgccgtt 
ggtgcagcac 
tgactgcagg 
tgttcctccc 
gtagggggta 
ccctttaaaa 
ctcacccagg 
gcattcctat 
accagctgta 
ctccacacac 
gtaagggacc 
tagttaccca 
tgttccaaag 
tctggctcag 
actcctgggc 
aagccaccac 
atgttgccca 
aaagcactgg 
aggcactctt 
tgttttttgt 
cacaactcaa 
gtaactggga 
acgaggtttc 
gcctcagcct 
tttaattttt 
tgagaaaaca 
ggggtaaatc 
aaccctttgt 
agtaggaggt 
tgccctaccc 
ctgtgggctg 
tgtgtcccca 
ggccacggag 
gagggtctcc 
gcgagcaggg 
agtggagcag 
gggcaccctg 
cctgagatcc 
aggggaagga 
gcctgggcct 
aagggctggg 
agccaggcca 



tggccttgaa 
ctgctctgac 
acagggcccg 
tcagccctag 
tggttcttgt 
agtgtcactg 
gcaagggcag 
agtccccggg 
gctacagggg 
tgacacccca 
tccacacctg 
ctcccctccg 
caagaactca 
ctcagcttgg 
accagtcttg 
gtagctgctg 

cgaggggctg 

ttgggtaagg 
gctcagtcct 
gcatgctctg 
gcccattgag 
gttttctgct 
agggtcatgg 
aactacctag 
gtttgtaagc 
cctctctctg 
tcagccaagc 
t.caagtgatc 
atccagctaa 
ggctggtctc 
ggttacaggt 
ctaaatcctt 
ttgagacaca 
tacaactcca 
ttacaggcat 
accatgttgg 
cccaaagtga 
atcataaatc 
gagccaggca 
ccggggcact 
gtctcccttt 
tgagggaacc 
tgg^ggccca 

gagaggggct 
gacaactcac 
agctggcagc 
aggtggccga 
tcctggcctt 
ttgaagtcca 
ggctcctctg 
ctcgccctgg 
aagacgcttt 
acgccggggc 
gctgagagcc 
ccccacttat 



128280 
128340 
128400 
128460 
128520 
128580 
128640 
128700 
128760 
128820 
128880 
128940 
129000 
129060 
129120 
129180 
129240 
129300 
129360 
129420 
129480 
129540 
129600 
129660 
129720 
129780 
129840 
129900 
129960 
130020 
130080 
130140 
130200 
130260 
130320 
130380 
130440 
130500 
130560 
130620 
130680 
130740 
130800 
130860 
130920 
130980 
131040 
131100 
131160 
131220 
131280 
131340 
1314 0 0 
131460 
13152 0 
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cacctgttac 
cgaagcggaa 
acaccctgtg 
gcttggggtc 
tggccgtgat 
gcaggcaaga 
ggcctggggg 
cagagccctg 
ccccgagaaa 
cgaggctgct 
tcctggccac 
catagtctaa 
ggcccgcaaa 
cccatagcag 
tgggctcaca 
gtccgagcct 
aagacctctg 
gcccaggaat 
atcccaagtt 
tttctccctc 
gcccagagat 
aggcttgtct 
tgccgaagtg 
tgcccgtgag 
ctccgcagac 
ctatctccac 
atgaatttgg 
tctctatttt 
actgaagata 
tgactaagag 
gaggtggcaa 
ccacaaatat 
catggagtgc 
agagatgcca 
atcttcatga 
aaaaaatagg 
gcctgtgaac 
aagacgagcc 
gtggtcccag 
gaggctgcag 
ctgtctgaaa 

ttgggaggct 

ggtgaaaccc 
agtcctagct 
acggtgagcc 
ctcaaaaaaa 
aaaaggagga 
actaatatcc 
catgaaatcc 
atataactgg 
aaatcaaaac 
acaaacaaag 
tgtattgctg 
tcaaaaagtc 
aaaagtactg 



tgtgaccaag 
gttgtaggaa 
ctcggggttc 
cgccgagtgg 
gccgtcgggc 
cccct tcaca 
aagaacggca 
cgacctgccc 
tgcacactta 
cagaggtttg 
tgcccgattg 
tctgccaagg 
gcttacctgc 
gttcttgtgc 
ggggcctcca 
gtgtccccac 
agtcagacac 
gccctcactt 
ctgctttccc 
ttccccactt 
gcccagttac 
cctaccacct 
ctgcctttaa 
atgtgaaaag 
ccactctggg 
cacagataat 
cttccagctg 
ccttgccttg 
tgtgccccat 
aggggtgagg 
ggcaagtcta 
gagctcaatg 
ccctggtcag 
gaaacacata 
ccttggatta 
taagttggac 
ccaacacttt 
tgggcaatac 
ctactctgga 
tgagccatga 
agaacaaaca 
gaggcatgca 
cgtctctact 
actcgggagg 
aggatcacgc 
aaaaaaaaaa 
caacccacag 
agaatatata 
aactccaaaa 
ctgataagga 
caaaccacag 
caacacagaa 
gtgggaatat 
aacataggat 
aaaacaggga 



gtgcctttca 
ccagagtcct 
cccatgaact 
ctcaccacct 
acctccacgt 
ccctgcacgt 
gggggacaga 
cagagaaggt 
gagcagccct 
gtagggtggc 
ggccctccct 
gctggactta 
tgggaagaat 
tgctcctgtt 
gggacacctc 
ctcctctgct 
cagcagagag 
ggtggcctgc 
agcccctcaa 
ccctgccccc 
tcactccaag 
gccttactaa 
cagtcactcc 
tcctcatttc 
ggcagttctg 
ctttgcctgc 
gaacttctcc 
ttttttacag 
agcagtgcgg 
tctttagggg 
ggtttgaaag 
tgtgcggagt 
ggagtgtcta 
taagtgaaac 
ggcaatgctt 
ttcatcaaaa 
gggaggctga 
tgcaagactc 
ggctgaggtg 
ttgcaccact 
acagctgggt 
gatcacaagg 
aaaaatacaa 
ctgaggcagg 
cactgtactc 
aaaaggatat 
aatgggagaa 
aagaattcct 
atgggcaaaa 
catgaaaaga 
tgagatacca 
aacaataaat 
aaaatggttc 
taccatatga 
ctctaacaga 



catctgacca 
cgggctgcag 

cggtgcggcc 

gccgctggcc 
cggcagggaa 
cctggggact 

ggggagggtg 

gccccagctg 
tctcagtgcc 
tcgaagacag 
tgacctcagc 
tgaaggctgg 
gaggccagga 
gcctaagagg 
tgggcacttt 
ggccaaccca 
ccaggaggcc 
ctcagaaaag 
ggataccacc 
tccccaccac 
acacccagct 
ctatagacca 
cctgctcaaa 
ctgcccccag 
tctgctcaag 
tgaaacctta 
tttgaggttc 
cttgtgagag 
agcctaccag 
ttcagagctg 
gtagcatgag 
gtaccatatc 

aggggtacca 

tataaaactc 
tcttaagtat 
tttaaaactt 
ggcaggagga 
tgtctctacg 
ggaagatcgc 
gcactccagc 
gcggtggctc 
tcaagagatt 
aattagctgg 
agaatcacct 
cagcctggtg 
gaatcaaagg 
aatatttgta 
acaatgaaca 
cacttgaata 
tgctcaacat 
cttcacatcc 
attggtgagg 
cgtcactgtg 
tccagcaatt 
gtacaccaat 



gcggttgacc 
gtccttcagc 
gcggaagcgg 
cgagtagtcg 
gctgaagatg 
ggagacgccc 
atacaggcct 
ggctcccaaa 
ccaggggtca 
atggcagctt 
aaccaaacat 
accctggaag 
ggagaagctt 
gtggtgcgca 
agccccagca 
ataagagggc 
acgttcccag 
cctgtgtgtc 
ctaaggcaga 
ccaaaccaac 
agaatgatgg 
taatggggct 
aaccttctgt 
ctctgtcccc 
ggctccctag 
ttcaaccttc 
ctgtagctac 
cccatttctc 
gcattcagca 
aggttcagag 
agcgctgtgg 
caaaaaggca 
tagacctgag 
ttagaagaaa 
gaaattaaaa 
ggccagacac 
tcactttagc 
aaaaattaaa 
ttgagcccag 
ctgggtgaca 
acgcctgtaa 
gagac cat cc 
gtgtggtggt 
gaacccagga 
acagagtgag 
acactatcca 
aatcatttat 
acaacaacca 
aacatttctt 
cactaggcat 
gttagaatgg 
atgcgaagtt 
gaaaacaatt 
ccactcctag 
gttcacggca 



tcactgatct 
agcaggttgc 
gcctccacca 
tagtaccaga 
caggggataa 
catgaggcct 
cagggtgcca 
ttctgccctg 
gtccactgcc 
cctgcccagc 
gtgacccagg 
acaggcacta 
gggcaagaag 
ctgcgctggc 
cctgctagaa 
agggctctta 
ctcaggctgt 
ccttggccct 
tgaacagctg 
aggaactgga 
tttcttcctg 
ttactgaact 
ggctceccat 
attccctgct 
ctgcccagct 
aaggagcagc 
cccagagcta 
aggacctaga 
aaccccttag 
ttggagtggg 
aacacatacc 
ggccctcaac 
cccaaaagga 
acaggtgaaa 
gcacgaaaaa 
agtggctcat 
ccaggagttc 
aaacaggcct 
gggaggggtc 
gagcaagacc 
tcccagcact 
tggccaacat 
gtgcacctct 
ggcggaggtc 
actcttctgt 
gagagtgaat 
ctgataaagg 
ccaaaaaaac 
caaagaagat 
taggaaatgc 
ctattcacaa 
gaaattcttg 
gggtcattcc 
gtatataccc 
gcactattcc 



131580 
131640 
131700 
131760 
131820 
131880 
131940 
132000 
132060 
132120 
132180 
132240 
132300 
132360 
132420 
132480 
132540 
132600 
132660 
132720 
132780 
132840 
132900 
132960 
133020 
133080 
133140 
133200 
133260 
133320 
133380 
133440 
133500 
133560 
133620 
133680 
133740 
133800 
133860 
133920 
133980 
134040 
134100 
134160 
134220 
134280 
134340 
134400 
134460 
134520 
134580 
134640 
134700 
134760 
134820 
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acfcaaaaggt 
ggtatgtaca 
gaaccttgaa 
ggttccaatt 
tagaggtaac 
agataatgaa 
atgtcatggg 
caccacatac 
ggccaagtga 
gcccatccgc 
ggaagggggt 
gcagaaggca 
gacaccgctc 
atttatggag 
tctgggaaga 
ctggggccct 
gggtcagatg 
gctatcttgt 
acctgatgga 
gaccctttcc 
ctaaccccaa 
aaggccagct 
aggagatctc 
cccccgcaac 

gggggaattg 

tgcagccctg 

ggaggttcct 

taaatcccca 
cacagtgggc 
ggttggctgg 
gaaagggcag 
tttcctgaca 
ggtgacatgg 
aattcctgga 
aattctcaga 
atttcacact 
aaactgtgct 
gtggaattgt 
gtcagggtga 
attgaggttg 
ttaagatacg 
ttgggatgat 
gtgaggtgag 
gagcagggtg 
gaagggtttc 
caccactcgc 
ttcagcccag 
gggtacctgc 
aggaaggacg 
gagaaggatc 
acccagaggc 
ctccataggg 
ggaccgctgc 
ttcaccagcc 
gaagggggct 



ggaaacaggt 
tacaatggaa 
aacattatgc 
acatgaaata 
caggggctgc 
aaagttctgg 
gtgtacactt 
acacccatgt 
acgaggaggc 
ctcaggggct 
gagccgaggc 
gaaggtgccc 
cccaggggac 
gcccaagggc 
tgcaccccca 
gggacctgga 
tgtccctggc 
cccagctccc 
agaatggtgg 
aggaccactc 
aggggtcagc 
tcagcccaga 
tgtccctgtt 
ttgggttgca 
cgggtcccgg 
aaccatccct 
gaggttcata 
ctcctactat 
actcacgttt 
ggccccaagt 
aggcagtttc 
ccaaggcttc 
ccagctggag 
gcagattatg 
gccatagaat 
tcattcctta 
ggacagcacc 
gctggggtta 
tgatcaggtt 
agtgatgggt 
gtggaggctg 
ggatggttgg 
ggtgagataa 
atggtcggga 
agttgagtcc 
acccagatgg 
ccccgctcct 
tccacagaag 
gccaagagct 
ccgtatcagg 
tggtcaggga 
gaaggccctc 
aacgctggag 
aaatccacac 
tcctgcacac 



caagtgtcca 
tatcaatcgg 
tcagtgaaac 
tctagactac 
agggaggggg 
aaatagtggt 
aaagacagtt 
tgccaacttt 
tggaagtggg 
gggctcagca 
ccagggccag 
cagcaggggg 
ccagccctct 
tcatctccag 
gcctccacac 
cacctcaccg 
cacaggggat 
ctgttcctcc 
ggttctggac 
ccatcacata 
cccaccggaa 
taactgttct 
tgaaagcctg 
gctgtggcct 
gaaagggggc 
cccttgggtg 
cttgggtctc 
tggtatgtgt 
gctgcctgtt 
ccctgacctc 
cctatggaca 
tccccaactt 
gttactggcc 
gtcaaaatag 
ccataactaa 
atctggctct 
agaagccctc 
gagtaaggag 
ggaattctgg 

gggggtaagg 

ggttgagatg 
gattagatga 
ggttggaata 
ctgggattgg 
atctttgatt 
agcccccccc 
aatgctccac 
gaagtggctg 
cctggtgcag 
ggctgcttcc 
ggcagctgca 
ggcaaggcca 
cctccccagg 
cccactctct 
agctctcctc 



tcagtgaatg 
ccataaggag 
cagccagaca 
gtatattcag 
agctaatgtt 
gatggttaca 
aaaacagtaa 
gcaatcctcc 
gacaggaaac 
gattccaatg 
tcaggctgac 
cacagtacag 
cgcaggctgc 
ttctctagga 
caggttcttg 
tgaagaaaag 
gtcagggtca 
catttggggt 
acagcgaccc 
atgtagaggc 
tccagcctat 
ggaaacagaa 
gagttgaggg 
agtgagcacg 
ggtgtgccag 
actcttttgg 
caagtcttgg 
tgcatcaggg 
tgagaccaag 
tgattccaga 
gctaggcccg 
ccccattggg 
cacagatagg 
tggaaatccc 
tgcttattgg 
aaatgcttct 
ctgtggacgg 
agattatcgt 
aagcaaatta 
gtgaaggttg 
aggatgatag 
gatgagtaaa 

ggggctgggg 

gatgcaggtt 
agtgtcttgg 
cgacccctgc 
tcaccctctg 
cggcggtgct 
ctggctcccc 
tctttcccaa 
ggccttgcgc 
gggacttagg 
caaagtgaga 
ctgagcccct 
cctgacacct 



gatgtggata 
gaatgaattc 
caaaagggca 
agactgaaag 
taatgattgc 
caatattgca 
attttacatt 
ctggtcctaa 
tcatgacctc 
actaccaggg 
caggtgggac 
ggcgggattg 
tggagtggac 
agccctaggc 
gccactggag 
cagcctgctg 
gctctgctat 
cctgaaaagg 
tggaacaggg 
cacctatgct 
tggctcagcc 
agagcaggga 
gacagtgccc 
cagcgccccc 
caacagggag 
aaatcattgt 
gtgctctgaa 
tggcttgagc 
tgcctcagga 
ggccaagttt 
ggtgtaggat 
ctagagaagg 
gagtcagggt 
caatcacagg 
ctttgacatg 
ggctggagtg 
acgaagtggt 
tgggctgggc 
aggctgggat 
gagttgggtg 
agtcggagtt 
tggttagggt 
ctggggctga 
ggaaggattt 
acttgggttg 
ccctatcccg 
ggcccaggga 
ggacctgcgg 
agggctctgc 
agcctcagct 
aatgccaaaa 
gactccagca 
aatggagtgg 
tagggaggct 
gagagggagg 



aacaaactgt 
taacatatgt 
aatattgtag 
tagaatagga 
tgagtctctc 
aatgtaccta 
atgtatattt 
atgctgactt 
ccagctccca 
gtcacacctg 
ttagcctgct 
ggacaggaag 
tgatctggcc 
ctcctcctct 
aatgatat ag 
ggcacactgg 
ggccagggca 
gcaatcgtga 
cgcgggggag 
tagcccggcc 
tgtcaccaca 
ccgctcagaa 
cgccccccgc 
tggtggtcga 
caggcagctc 
tccccagaca 
gacaggattt 
tggcctggta 
ggtcttggag 
agctggggaa 
tcagtttctg 
aagaacacag 
acggatgggc 
ccaaatgttt 
ggcagtagaa 
ctcactctcc 
gatggatgaa 

tggggctgag 

tggggtggac 
caggtgatgg 
gtggttaggg 
tggggtgcaa 
ggtagggtca 
gggggtggtg 
gtttggggtc 
ctcagccagt 
ccagggacag 
agaggagaac 
cggctcaaga 
ctactgtcca 
cgggaaagac 
agcagaagtg 
gggactccca 

gggggaggtg 

cgcgcccagc 



134880 
134940 
135000 
135060 
135120 
135180 
135240 
135300 
135360 
135420 
135480 
135540 
135600 
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135960 
136020 
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136140 
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136320 
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136500 
136560 
136620 
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136800 
136860 
136920 
136980 
137040 
137100 
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137400 
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137700 
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ct'ggggtggg 
ggagcctccc 

ggggacataa 

tcctgtacgt 
ttgcttttga 
gctcacggca 
gctgggatta 
gggtttcacc 
tcggcctccc 
accttcaccc 
attactgcag 
atgttaattg 
gtttgtttgt 
tggtgcaatc 
ctcctgcttc 
ttttttaagt 
ctgggctcaa 
caccactcct 
ctgcctacct 
caaggccacg 
cgcaggccat 
tacagagata 
tgcttgggcc 
tcaacagctg 
aatccttcag 
cccttgactt 
cacaaactaa 
tgccatgcaa 
cagatgacct 
caccagacca 
tggctacatt 
ttacctttcc 
tgcttcctgt 
taggctgggc 
gatcgcttga 
cgaaaaatac 
aggctgaggc 
cactactgca 
tgtaatttct 
tttttttttt 
tgcaatggca 
gcctcagcct 
gtattttttt 
agttcccagt 
agccaccata 
tagagaattt 
tagaaaacac 
gtaagtgttg 
tattttttgg 
tgcgctccag 
ccatgccctc 
taccagccac 
agtgaactat 
taacaacaac 
tagaagctag 



gatgcactac 
tgtaccactg 
ggccccaatg 
tttttggttt 
gatggagtct 
acctccacct 
caggcatgca 
atgttgtcca 
aaagttctgg 
ccaccttggg 
cccatcttat 
ctccaagcaa 
ttttgttttt 
ttggctcact 
agcctcctaa 
tattttgtag 
gtgactctcc 
ggccattttt 
gtactggtgg 
tgtgcaggtg 
ggtgccccaa 
ttaggtctct 
atttcctctt 
gtccggagac 
tccactggga 
tgaccttagt 
gtcttgaaac 
ggggcaggct 
gaatcagacc 
gtcaacctgt 
agcaagatga 
ttagcatgag 
catttccttg 
acggtggctc 
ggccaggagt 
aaaaattagc 
acgagaatcg 
ctccagcctg 
cagtaggtca 
ttttttttgg 
caatctcggc 
cccaagttgc 
gtagagatgg 
caagtgatcc 
cccagtgaat 
gccaaagaca 
caaataacgg 
gcatccaaga 
ctcagtcatt 
ccactgtatc 
tactttaaga 
tcaggaggct 
aattgtgtca 
aaaaaaggta 
aagttagtca 



acgatgccca 
agtactctta 
tccaggacac 
ttggcttctt 
cactctgtcg 
cccgagttca 
ccaccacatc 
ggttggtctt 
gattacagag 
acaggaaagg 
tgagggccta 
tcccagatac 
tgagacagtg 
ttggctcact 
gaagccggga 
agacggagtc 
cacctcgacc 
ttttttaatt 
ggagctggcc 
taaaaggatg 
agcagatgtc 
atcactctgt 
tcggccaaaa 
acccagctct 
ctctatacat 
catatgactt 
gtacttaagc 
ccaactaacc 
catggattga 
ggatgcgtga 
tacaaggcct 
gagagcaaag 
agtatacagc 
acgcctgtaa 
tcaagaccag 
tgggcctggt 
attgaacccg 
gatgacagag 
cactgttaca 
agatagggtc 
tcactgcaac 
tgggactaca 
ggcttcacca 
acccaccttg 
tattctcgtt 
aaactttttg 
tgttttaaat 
tgatatgaca 
gtcaatgggc 
tgcattcggg 
taactcctta 
gaagcaggag 
ctgcattcca 
attctttaaa 
catggtctta 



gaccaaggca 
agaaccccca 
tcctcaggtt 
gttgttggtt 
tccaggctgg 
agtgatcatc 
cggctaattt 
gaactcctga 
gtgagccgct 
aaagccccca 
gtttgtgcag 
aagcaacaag 
tcttgctctg 
gcagcttcga 
ttacaggcgc 
tccctatgtt 
tcccaaagtg 
tcacaactca 
acagcaaatt 
tgaaacccta 
tggtgacatt 
tcctctttat 
cacaaagggt 
aggcctgtgg 
ccaccccttg 
gctctggcca 
agtttgcctt 
tgctggtcca 
aacaaaactc 
atttattgct 
aagtcccaga 
acttgtctac 
aaactaggta 
tctcagcact 
cctggtcaac 
ggcaggcgcc 
ggaggtggag 
cgagactctg 
cattcatcta 
tcactgtcac 
tttgacctcc 
ggtgagtacc 
tgttgcccag 
gcctcccaaa 
ccagatagga 
gttgaaaaaa 
aaaatagaag 
actccacaat 
tgcttccagt 
aaagcaggat 
aattgctgat 
gttcacttga 
gcctgagtga 
ttgtacactt 
agtctttatt 



gttattctcc 
agaggcagtc 
ttctctttcc 
tttttctttt 
agtgcggtgg 
ttgcctcagc 
ttgtattttt 
ccccaagtaa 
gcacccggcc 
ccacagctag 
gcacttgaca 
actttttttg 
tcgcccaggc 
aattctaggc 
ttgccactat 
cccaggctgg 
ctggaactac 
aacttaattc 
aggcccctgt 
aatcagggtg 
ccaccctgaa 
agctcctgtg 
tcatcccatt 
ggttgtgaca 
gcttcatggt 
atggatgtgc 
tccccctgaa 
aagaggatga 

agctgagccc 

ggatgctgct 
acaacacacc 
cttgattagt 
aataaataaa 
ttaggaggcc 
gtggcgaaac 
tgtagtctca 
attgcagtga 
cctcaaaata 
ataacaatta 
ccaggctgga 
taggctcaag 
accagacgca 
gctggtctca 
gtgctgggat 
aaaccaagtc 
aataagtttt 
tgtttcaccc 
catgaaacta 
cattgtccaa 
ggaggaaagg 
gtggtgggtc 
acccaggagc 
cagagtgaga 
tattgctgct 
ctaacaggtg 



aagccattca 
ccgtgttgtg 
cctctcactg 
tgttttgttt 
cacaatctca 
ctctcgagta 
ggtagagacg 
tctgcctgcc 
actatcctat 
ttactgttac 
tgtgaagcag 
gtttttgttt 
tggggtgtcc 
ttaagcaatc 
gcacggctaa 
tctcaaacac 
aggcgtaagc 
aactcagtcc 
accctgaggg 
gatccagaat 
atgctcaggc 
tccactccta 
acttcctccc 
catgggtacc 
gggaaagcat 
acggacatga 
tttctgtcat 
cgaacacatg 
agcctacgtc 
gagaattttg 
cagaacttgc 
cagggagcac 
aataactagg 
gaggtgggca 
cctgtctcta 
gctactcagg 
gccgagatca 
ttttaaaaaa 
ttctttttct 
caggctggag 
tgatcctcct 
gccaattttt 
cactcctgag 
tacaggtatg 
atagggaggt 
gctacaagta 
tctccctcaa 
gatccctttt 
agtggctgct 
acaaagtagg 
actcctgcaa 
ttgaggctgc 
tcttgtctct 
tatattccac 
atgtgcccga 



138180 
138240 
138300 
138360 
138420 
138480 
138540 
138600 
138660 
138720 
138780 
138840 
138900 
138960 
139020 
139080 
139140 
139200 
139260 
139320 
139380 
139440 
139500 
139560 
139620 
139680 
139740 
139800 
139860 
139920 
139980 
140040 
140100 
140160 
140220 
140280 
140340 
140400 
140460 
140520 
140580 
140640 
140700 
140760 
140820 
140880 
140940 
141000 
141060 
141120 
141180 
141240 
141300 
141360 
141420 
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gtgaatcctg 
agcaaccagg 
gcccctgccc 
cagagccctc 
cagtctatag 
tgggagtcat 
caggcatatc 
gtcctctgga 
aatgttggct 
ctccacccga 
cctcacccag 
cacccacttc 
ctccctcctc 
aagccagaga 
aagacccatg 
acacatgcag 
gccggctggg 
gaatggcgtg 
agcctgggcc 
gcacggaagc 
gaggtcagaa 
acaaaaatta 
cttgaaccca 
ggtgacagag 
cacaccaaca 
tccaatacca 
gatggctggg 
aaagagggag 
ctgagagaaa 
accccaaagg 
gcagaggaaa 
tgaattggtt 
agaaacaaac 
agatgggtgg 
ctgtctctac 
cccgagtagc 
ttgaactatg 
aacaacaaca 
ggctagaggt 
atcctggggc 
cctagaggta 
gagtgagact 
aataaaaacc 
ttatgacaac 
gctggagtgc 
ttctcctgct 
aatttttgta 
agtatcaaac 
acaggcatga 
tagcaaccct 
atggtacagt 
aacttgaacc 
tatttattta 
tggtggctta 
gccagcagtt 



ggggcccagt 
accttaattc 
acctccttgc 
fcgcttcctca 
aggcccaagg 
gtagctcact 
catcacctgc 
aagcttgtat 
ctgctgttgc 
ggtgacctct 
tctgtcatct 
tgtccctgag 
aacacaacca 
cttccaggga 
tacatgtcct 
tccccaaagg 
cgcggtggct 
aacccgggag 
acagagacag 
ttatgcctgt 
gtttgagacc 
gccaggcatc 
ggaggtggcg 
caagactcca 
tctagttcaa 
ctgagagcac 
ggatgggaag 
gatcacgttc 
gctgccccag 
gttggtagct 
taccctaggt 
accaggtatt 
aggccaggcg 
atcacctgag 
taaaaataca 
tgggaggctg 
attatgtcac 
aaaaaggtaa 
tagtcacatg 
cccagtgcta 
gaggttgcag 
ccatctcaaa 
tacagcaaag 
tttttttttt 

tcagactccc 
tttttagtag 
tcctgacctc 
gccaccacac 
ctgagctaga 
aaaataacct 
caggcaacct 
tttatttatt 
cgcctgtaat 
cgagaccagc 



gctaagaagt 
agtcattcat 
cagtatctct 
gccttcagac 
gagatctttg 
ggctaaaagc 
tctacgatcc 
gaagagttcc 
ttctcttatt 
actctcatag 
gtccctctgg 
tgtttcaagg 
caaggtctag 
aagtcaacaa 
cccgttttat 
acaccgtctg 
cacgcctgta 
gcggagcttg 
actctgtctc 
aatcccagca 
agcctggcca 
tgtaatccca 
gtcacagtga 
tctaaacaca 
gatggtggac 
agtaagcaaa 
tatccactgc 
aggaacagat 
aggccggagc 
ggcacacgta 
gagttccagt 
accagacagg 
cattggctca 
gtcaggagtt 
aaaattagct 
aagtaggagg 
tacattccag 
ttctttaaat 
gtctcatgtc 
agaagtaggg 
agagcggaga 
aaataaaaaa 
aacaaagagc 
tttttttttt 
tctcagctca 
gagtacctgg 
agaccgggtt 
aggtgatcca 
ccatccagtg 
tgctattgtt 
tgaccgaggt 
gtttcctgag 
tacttactta 
cccaccactt 
ctggacaaca 



agagggagat 
tccttggacc 
gcagagcctc 
ccacttgctg 
agacccatac 
ctaggtcctg 
caggagtcac 

cagggcctgg 

gggtatctga 
ccacaccctg 
ctctcctctc 
ctctggggtt 
gtggaatgac 
gcccaaggat 
tccctgctca 
tacagagtca 
gtcccagcta 
cagtgagccg 
aaaaaaaaaa 
ctttgggagg 
acatggtgaa 
gctactcagg 
gccaagattg 
cacacgcaca 
tgagaacttg 
gggaaaagga 
acaggatttt 
gcgggtaaag 
agaagtggac 
cctctccacc 
atcaatcaac 
tgaggaagac 
cgcctgtaat 
caagaccagc 
gggtgtggtg 
ttcacttgaa 
cctgagtgac 
tgcacactac 
tttattctaa 
ggagatgaag 
tcatgccact 
taaataaata 
tattgcaccg 
ttggagatgg 
ctgcaacctc 
tattataggc 
acaacatggt 
cccgcctcag 
ttctgacaat 
atctccactt 
ttacatggct 
ttggactctg 
cttatttatt 
tgggaggctg 
aggagacagg 



gaagccctgc 
tacccagatt 
ctgcctcccc 
tgtctcaggt 
tgtagtcagc 
catagagctg 
tcaatgactc 
aacatcaaaa 
aggccgtact 
gaccctcact 
ctgttcatgc 
ttcagagaca 
ccactaaggg 
gcccgttaca 
gggtctgggc 
gatggagtta 
ctcgggaggc 
agatcgcgcc 
aaaaagaata 
ctgaggtggg 
accccatctc 
aggctgaggc 
tgccagtgca 
cgcacacata 
tctctgccat 
gacagaaggg 
gatttaattc 
gaaaccagag 
tctctacagg 
cccacatgta 
ctgccctttg 
caacacaaag 
cccagcactc 
cttgccaaca 
gggttctcct 
cccaggagct 
agagtgagat 
attgctgctt 
caggtgatgt 
ccctgccctc 
gcactccagc 
aataaataaa 
ttactgtggg 
agtatttttc 
tgcctcctgg 
acatcccacc 
ttcaccatgt 
cctcccaaag 
tttatactta 
acaggaaact 
ggtaagtggc 
aactactcag 
gagatggggt 
aggggaacag 
gtcttgctct 



cctcctgtgg 
ccagcctcag 
cagagggtgc 
ggggacaact 
ttggggcaga 
ggttcaaatc 
agaagctaga 
gagctctgag 
ctcctctctc 
cattcagtca 
atcttgtact 
ctgaagggac 
accggctctg 
agaaagttaa 
atacagtgga 
agaacattta 
tgaggcagga 
gctgcactcc 
tttagtctgg 
cggataacct 
tactaaaaat 
aggaaaatca 
ctccagcctg 
tttaaaccac 
tcctggccca 
ctgggaacag 
tagaagatag 
ccaaagcacg 
gactcaatac 
acactgcagg 
ttcataaaga 
agaaagat cc 
tgggaggcca 
tggtgaaacc 
gcctcagcct 
cgaggctgca 
cttgtctctt 
atattccact 
gcccgagtaa 
ctgcttgaac 

ct gggtgaca 

taaataaata 
cctggcagta 
tgtcacccag 
gttcaagcga 
acacccggct 
tggccagtct 
tgccaggatt 
ttaactcatt 
gagtcacaga 
agagaaatga 
ccatactgcc 
cagccgggca 
atcacttgag 
gtcacccagg 



141480 
141540 
141600 
141660 
141720 
141780 
141840 
141900 
141960 
142020 
142080 
142140 
142200 
142260 
142320 
142380 
142440 
142500 
142560 
142620 
142680 
142740 
142800 
142860 
142920 
142980 
143040 
143100 
143160 
143220 
143280 
143340 
143400 
143460 
143520 
143580 
143640 
143700 
143760 
143820 
143880 
143940 
144000 
144060 
144120 
144180 
144240 
144300 
144360 
144420 
144480 
144540 
144600 
144660 
144720 
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ctggagtgca 
cctcccacct 
atcgaaaaaa 
aactcctggg 
tgagccaccg 
caatcccgga 
tagtgccaat 
tcttggaaat 
acatctgtgg 
tttcctgctg 
cacgagagaa 
actgagggca 
tggatcaagc 
tcaaattcca 
gagctaccat 
aaaaaaccca 
aaggaatgct 
cacaggaagg 
ttcttctctc 
ccaggaccaa 
aagaaaagag 
ctttagggta 
ttctgcagta 
aacctgaaga 
aaactttgac 
tagcactttc 
ccaacatagt 
gcacctgtag 
cggagtttgt 
ctccatctca 
gatggcaggt 
agaaattctt 
ctataagtag 
gacatgggcg 
agtagatgct 
agtgatgttc 
tttgggaggc 
catgatgaag 
tggtgcacac 
tgagagtgag 
actccagaat 
ttgcattttg 
gtcatatcca 
tgagcaatca 
aaatttatat 
cattgattct 
gggactaaga 
cttatacaca 
atttgaaaaa 
agttaggtag 
gggcttccag 
gatttttgat 
tatttgaaat 
gtatacatgg 
agtggctcat 



atggtgcaat 
cagcctccca 
aaaaattata 
cttaagcaat 
tgcctgacca 
gggaacagat 
atgttcataa 
ttaaaataga 
ttttggtttc 
tattctcaag 
gcatgttgct 
gatgaaagat 
cttacctgaa 
tagaaggact 
caagggaaac 
agtcacatca 
ggataaacac 
tgggcacagc 
agaaagaagt 
atagaaagaa 
aatccatttg 
gaggaaatga 
aatgacgttt 
taaagcatgg 
catgaaaatg 
ggaggccgag 
gaaaccccat 
tcccagctac 
ggtgagccaa 
aaaaaaagaa 
gacctggggc 
tctaaaattg 
caccaacaca 
gggagatggg 
ctcttgagtt 
aaactccaga 

cgaggtgggc 

ccctatctct 
ctgtagtccc 
accctctctc 
atataaaaat 
tttcatctct 
cagcccccaa 
fctcctgcctc 
cacccagatt 
ccacttcacc 
gacactacag 
gatttttaaa 
actcacaaac 
gtcaggaatg 
tcaacagaag 
gatgaggagg 
ctgctggtag 
agaataaata 
acctgtaatc 



catggttcac 
agtagctggg 
tagagatggg 
ccacccacct 
atacttcctc 
aattcaagga 
aacaaaaaaa 
attgctgaaa 
caaaattagt 
ccctgtgggt 
gctcacagtt 
atatactgac 
gctaaaggga 
ccaattatcc 
tgccagacct 
ttgaactgaa 
aacaactact 
aagtagtctt 
aagaaaggca 
aacacatctg 
accttgacat 
tactatgttt 
atacaatcat 
aagatttcat 
taaaaaagag 
gcgggtggat 
ctctactaaa 
atgggaggct 
gatcacgccg 
aagaaaagaa 
tgggggaggc 
atgtatgagg 
ggactccagt 
aatgggtgat 
actaaattga 
aaaggctggg 
agatcactta 
actaaaaaaa 
agttattcag 
aaaaaaaaaa 
gaaaagaggg 
ctgtatatgt 
ccatccacca 
cttccagttc 
cccttccagc 
aaactggtca 
ctgaccctta 
aaaataaaca 
cacatagcta 
cataaatata 
gctattagta 
attagtgccc 
ctggaattgg 
aataaataaa 
ccaacacttt 



tgcagcttcg 
accataggca 
ggtctcacta 
tgccctccca 
tttaataaac 
atacaagaaa 
agcagctatg 
taaaaagaaa 
tcactcttct 
tggtggaatt 
cattatgagg 
cctgggtcct 
tgagtgcttc 
ttgcctcagt 
tggtgacaag 
agagctgcag 
tcatctcact 
tctggttgac 
attggagaac 
aagcatgatt 
ctgaaacagc 
ccttctcaaa 
aacatccgtt 
tataattacc 
cccaggcgca 
cacaaggtca 
aatacaaaca 
gaggcaggag 
cggcactaca 
aagaaaagta 
acaatttttc 
aacgaaattt 
aatcacacag 
ttccattccc 
gaaacaaaga 
cgcggtggct 
agcccaggag 
aaaaaaatac 
gatctcacta 
aaaaaaaaaa 
cggaagagtg 
tgaacttttt 
ggcatcaact 
gtcttcctgt 
taccttaatg 
ataattgtct 
aacaacatgg 
tattgaaaaa 
gaaatatcaa 
tatgttaatt 
attaagtatt 
ccaaccccca 
gcaactaaat 
taaataaata 
gggaggccaa 



acctcccagg 
cccactatca 
tattgcctag 
aagtactaga 
taaacaaaaa 
acttatgaaa 
aaaagaaagc 
ttcaatagaa 
ttgggtaaaa 
gacacctctt 
agccagcagg 
tgtggacatc 
tcactgttaa 
tgtgtgtcca 
cccacccact 
ttccccgaaa 
gcacctgcaa 
ataacaaaga 
cagaaacgac 
ttgaacaatt 
aaatattctc 
ggagctatct 
ggttttcaat 
gaacagcatg 
gtggctcacg 
ggagtttgag 
ttagctaggc 
aatcacttga 
gcctggtcaa 
gacagcagaa 
ttacatagtg 
taaatataga 
caaacaaaac 
tgctttaatg 
taaagtatat 
catacctgta 
tttgagacca 
aaaaaattag 
ctgtactcca 
aaaagaagaa 
gaactgcggg 
ttttaaccta 
gctgtgttta 
actgcagagg 
ctaccacctc 
cttctaaaaa 
gtttgaactg 
aattttggag 
aaaaaaaaaa 
ggctgtttat 
tgagtcaaaa 
tgttgttcaa 
tcacgagatg 
agtatttgtt 
ggcaggcaga 



ctcaagcgat 
tgtccagcta 
actggtctct 
attacaggtg 
gtgaacaaac 
agaaagattc 
aaaaaataat 
aagggtgtga 
ataccctgat 
ccatfctactg 
taaagacaac 
attgagtccc 
tagccactga 
gccctcggat 
gaaaatgggg 
gacaggagag 
agaatccagg 
caacatctgt 
aaaatctatt 
agtggggtgt 
catcaaggca 
ggttacgtgg 
tttcagaatc 
caaatgttac 
cctgtaatcc 
accagcttgg 
atggtggcac 
acccaggagg 
cagagcaaga 
attagaggga 
atagggctca 
ttgtttagaa 
tgagaaaaca 
atgaggtgtc 
tgtttagagg 
atcctggcac 
gcctaggcaa 
caaagcatgg 
gcctgggtaa 
gaagaagaaa 
gtagaaaatt 
catgtactac 
tatggagggt 
ctagaaaact 
attcagccat 
tattagaatt 
tgctagtctg 
agatgtgaca 
actgaaacag 
attattagta 
gttatacaca 
gggttagcta 
tggaacggtc 
ggcctggtgc 
tgacttgagg 



144780 
144840 
144900 
144960 
145020 
145080 
145140 
145200 
145260 
145320 
145380 
145440 
145500 
145560 
145620 
145680 
145740 
145800 
145860 
145920 
145980 
146040 
146100 
146160 
146220 
146280 
146340 
146400 
146460 
146520 
146580 
146640 
146700 
146760 
146820 
146880 
146940 
147000 
147060 
147120 
147180 
147240 
147300 
147360 
147420 
147480 
147540 
147600 
147660 
147720 
147780 
147840 
147900 
147960 
148020 
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tcaggagttc 
aaatcagctg 
gagaattgct 
ccagcctggg 
aaagaataag 
acccaaaata 
aaaatggaca 

ggga-ggccga 

agtgaaaccg 
gtcccagcta 
gctaagatcg 
aaaaaatggg 
aagcatatga 
gtaccactaa 
gtgaggatgt 
ccatttttaa 
taacaattac 
cacagtggct 
aggccaggag 
cgctctgtca 
cccggcttca 
cgccaccatg 
tagactggtc 
gggattacag 
caacacagtg 
cacacacttg 
agtttgaggt 
agagaaagaa 
acgaaaagaa 
agaaagaaag 
ccatacaaaa 
gaagcaacta 
aatggagtat 
actaaatatg 
cacaaaaagt 
gtgtgatttc 
agggttgccg 
gtcagtgata 
acccatagaa 
aatgtattga 
aatggggaaa 
ttttataaat 
taaaaagtca 
agtgcagtgg 
ctgcctctac 
tacaggcatg 
ccatgttggc 
tcaaagtgct 
aggcagtctg 
agcagggtgt 
tgcactccag 
aataaaattt 
agaggagagt 
ctggcatatg 
cctacataca 



gagaccagcc 
ggcgtggtgg 
tgaacctggg 
cgaaagagtg 
aaaacaagcc 
cacaaagaac 
aacgggccag 
gcagggcaga 
tctctactga 
.cttgggtgtc 
cgccactgca 
caaaagatgt 
aaatatgctt 
acccctatta 
ggagtgacag 
gacagtttgg 
actcctagat 
cacacctgta 
ttctttcttt 
cccaggctgg 
agcgattctc 
cccagctaat 
ttgaactcct 
gtgtgagcta 
atactctgtc 
tagtgtcagc 
tgcagtgagc 
agagagagag 
aagaaggaag 
aaaggaaaga 
ccctgcgcaa 
agaagctctc 
cattaagtaa 
catttagtaa 
aatatccata 
aactatttga 
ggggttctgg 
ttattctata 
agtacaacac 
tattagttca 
ctgcgtgtga 
ctaaaacttt 
attttttttt 
cacaatctct 
ctcccaggtt 
cgccaccacg 
caggctggtc 
gggattacag 
gctggggcag 
atctcttgag 
cctgggtgac 
taaaaggcaa 
ccatgaacaa 
ttcaattccc 
aaccctggat 



tggccaacat 
tgtgcacctg 
agtcagaggt 
ggactctgtc 
acagattggg 
tcttaaaacc 
gcatggtggc 
tcaccttagg 
aaatacaaaa 
tgcggcagga 
ctccagcctg 
gaacagacac 
aacctgatgt 
aaatggtcaa 
gactctcatt 
cagtttctta 
agttacccaa 
atcccatcac 
ttatttattt 
agtgcagtgg 
ctgtctcggc 
ttttatattt 
gacctcaagt 
ctgtgctcag 
tctacaaaaa 
tactggggag 
cacgattatg 
agagagagag 
gaagggaagg 
aataaaaaag 
gaatgtttat 
caatgggtga 
taaaaagaag 
tattaatatg 
ttactaaatg 
ctttctggaa 
ggacaggaaa 
tgatactata 
aaagagtgat 
ttaattttaa 
gggaggggat 
ttttttaaat 
ttttttttga 
gctcactgca 
caatcaattc 
cctggctaat 
aagaactcct 
ggttgaacca 
tggctcacgc 
cccaggagtt 
agagtgagac 
tctgcagcaa 
cagggagagg 
tgttcccagc 
ccttgacatc 



ggtgaaatac 
taatcccagc 
tgcagtgagc 
tcaacaacaa 
agaatatatt 
caacactaag 
ggtggctcac 
tcaggagttc 
attagccgac 
aaatagcttg 
ggcaacacag 
ctcatcaaag 
cattaaggaa 
aatccagaat 
cgttgctggt 
caaaactaaa 
ttgaggtgaa 
tttgggaggc 
atttatttat 
agtgatctca 
ctctgagtag 
ttagtagaga 
gacctgcctg 
ccaaggccag 
aaaatttttt 
gctgaggctg 
cctctgcact 
agaggaagga 
aagggaggga 
gaaaagaaaa 
agctgcttta 
aaggataaac 
acttagcaaa 
cattaagcaa 
aaagaagcca 
aaggcaaaac 
ggatgaatag 
atggtggatc 
tcttaatgta 
caatgtacca 
acatgggaac 
aaagtctctt 
gagggaatct 
acctctgcct 
tcccacctca 
gtttgtattt 
gacctcaggt 
catgcctggt 
ctgtaatccc 
cgaggctgta 
tctgtctcta 
gagaagatga 
aaagtggtca 
cctcaggaag 
aacttctcct 



cgtctctact 
tgcttgggag 
cgagattgca 
caacaaaaaa 
tgcaaaacaa 
aaatcaaaca 
atctgtaatg 
tcaactagcc 
cgtggtgacg 
aaccagggag 
tgagactccg 
aagatataca 
ttacaaattg 
gctaaaaaca 
ggaaatgtaa 
catagtctta 
aacttatatc 
caaagaggaa 
tttcttttga 
tctcactgca 
ctgctgggat 
cagggtttca 
cctcggtctc 
gagttccaga 
taattagtca 
aggatcactt 
ccagcgtggg 
aggaaggaag 
aggaagggag 
gagagagaga 
ttcataattg 
aaactgtggt 
ccacaaaaag 
taaaaaaaga 
gtctgaaaag 
tacagacagt 
gtggagcacg 
catgatatgc 
aactatgtgc 
cactaatcca 
tctgtactac 
agaaaacaat 
tgctctgtca 
ccttagttca 
gcctcccaag 
ttagtagaga 
gatccgcccg 
ctaaaatgtc 
agcaccatgg 
gtgtgctatg 
aaataaataa 
agttagcagg 
tggtggcagc 
cccagatgtc 
ttccttcatg 



aaaaatacaa 
gctgagggat 
ccactgcact 
gcaattgtta 
atatctgatg 
accccattaa 
ccagcacttt 
tggccaacat 
tgcatctgtg 
gttgcaatga 
tctcaaaaaa 
gatggaagat 
aagcaacaag 
ccaaatgctg 
aatggtatgg 
ccatacaatc 
cagggccgag 
ggattgcttg 
gacagagttt 
atctctgcct 
tacaggtgca 
ccatgtcagc 
ccaaagtgct 
ccagtcctgg 
catgtggtgg 
gagccccagg 
caacagagta 
gagaaaagaa 
gaaggaagga 
aaccttatat 
ccaaaaactg 
atatctgtgc 
taatatgcat 
cgtgtcaagc 
gctacgtgct 
aaaaagatca 
ggaattttag 
ttttgtcaaa 
tttagtcaac 
agatgttaat 
agctcaataa 
aaaaataaaa 
cccaggctgg 
agcgattctc 
tagctgggac 
tggggtttcg 
cctcagcctt 
catttttaaa 
gagaaggctg 
atggtgccac 
ataaaaatac 
agagcaggac 
tcccaggctc 
cccacctgcc 
tgctctaatg 



148080 
148140 
148200 
148260 
148320 
148380 
148440 
148500 
148560 
148620 
148680 
148740 
148800 
148860 
148920 
148980 
149040 
149100 
149160 
149220 
149280 
149340 
149400 
149460 
149520 
149580 
149640 
149700 
149760 
149820 
149880 
149940 
150000 
150060 
150120 
150180 
150240 
150300 
150360 
150420 
150480 
150540 
150600 
150660 
150720 
150780 
150840 
150900 
150960 
151020 
151080 
151140 
151200 
151260 
151320 
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agtttttgtt 
gtggtgttat 
ctgtaatcct 
gaccagcatg 
tgtggtggcg 
agcccagggg 
aaagagcaaa 
ttctgtgtgt 
agcctcccta 
aacctcctgg 
gtgtgccacg 
tctgccaccc 
gggctcaagc 
ccatgcccag 
tggtctcgat 
caggcgtgag 
tactcatact 
.accacacacc 
aacctatcta 
aaaaaccaag 
gatgaaattg 
gagggaagca 
tccgcctggg 
ccaaatttct 
ctagtcaaca 
agagactgaa 
ggaagtacac 
gctgcccctc 
aaggaccagt 
ggaggaacca 
caattccgtt 
aggctgggaa 
ctgttcttag 
tgcagcccca 
acatttcctt 
ttctgtgggg 
agagagaatt 
caaacagaag 
atgtcagcag 
ccagcccttc 
caccctttgc 
atgtggtata 
ctccacctgt 
gtcagtacag 
ttgggacagg 
gaaggaggag 
cttgaaccct 
cccatttcca 
gctgggtggg 
agggacacta 
aggtgggccc 
cccagcctcc 
gtgacctgtt 
ccccaccatg 
cagcacacct 



acttgtggaa 
gagaagacag 
agcattttgg 
gccaacatgg 
agcgcctata 
gcagaggcgg 
actctgtctc 
tttcttttct 
tgtcacccag 
gctcaggcga 
acatctggct 
aggctggagt 
aattctcctg 
ctaatttttg 
ctcctgacct 
ccactgcacc 
aaatggcaac 
atagctgctt 
tccacttctt 
cagagttctg 
agcacacaaa 
ggctcatctc 
tgagctcatc 
acccgcaatc 
ttttctctgt 
aattcagagt 
agccctaggc 
tgctatctcc 
cctcccactg 
caggcgccct 
tatatctcat 
atatactgtt 
gaaaatgaga 
caacctcctc 
tcccttagtc 

tgggctcact 

gtacaaggaa 
atgtccaggt 
agtaggacac 
aaccggctgg 
acttttgtct 
ctctcagcca 
ctgctgggaa 
aggtgtggga 
gtattgggga 
gaaccagctg 
gggccaccag 
gggaggaaga 
gcaggcagag 
gagacagggg 

gggtgaccag 

tgagctgtgg 
ctctctaagc 
gttgtggtga 
gaggttcagt 



catttaacta 
cagccaaaag 
gaggccgagg 
cgaaaccccc 
atcccagcta 
cagtgagccg 
aaaaaaaaaa 
tttctttttt 
gctggagtgc 
tccttccacc 
tgtgtgtttt 
gcagtggcgc 
cctccgcctc 
tatttttagt 
tgtgatccac 
ctgcctggct 
acattctcaa 
ttaccgtttt 
gtcctcccct 
gggaggcaga 
gccagaatcc 
tttccctctc 
gtcctcgtgt 
ccaaactcca 
gggatcgaat 
cctaaacaaa 
tggaataggg 
agggtgtgtc 
acattccagg 
ccagctgttt 
tagccaggct 
tctgctgggt 
atagactggg 
atgcttgatg 
ttgttccttt 
ggcaagggga 
gctgaatttc 
ggagcaaatg 
aagggcaggt 
ccaccaagca 
gtaccatctc 
cccctcaggc 
ccctcacatc 
ggtaggtcca 
ggttcagcag 
ttgacagagc 
gctcctgcag 
tgatgcagcc 
gaattcctgg 
tggcccagga 
ctgggagcca 
cacaggcaag 
ttcagctttc 
gggtaactga 
caacaaggcc 



acatcattaa 
acagctgcag 
tgggtggatc 
tctccactaa 
cttgggaggt 
ggatcgtgcc 
aaaaaaaaga 
ttctatttaa 
agtggcacag 
tcagcctcct 
cttttctttt 
gatcttggct 
ctgagtagct 
agagacgggg 
ctgactcggc 
tgcatgtttt 
gccccttcct 
ctccttaaaa 
caaacccact 
agcctcctct 
tcttttgtgg 
ttcccttccc 
cttcaatttc 
ctctgagctc 
cagataggat 
agactttctt 
atactgctgg 
cctcaaccac 
caacagaatg 
caaggaaggt 
tagtcacgtg 
taccgtatgc 
ttgggggagg 
tggctgaaat 
tctgtacccc 
ggtctcaccc 
cagctggtct 
cagctgagag 
ctctactgaa 
catcggctcc 
ctccaccagg 
tcagatccag 
acactgggag 
atcctggctg 
gaaggtggga 
agcaacagca 
gcaactggga 

tgaggcaatt 

aaggaagctc 
gctgcccacc 
cccctgggtg 
agcaccggcc 
tcatctgtag 
agtgtcaggg 
cactggtgaa 



ccaagtggac 
ccaagcacag 
acctgaggtc 
aaatataaaa 
tgaggcagga 
acttcactcc 
cagctgcaac 
ttaatttatt 
tcacagctca 
tcctagctgg 
tttttgagac 
cactgcaacc 
gggaatacag 
tttcaccatg 
atcccaaagt 
ctgacatact 
tttcttctcc 
actcaaaaaa 
ccattttgat 
tatgatattg 
aaagatgggg 
ttctctttca 
cacctccagg 
cagacccatc 
tatattctgc 
tttcttgtgc 
ggggctccgc 
actgtccaaa 
gagaagaaca 
tcctggaaga 
gcaacaccta 
atgactaaaa 
ataaccagca 
aaaaagagca 
ataccctagc 
aagctcagtc 
ggaggaaaca 
ttcttgtcca 
agcaccaggg 
actctgcctc 
agtgcccttc 
agccacctcc 
acgaagccaa 
gacctggctt 
cctcaaggct 
gggagccacc 
gcagctcagc 
ggccacctgg 
acccctcagc 
tcctatcccc 
ctcaggtacc 
cacggggtct 
aaagagggtg 
taggggaccc 
gaaaacctcg 



ctgtctcaga 
tggctcatgc 
aggagttcga 
attatccggg 
gaattgcctg 
agcctgggtg 
aaatgtcaag 
ttagagtcag 
ctgtagcctc 
gactacaggt 
ggagtcttgc 
tctgcctcct 
gcgcacacca 
ttggccagga 
gctgggatta 
gtcaaaagga 
tatctgcttt 
accttccccc 
tctgcatcat 
gaagggggtg 
gcagtgggca 
caccacacgc 
agatcgactg 
tactaattgc 
tgtgactaac 
tgaaaagtct 
ctccttgcag 
gtggtgactg 
ggctgaggaa 
tgctaaataa 
gaagcaagaa 
agagagcgct 
ttggctacac 
tctcccccta 
agaggggcaa 
aagggtgagc 
ctgaaaaccc 
aggggacaga 
gcaaagtcac 
ctgcccgtgc 
ccgctcctcc 
tccaagaagt 
aagtgggggg 

tgggtgccat 

ttctttttat 
aaggctgagc 
ctctcacagt 
aggaagttca 
actggggtag 
atgaataagc 
atgcttctgg 
gcatggggca 
aagagtcctt 
agtggggcac 
cccaccctcc 



151380 
151440 
151500 
151560 
151620 
151680 
151740 
151800 
151860 
151920 
151980 
152040 
152100 
152160 
152220 
152280 
152340 
152400 
152460 
152520 
152580 
152640 
152700 
152760 
152820 
152880 
152940 
153000 
153060 
153120 
153180 
153240 
153300 
153360 
153420 
153480 
153540 
153600 
153660 
153720 
153780 
153840 
153900 
153960 
154020 
154080 
154140 
154200 
154260 
154320 
154380 
154440 
154500 
154560 
154620 
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ogtccctcct 
gtgggtgcat 
gcgaatgtgt 
cttgggcagg 
tgctaacagg 
aaactgcctg 
ctcctgcctt 
gactccccaa 
gcctgaggta 
gttcagaagc 
ggctggaaaa 
gtcatgggga 
tgaaagtcac 
tctcatgaga 
gctgctcctg 
agatgcagga 
agggagacgg 
ccacctggac 
tgctatcctt 
atcagactga 
ggtttggagg 
ctcagagcct 
cagggaaacc 
accctgcaga 
cagcctcaaa 
acaaagacac 
gcacacacag 
gcaaagtctt 
acagacggac 
ccatgttggc 
aaggaggccc 
ctgtagctgg 
aaaatagccc 
ctcatttgat 
ggtggactga 
caggcagaac 
attcatccgt 
cggtggtgtg 
cagcctcctg 
ctttttagag 
caagccatcc 
cccagcccag 
tccccagaag 
gtgccccctg 
gcgcctccgt 
gcctgccctc 
tgtaggggca 
gggcagggaa 
gaaggggccg 
gcagctggac 
cccctctccc 
tcagggaggg 
aggccctgcc 
gcggcgcggc 

ggcggaaggc 



gaggcaggtg 
gcatgtgtgt 
gtgtgtgcgc 
agtccggggg 
ctgaggtggg 
ctggtctgcc 
ccaaacccag 
gcagacacag 
gggatggatg 
acagaaggca 
agagtttggg 
gccacggaga 
ctctggttgc 
ggggaaatcc 
gggaggacct 
agctggtgag 

gagggcacag 

aattgaatcg 
ctctcatgtc 
ggggggccga 
gcgctggcac 
gctgccgtgg 
aggtggaagg 
ccccaggaca 
cgaggtgcag 
aagatgcagc 
acacagtcag 
acacatacaa 
acacatagac 
ttggtatcca 
tctggggcag 
ggagcatttt 
ttgcaaaggt 
ggggctgtgg 
gaagcatctg 
gagtcagctg 
tcattcattt 
atcacactta 
agtacctagg 
acagggtctc 
tcccacctca 
aaagagacct 
ggccccggtt 
gtgcggagtg 
cgtctgaact 
atccgcaatt 
gtaggggtgg 
gtggaggggt 
aggctgagcc 
cgcaggaggc 
ttctcccacc 
gccctggacg 
gctggcctcg 
catccggcct 
ctcagggccg 



gtggaggcac 
gcatgcatgt 
acatgcatgt 
caaggggggg 
ggatgacatg 
acatcgggcc 
gaagtcttga 
cacccacaca 
tggacaagca 
ggacgtgccc 
aggtcagcca 
gtataggagt 
agatagagat 
ctgaccacag 
actcagaggc 
gtcagtgggt 
agaggtagag 
agcttcctct 
agactggatg 
aatgtggggg 
tgaggacagg 
ccctcagaga 
gaaaaagaaa 
tgcacagcag 
gcctgcaggc 
aacacacaca 
acaatccaag 
agacacactt 
gcacactggg 
gaattgttgg 
tggctctgag 
ggctgggcag 
ttggagtgga 
ggcccttctc 
gaacttctgt 
cctttgaaag 
atttatgaca 
ctgcagcctt 
actacagacg 
cctttgttgc 
gactcccaaa 
tcaccccaca 
aggacaagaa 
gtccccttgc 
tgagcaaaga 
ctgcctcttt 
gaggagagag 
ctagggcggg 
ccaaggtggg 
cgaccagggt 
ccctgggctg 
cgggaccagg 
cacatctggg 

gggggagggg 

ggcgcacgtt 



atgccctctc 
gtgtgtgcat 
gtgctaagct 
acatgcaaag 
tcttagaaac 
aagctgtggg 
aatccccatg 
agggtctaag 
gctcagggct 
aaggggccgg 
agggcagtcc 
ggtgatggga 
ggactggggg 
tccagaggaa 
tgattcttcc 
gggcccacaa 
ggcagggact 
agtgtggaat 
cggcccaagg 
tgggggaagg 
ggatacgccc 
cgagcccatc 
aaaggggcag 
acacacactc 
cccaaccaag 
cgtaatacag 
aaacacactc 
aagcacagac 

a gg c ggcctg 

tgcagcgcac 
aagaaaggca 
agaggaggaa 
agagcaaaag 
tgacgggcag 

gggggcacat 

ctccacgagg 
ggttctctct 
aaactcctgg 
tgagctacta 
ccaggctggt 
gcgctggatt 
aaccccctgt 
aaaggctgtg 
cgggcatggg 
cagagctagg 
tctggccact 
ggttccgggt 
cccttccccc 
c c t g t ggggg 
ctggaactcc 
ctaggggagc 
ctgggcccct 
ccgggagcgc 
gcggcggccg 
gagggctgcg 



cattgtgtgt 
gtgtgtgcat 
ctggagaaca 
ttctctccac 
aaacttcacc 
tacttggaaa 
gaggggacct 
ggaccccttt 

ggggctggcc 

acatggggtt 
acgccttggc 
tggccccaga 
cagggctgag 
aacagcgggc 
tcgtgctacc 
ccatgctgga 
ctgaggaccc 
gtgggctcac 
gtggggtctg 
agggagcttc 
cccatgagtc 
tttggagctg 
gcatgagtgt 
caatccctcc 
gacccctgtt 
aaacacagtc 
acacatgaac 
acaaacacac 
tggagactgc 
tccccatgcc 
gcaagcctag 
ggagagcatt 
ggatgtccca 
ccttcccagg 
ggggcagagg 
gacctagcag 
ctgtcaccca 
gctcaagcaa 
cacctggcta 
ctttgttgcc 
acaggcatga 
ctggaggcca 
tgcctcggag 
gcctcagtct 
cctagcctgg 
gagga cc ggg 
caagggactg 
tccgctgttg 
cggggcgggg 
tcttcccctc 
gggggaggcg 
cggcggaggc 
gggccgaccc 

gggtgggccg 
accgccgcag 



gcatgagtgt 


154680 


gcatgtgtgt 


154740 


agcagccagg 


154800 


agcccctcag 


154860 


ctctgcccta 


154920 


gagggccctg 


154980 


ggggccaggg 


155040 


ctccccaacc 


155100 


taggtggaag 


155160 


ggaggctagg 


155220 


tcttctctgg 


155280 


gttgtgcctt 


155340 


aggaagggca 


155400 


ctggggagag 


155460 


ccacccaggg 


155520 


gaggagcagg 


155580 


tcatccatgc 


15564 0 


tggggctatt 


155700 


tgactccctc 


155760 


caaacaggaa 


155820 


ttcacccctg 


155880 


gagttgggga 


155940 


gaggggccag 


156000 


tggctctgct 


156060 


acacacagac 


156120 


acacactagc 


156180 


agacatataa 


156240 


caacagatgc 


156300 


tgattggata 


156360 


tgacctcatt 


156420 


ggaggtgaca 


156480 


tcagggaggg 


156540 


agaactgtgc 


156600 


cacaaaggga 


156660 


ggaaggggaa 


156720 


aaattcattc 


156780 


agctggagtg 


156840 


t cctccaccc 


156900 


atttttaaaa 


156960 


cagccagctt 


157020 


gccacgggtt 


157080 


agtcccagca 


157140 


gagaggctgg 


157200 


gtgagactcc 


157260 


ggacccgagg 


157320 


ctagggaagc 


157380 


gaggggtctg 


157440 


ggccctcgct 


157500 


gagcccctca 


157560 


cctccgccgc 


157620 


caggaggggc 


157680 


ccgcgcaggc 


157740 


ggcggcgcag 


157800 


gccccaagaa 


157860 


cgggcacggc 


157920 
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ccaccgagct 
ggaggtgagt 
tgccgtcccg 
acagatctgc 
cgaccttggt 
tcagaggctt 
ggggcctgct 
gggagttgag 
ccgaagcgca 
ttccgggcca 
cctatctgtg 
agggaccctc 
ctgtgggcag 
acccaagggg 
gaggccattt 
cacatggtcc 
gccgtttctg 
ctgtggcctc 
gagcagagag 
tgggtgggct 
tcccatagtg 
ctccgaggat 
ccagcacttt 
tgagccgtga 
aaaaggagcg 
tgggatggga 
gactaatcgg 
actagaggtc 
ggctggggcc 
atcaaactgt 
ggcccctgaa 
gctttcggga 
ggtgtgtgtg 
ccatgggatc 
cactctgtca 
tccaggctca 
ctaccacgct 
ctggtctcga 
ttacaggcat 
tccttaaaga 
gaaaattcag 
gcttttatca 
ctacaggaaa 
aataggctct 
gagacggtag 
gatgtggaat 
gtctcactct 
tgcttcccag 
gtctgccact 
ggccaggctg 
gctgggatta 
ctaggttgtt 
acaaagtggt 
gctgaagaat 
cagtagtatg 



gggggcccgg 

tctggggcgg 
ggctgtcggc 
aagcccctct 
ctcccacctg 
aggctggaga 
ccagggagcc 
tgtatgtgtg 
acagagtact 
gcttgaggtc 
aggttagcgt 
cggagagcag 
agcatgggag 
gtgcggcagg 
ctgtcactcc 
catcttctta 
atctgtgctt 
acttagcccg 
cagcaggcgg 
ctttgtctac 
ctcctgggga 
tctgtcaggc 
gggaggatga 
tcgtgccact 
ggagtggggg 
gtctctggtt 
ccctgcctat 
aatatgaagc 
agagtttggc 
cccactcaat 
gcggggctca 
gctgggcttg 
ctcgcttggc 
ctccttcttt 
cccaggctgg 
agcaattctc 
gggctaatat 
actcctgagc 
gagccaccac 
actcatgatg 
aatctgcatc 
ataccttgca 
gggcactgag 
tgccatccct 
ataactggac 
ccagtgtaga 
tctcgcccag 
gttcaaacga 
acgcttggct 
gtctcgaatt 
caggagtgag 
tctcctccag 
tccccagtgc 
cagaacaaca 
ctggttaatg 



gatcgcggcg 
gggctgccgg 
tgagacccca 
ctgcagcgtg 
ggcttcgggg 
aactcctgtg 
aaactgaggg 
tgtgttgggg 
gagtggccca 
caggtggggt 
gtgttgatga 
acgcagcgca 
gggctctgtt 
atgtatggaa 
ctgccccagc 
agggctcctc 
tgggccctgg 
ctgacggagt 
aagaggcagg 
acaggcatcg 
catgagcgcc 
aaagggttgg 
agtgggagga 
gcactccagc 
cagggttcgt 
gtgagttcct 
tgggtagaag 
caagagttag 
atgtgcttgt 
tactctcaaa 
gcctcccctc 
tcccgtgttc 
agtcagccac 
aaca'caattt 
agtgcagtgg 
ccacctcagc 
ttgtattttt 
tcaagcaatc 
acccagccct 
actagaatgc 
atgaagggaa 
tatctggact 
gcacagagat 
gaaaccccag 
aagcttccga 
gcgaggactc 
gctggagtgc 
ttctcctgcc 
aatttttgta 
cctgacctca 
ccacggcgcc 
gtacaatagc 
ttagcacaga 
tacaaataag 
gtttccaatc 



gctggacggg 
gcgccccgag 
ccaaaaacct 
gctgcgggct 
gaccacctcc 
tcttaggggc 
acccggtgcc 
ggtgcctcag 
ggtgccaggg 

ggggtccagg 

gagagactgg 
ggaatcccag 
gtttgtgaac 
agcaaagatg 
gcaggatggg 
tccagaacag 
ttggcccagt 
cctgttccat 
gtcctagtga 
tgtggagccc 
cagctgaaac 
gcccctgcaa 
tcgcttgagg 
ctaggcaaca 
cctaggctat 
gctgggccac 
tcagggccag 
gggtcagtcc 
agccaggaaa 
cccataggca 
attttcagct 
tgtggtcagc 
ctcttggctt 
tttttttttt 
tgtgatcatg 
gtccaaagtg 
cgtagagatg 
cacccacttt 
taacacaaat 
gattctcagt 
agaacactgc 
tgttaacata 
gtggcatgat 
cataggggca 
tagctgggaa 
agtctttttt 
aatggcgcga 
tcagcctcct 
tttttagtag 
ggtgatccac 
cggcccagag 
ctcggtccag 
gcctggcatg 
gctgccagct 
agctctggaa 



gctggagctg 
tggagaaagg 
ccagactttg 
cgggaaggca 
cctgtccctc 
tggcaggact 
taggtctgag 
gggtcccagg 
tctgaggctg 
ttcccgctag 
aagggagtga 
gccagagggg 
tggctgctcc 
aggacaggga 
gcagccacct 
agtggggcag 
ctgtggcttg 
tacagggctg 
ggaggggagc 
tgaaacccag 
accccacaca 

ggtggctcac 

ccaggtggtc 
gagcaagacc 
tgtaagacca 
accctatcct 
gggtcagggt 
atgggcaggg 
tggagttcct 
ccaccagtct 
gtggactctg 
actctcctta 
acctccacct 
tttttttgga 
acttactgca 
gctgggactc 
gggttttgct 
gacctcccaa 
tgatttagtt 
aagaccaaaa 
tcaaacctgc 
agccctctgt 
gcactcacgc 
ctcagcatat 
gagtataggc 
tttttttttc 
tcttggctca 
gagtagctgg 
agacaaggtt 
ctgcctcagc 
tctttttttt 
aaaatccatt 
aagaatctcg 
cctccctcga 
gaaggataaa 



tcgggagggc 
cgaggaggtt 
tctggtgggg 
cttggggcag 
gcacagctgc 
ggtgtgggct 
gtggaaacct 
ccctgtggtc 
ggacctggtc 
ggcagtggag 
tcacgaaact 
taatgggggg 
catgatgagg 
ggggccatgg 
cactcagcca 
ccctttgggt 
gaagaaacgc 
gctgggtgct 
tggaatgacc 
aacactgcat 
ggaagagatc 
gcctgtaatc 
gaggctgcag 
ctgtctcaaa 

gggga.tactc 

ggtccccagt 
gaagctgatg 
ttagggatca 
ttctgtgtaa 
gaaattagta 
cccagaccta 
agtgtccctg 
ctcggggaat 
gacagggtct 
gcctcaacct 
taggcatgca 
gttgcccagg 
agtgctagaa 
cctacctcct 
tatctaacaa 
ccaactccct 
gtcacccatt 
taggtgccat 
cgcacaatga 
ccatggtttg 
ttgagatggg 
ctgcaacctc 
aattacaggc 
tcaccatgtt 
ctctgaaagt 
tctggtggtt 
cattcattca 
ataagtgttt 
gaccttgatc 
atcctgattt 



157980 
158040 
158100 
158160 
158220 
158280 
158340 
158400 
158450 
158520 
158580 
158640 
158700 
158760 
158820 
158880 
158940 
159000 
159060 
159120 
159180 
159240 
159300 
159360 
159420 
159480 
159540 
159600 
159660 
159720 
159780 
159840 
159900 
159960 
160020 
160080 
160140 
160200 
160260 
160320 
160380 
160440 
160500 
160560 
160620 
160680 
160740 
160800 
160860 
160920 
160980 
161040 
161100 
161160 
161220 
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atggtgtttg 
aatgtgatgt 
ggtgggaccc 
ctggctccta 
agctgctata 
atccaactcc 
atgcaggaga 
ggctggtttg 
gccagctctg 
gagaaggaat 
accaatgaga 
cttctcaaac 
cggtggctta 
gccaggagtt 
aaaattagcc 
ccccagctac 
agtgagctga 
tgcctcaaaa 
catgttctga 
caggtgatac 
tttgaaatat 
catcttagaa 
gccctcacat 
gcgtcactac 
agctcatgtc 
agtgtttcca 
acaatcttca 
cagcctttaa 
caacgaaggc 
gggcggatca 
ctctactaaa 
tgaggaggct 
gatcgtgcca 
aagacaaatt 
gttgcccttc 
gcccagccca 
tttgaccttc 
ctccatcaga 
gcacctggca 
aagccatttt 
tgttaatttc 
tgttggagag 
caaaggactc 
agggacaaat 
gatgttggct 
ttctgccctg 
gcaatgtacc 
gtgtgcatgt 
tctgtgtgca 
gaccctcatg 
tgtgcacaca 
tgtgtgagtg 
gtacgggagt 
gtctgtgtgc 
ccccgtgtga 



ccaatttctg 
cattgaatgc 
agctcagcat 
agaaccctcc 
tgcatgccca 
aatggtacag 
atcctggaca 
tgtgaactgt 
tgtctgcctg 
gcattggccc 
acatagtgtc 
ttcaatgtgc 
cgcctgtaat 
cgagaccagc 
aggcattttt 
ttgggaggct 
gaactgagat 
ataaataaat 
ttcagcaggt 
caatgctgct 
cctgcctata 
aataaatatt 
gtattcctga 
actccagcct 
acctcttgct 
gtctcatctg 
tgagaccatg 
atgtaggttc 
caggcacggt 
cctgaggtca 
aatacaaaaa 
gaggtaggag 
ctgcattcca 
gagtcatgcc 
atataaaatc 
gccctctgtc 
ctagcactca 
ccgttggttt 
cagagcaggc 
ttgacacaag 
attagacatg 
atgtattaaa 
cagccaaaag 
aaatcccact 
gtcccggaga 
acccatcacc 
tctcccacag 
gtgcagggca 
ccggtctgtg 
tgtatatgag 
ggagtgtatg 
tgtcagtgag 
atagtctgca 
acaggagtgt 
gtgcatcagt 



tcttgtaaat 
agaattggga 
cccactccat 
cctccccctc 
caaccagcac 
aggacagcaa 
tacaagagtg 
gtgagcaagg 
tgttctatag 
tgcgtagcct 
agtggtctgc 
acctgaatca 
cccagcactt 
ctggccaata 
aggtggcatt 
gaggcaggat 
cacgccactg 
aaataaataa 
ctgggctggg 
gatcctcagc 
gcctggcccc 
cttcctcacc 
tttcagccaa 
gggcaacaga 
cactcctcac 
gggacctttt 
aggcatctca 
tcctcgtgac 
ggctcatgcc 
ggagttcgag 
ttagctgggc 
aatggcttga 
gcctgggcaa 
cccaccccac 
aaatctgaaa 
cagacttacc 
gcaacgcttg 
ctagggcagg 
accagaaaac 
ctgggaactg 
agaatggtgt 
gagttcacgg 
taaacaaata 
ctgcttgtct 
tgggcctgca 
cactcacccg 
agcttcccaa 
ggtctgtgtt 
tatgtgagta 
tgtgggctga 
catgtgcgtg 
tgtgtgtctc 
ggtgggcatg 
ttgaatgtcc 
gagtgtgtgt 



actcccagcc 
agagatagaa 
catcctgagc 
tctgtgcttt 
ccatccttct 
gagtgtctgg 
tgtgggagtc 
gtgtgtgcat 
caaaactttc 
cctcctttgt 
agactggctg 
cctgggaatc 
tgggaggctg 
tggtgaaacc 
ttaggcaaat 
aatcacttga 
cactccagcc 
ataaataaat 
gcctgaggtt 
ccacactttg 
agaacttcat 
tcccttcagc 
gcttttcaga 
gtgaaactcc 
ctaagcttac 
acagggatct 
gtctcctggg 
agagtgacac 
tgtaatccca 
aacagcctgg 
atggaggcac 
acctgggagg 
cagagtgaga 
cctgccacct 
tagggtgcac 
atgtcctctt 
tcatttcaaa 
gacagtgtct 
atttagaagg 
tgggacttta 
tgcgatgaca 
gtgaaacaac 
aaataaataa 
gttcctgttg 

ggggctgtac 

agcaggcact 
cagctcttct 
tactggaccc 
tgtgtgcaca 
gtgcgtgtgt 
tgttttgttc 
tgcgggcagg 
catacagaga 
atgtgttttg 
ctctgcgggc 



atgactgatt 
agaatcagct 
tgggctctgg 
cccttttctc 
tctgattctg 
aaatctccat 
caggtgtctg 
gtgtctatgg 
agattgggcc 
ttagaggagc 
agagacactg 
ttgttaaaat 
agacgggcgg 
ccgactctac 
taggtggcgc 
acccaggaga 
tgggggacag 
aaataaataa 
ctgcatttct 
agtaacaaag 
agacaggatc 
tttagcgccc 
aattctagct 
atcttaagaa 
ctcccttggg 
agctttaccc 
ccggtatata 
cctatctcaa 
gcactttgag 
ccaacatggc 
tcacctgtaa 
cagaggttgc 
ccctgtctca 
caaacctttc 
aggccctgcc 
cccaccccgc 
atcacttgcc 
ctctgggtca 
atcctgattc 
gatgagattt 
ttgtttaaaa 
atgatggctg 
gcattcagat 
acagctacag 
atcggtaaga 
gagacccacc 
gtgagtgagt 
ccctgtgtgt 
caggactggt 
ttgtgtgtgc 
aggctctgtc 
tctctgtgca 
gttctgcgtg 
ttcaggccat 
aggtctctgt 



tgaagctacc 
cgcatgagct 
gctcctgccc 
ctcaacagcc 
attcctgagc 
ggtgctggcc 
tccacaatgg 
gcatgccaga 
gctggtgaca 
agccaggccc 
ttgagcaaca 
aggccagacg 
atcacctgag 
taaaaataca 
atgcctgtgg 
tggagattgc 
agtgagactc 
atatgaaata 
aacaagcttc 
ctgttgatgg 
gagcccctcc 
tccaactact 
caatatgatt 
aagaagttct 
ccactaaaac 
atctcttggt 
ttcctctgtt 
aacaaacaaa 
aggccgaggt 
gaaaccccgt 
tcccagctac 
agtgagccaa 
gaaaaaagaa 
aagatttgct 
tgcagggtaa 
attatccaac 
tgattatctc 
ctatctccca 
caagaaagta 
aaggaattac 
cgtctttatc 
ggatttgctt 
gagggaacaa 
ggcctgcaag 
acccccacca 
tacaggttgt 
gtgtgtgtgt 
gcagaaaggg 
ctattcacag 
cagaagggtc 
cacaggaccc 
cgggatctgt 
tgcatggagg 
gtgcacagga 
gcacaggatc 



161280 
161340 
161400 
161460 
161520 
161580 
161640 
161700 
161760 
161820 
161880 
161940 
162000 
162060 
162120 
162180 
162240 
162300 
162360 
162420 
162480 
162540 
162600 
162660 
162720 
162780 
162840 
162900 
162960 
163020 
163080 
163140 
163200 
163260 
163320 
163380 
163440 
163500 
163560 
163620 
163680 
163740 
163800 
163860 
163920 
163980 
164040 
164100 
164160 
164220 
164280 
164340 
164400 
164460 
164520 
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tgt'gtacggg 
gcgtctggcc 
gaggcaagag 
ccttcctgcc 
gtggcagggc 
ctttttccct 
cccctcatgc 
agacctgcct 
cagcctaagg 
ctccctccct 
tcacatgccc 
ctgccccata 
tactccctcc 
gtcctgccct 
agacttccta 
aggaggtgct 

tgtgggcatg 

tagggtggag 
aggaggaggg 
gcccttccat 
cccacactgt 
ggtcatctgc 
atgtgtgtct 
ccagcaagga 
ccctgcttgt 
cccagtgcct 
cacaccctcg 
gtgtggacag 
gatgggggta 
attggccttg 
atgtaactaa 
gctgaggcaa 
ttgtggacca 
aaaaaaaaaa 
gtaatcccag 
aacagcctga 
gtggtggtgc 
acctgggaag 
cagagtgaga 
acctccccac 
agcttaggga 
ttctctgggg 
tgtcactatg 
aactgcctgg 
atggtgtgag 
gcaggaaagc 
gctcagcctg 
aggtccaggg 
agcattgggg 
gcacttgcag 
cattgacttc 
catccacatg 
ttcctcatac 
tgccaccccc 

gggtccttgg 



agtatagtct 
atagcctggt 
ggacactcct 
tcccttccca 

tggagcctgc 

gcccctccag 
ctggcccttc 
ggcagcacgt 
cgagccagac 
agccagcctc 
cttagcaatg 
agccgcctga 
caccaaaggt 
gaaatgctgg 
atgttgtgga 
ccccctcctc 
ggagctgcag 
gggagcaacc 
cttctctctc 
ctctggactg 
gtgtctgagg 
tagggaaacg 
ttcctacacc 
ctgatggaga 
gccaggatga 
agcccacccg 
cagtctccag 
ggtcgtgcgt 
gcaccatgcc 
ggtgaccctg 
ctcccaaccc 
gtgttcccct 
cagcccacct 
aaaatagtct 
cactttggga 
gcaacacagc 
acacctgtaa 
cggaggttgc 
ccctgtctca 
ccccaagctc 
cccccttatt 
acatgcacgt 
gccacaaaga 
aggagcaaaa 
gtgggggtgg 
aggagcttga 
gagcctggga 
ctggtctgga 
ctgtggggga 
aaacccgagg 
ggcaccacgt 
atgaggtgag 
cttggtccca 
agtctggggc 
cctcaactgg 



gcaggtgggc 
ctgggagcag 
caggaagtcc 
agctcgagcc 
cctggctgcc 
cacctcagac 
gcctgtgttc 
gacaggactg 
ctgcctgatt 
cctggcctgt 
cccagtgtgg 
tcctcggggc 
ccagctcagt 
ctcagggtgt 
cacccccagt 
gatcctcctc 
agttccctca 
ccacctccta 
ctctactccc 
agtctccctg 
ggccctagtg 
ttgcctctga 
ccgcagtcct 
acctttgggt 
actgccgtct 
gctccccgag 
taagcccaga 
ggcaaagtca 
tcttgtcccc 
gactataaag 
ccccacacag 
gagcatgtgg 
taatccaagt 
ggaaaccatt 
ggccgaggtg 
gaaaccccat 
tcccagctac 
agttagccaa 
aataaaaagg 
cctgggtcac 
cttaggggag 
gccctctcct 
gcaagaggat 
ttgctcccag 
ggctaaggca 
gccaaaccct 
gggcaaggga 
ccacagtcag 

tgggggtgtc 

tccgagcccc 
ctagtggcta 
gtcggctggg 
aaagtactgt 
tccccactgg 
agcagctccc 



acgcatgcag 
gcttgggttc 
attcacaaag 
acagcaactt 
tgaggagagg 
tctggccctg 
ccaggttacc 
tggctgaggg 
tgtccatgga 
ccatggatat 
tggtggcagc 
gaggcctggg 
cccaggcgtg 
gtgtatgtgt 
tcaggactgg 
aagcagccag 
agtcctgatt 
gccagttagc 
tcccaggagc 
ccacacaatg 
gggagacagt 
cagcagaccc 
cctccccagc 
gcctggcttg 
cttctctgca 
gacccaggaa 
gcagggacca 
tgacagggcc 
gtacactcca 
ggtaacaatt 
gaggcaccca 
cttttctagg 
catatctgaa 
agcttgggct 
ggcagattgc 
ctctacaaaa 
tcgggaggct 
gatcgcacca 
aaactactag 
cccacctgga 
tccagccctg 
ctctgccaca 
aaaatatata 
ctgggcccag 
gtccagaggg 
gcctggggct 
gcccgagcag 
tggaggggag 
cctgctggct 
ccagcaggcc 
tgctttcagc 
ctgagagagt 
caccgagaca 
ggtaaaagtt 
aaccctctgt 



agagttctgc 
tccgcagggg 
acctggtggg 
ggcctctcgg 
gagcccttga 
tccaggcgct 
cccaaggcca 
agcacagggc 
gcttcctacc 
caaggctgcc 
taccagccag 
gctgtcctcc 
caagagggtc 
gtgtgtgggg 
gctatctcta 
ctgggcctgt 
agaaccacgt 
tctgtagctc 
tcacctgccc 
cttttctgtc 
gagctcctgg 
tcagctcctg 
agagagccct 
gggccccgct 
ggctccagcc 
agctgcggca 
ggtggtgggt 
tcagctagga 
gtcatgtgcc 
ctactgagaa 
cggccattga 
tccccagagc 
ggagaggaga 
gggcatggtg 
tttgagctca 
aatacaacaa 
gagggtggag 
ctgcatttca 
cttgaagtat 
tgaccttgcc 
gcataggaga 
gaatcagaca 
cactggtcca 
actaatggct 
agggtgggca 
acctgagaga 
gtgggcaggt 
tgtcttcccc 
gcttgctaat 
tccttctctg 
tttgccagtg 
gaggtgggga 
tggggt ct cc 
ggaggatggg 
aggaaaagtc 



gtgtgcatgt 
tgggaggccg 
ctgggccctt 
gctgcagctg 
aatgtggcca 
tcccagagtc 
gggagagtgg 
ctgcctggct 
agatctcact 
ccagtactcc 
gcaggccctc 
agggctactc 
ttgatggggc 
ggtgggaatc 
tgggtgtggg 
ctgtggccct 
ggcccaactt 
caccttttga 
tatctcccct 
tctctcccct 
ggaaagcatg 
cccctgcgtc 
ttgccttagc 
gccataccta 
cggagcggtc 
ttgcccccct 
gacctggctg 
aggaggaggg 
ccccagaagg 
ggcgagcccc 
agctggcctg 
actgggcctc 
tgtgttgcca 
gctcatacct 
ggagttcaag 
ttagctgggc 
aattgcttga 
gcctaggtga 
ccaggacacc 
ctggtgatcc 
aggcacacac 
gtctggtttg 
tgtgctgtaa 
gcctatgaag 
gaggctggaa 
cacgtccaag 
ggggtgggcc 
atgcagagga 
tctaagtctt 
tggtggtggc 
accctgaggc 
gggtggggag 
ttggaggccc 
ccccaggcct 
agactttgtt 



164580 
164640 
164700 
164760 
164820 
164880 
164940 
165000 
165060 
165120 
165180 
165240 
165300 
165360 
165420 
165480 
165540 
165600 
165660 
165720 
165780 
165840 
165900 
165960 
166020 
166080 
166140 
166200 
166260 
166320 
166380 
166440 
166500 
166560 
166620 
166680 
166740 
166800 
166860 
166920 
166980 
167040 
167100 
167160 
167220 
167280 
167340 
167400 
167460 
167520 
167580 
167640 
167700 
167760 
167820 
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tt6aattttg 
tattttcctt 
tggcacgatc 
agcctcccaa 
gttttttgtt 
gcaatctcag 
tcttgagtag 
tgtattttta 
ttcgggtgat 
cacctggcct 
tgcagtggtg 
acttcagcct 
aaaaaaaaaa 
aattcctggc 
gtgagccacc 
tacaaacata 
gcttcaacac 
aacggcctag 
tttagggggg 
ccttctttga 
gtccaacatc 
taaaacaaga 
tcacacagtt 
ggacctcaag 
gcaggcagta 
gtaagagggg 
acaggtaact 
ttccacatca 
gcatcccaga 
catccaggaa 
attccagcct 
aatacccatc 
ctcacctgga 
gcattctgtt 
ctcttggcta 
atctcacctt 
tctccctgct 
cacccaagaa 
agacccgggc 
cttccacagc 
gcgggactgg 
cagggctcca 
tgggacaaaa 
ggctaaaggg 
gctaatgtgg 
ggggtcgggg 
gaagctccag 
gaggcctcct 
gaggcagtaa 
ttcagggagc 
cccctatttt 
gccaccgccg 
agcctgcacc 
ccttcctgtg 
caagccagct 



aatttttttg 
tttttttttt 
ttagctcatt 
gtagctggga 
tttttttgag 
ctcactgcaa 
ctgggactac 
gtagagacgg 
ccgccagcct 
tttttttttt 
tgatcatagc 
cctgagtagc 
attttttttt 
ctcaagtgat 
atgcccagcc 
tatgtgtttt 
tattttgtag 
tattccatag 
ttttttcctc 
aaacttgtgc 
tcaaatcttt 
aataaaaata 
gttttgaaag 
gaccttggga 
ggtgaagcta 
cttcccaagg 
gctcccaaaa 
caggaaacca 
tgtcttgctg 
cactgtctct 
cctcccaagc 
tacttctctc 
tggctgcaat 
gctccatggg 
ctttaaatcc 
agcattcttg 
ctttcacatc 
gggagggttg 
gtggcccacc 
tttggctaca 
ctctacttcg 
gacagggagg 
ccaaaacgtg 
agacgtcgga 
gtggggctaa 
cagagctaat 
gaaacggggt 
gcagagccgc 
atggaaagac 
acgcccttca 
cccctcatcc 
gagctcagag 
aagccatact 
atttgattag 
ttcctctcac 



agatataaca 
tgaggcggag 
gcaacctcca 
atacaggtgt 
acggagtctc 
gctctgcctc 
aggcgaccgc 
ggtttcaccg 
cggcctccca 
ttctttgaga 
tcactgcagc 
taggactaca 
ggagagacag 
tatcccactt 
cacaaatatt 
tttaagtaaa 
atatttcatg 
tacagatgtg 
ctctcatttt 
cgcaatctcc 
tttcttttcc 
aaaacaagaa 
acatgtttca 
cctcaagacc 
caggagacct 
cctggagact 
tacatgggac 
caaaggctct 
atcaacacac 
gtcctgtgga 
tctctcttcc 
catggccact 
atcctcctcc 
agccagtgac 
acgaagacag 
tgttttctcc 
tgtaagtttt 
cactgactgc 
agaagacccc 
ccgcccgcga 
agaagttcaa 

cggggccagc 

tgaggaccgg 
ctccggtgtg 
tagtgaagct 
gtcacatggg 
gattttaaga 
cctgtgtccG 
gatgcccgcc 
ggtgcgctgc 
gaaaccgctc 
gtcatcttct 
gatgggaggg 
tacctccagt 
tgccccctcc 



aacataacaa 
tctcactctg 
cctctcgggt 
gcgccactat 
actctgttgt 
ctgggttcat 
caccacaccc 
tgttagccag 
gagtgctagg 
cagggtctcg 
ctcaaactcc 
gacagcacca 
ggtctcactg 
tagcatccca 
tttctaagct 
agtgagatat 
tcagcacata 
tcataattta 
tcactattac 
atggggtaga 
ttttttagta 
aatgaaggtt 
aaaatccatt 
ccagtcctct 
caaagctctc 
ggatggaagg 
ttgagcttca 
gagcacttac 
atggctgagg 
ggtgacagtc 
caccagacac 
gatcgtgccc 
ttggtttcct 
ctttctttct 
aattctgaat 
tgctgcctat 
tgcacatgct 
cctgtgcctc 
gacctgcctg 
ttactaccat 
gatgaagatc 
atggaaaagg 
cccgatggag 
ggcggagctc 
ggggttgcag 
gcaagagtgg 
gcgaggtcgt 
tgccaggatc 
ctggaggtgt 
ggccccacct 
ccccatcccg 
ccagtaccct 
gggccgattc 
tccgcagagg 
tgtaccagga 



taaaatatgc 
tcgcctaggc 
tcaagcgatt 
gcctggctaa 
ccaggctgga 
gccattctcc 
agctaatttt 
gctggtctca 
attacagatg 
ctctgtcaca 
tgggctcaag 
ccacacctgg 
tgttgccagg 
aaaatgctgg 
tgtagacaaa 
acaacttgct 
cgaagctacc 
tccattcccc 
aaaaagtgca 
ttcctagagg 
aaaaggaaaa 
ctaagggctg 
caaatgttgg 
gaatatgtcc 
aacccagact 
gtgaatggag 
tgagactgga 
agccctctag 
ccatgctggg 
acaggaaata 
caagccccta 
aagttcagcc 
ttctccatgc 
aaaggcaaat 
ggccttctgt 
ggagccgacc 
gttccctctg 
cgcaggaaat 
ctgctgactc 
gacctggacc 
cacagcgcca 
gcagggctaa 
tcgtggctga 
agaaatgagg 
gaggggtggg 
gacggtggta 
caggaaatga 
tcaccttgaa 
tcgcccatgc 
ctgccgactg 
tccccgacat 
cctccctttt 
ttccagctct 
gctgaagacc 
gctgagggag 



aaatgttttt 

tggagtgcaa 

ctcctgcctc 

cttttgtggg 

gtgcagtgac 

tgcctcagcc 

tttttttttt 

aattcctgac 

tgagccacca 

caagctggag 

caatcctccc 

ctaatttaaa 

gctggtcttg 

aattgcaggc 

cacacacata 

ttgttttaaa 

actttctttg 

tattgacagt 

gcagtaatat 

tgattcctag 

gaaaaaataa 

agttgacaag 

gttttatgtg 

tagagggtcg 

gggatgggag 

acatagacag 

ctgtccttcc. 

gagctgcagg 

gctaaggaca 

actgggggac 

acatctctag 

ccgaccttct 

tcccttccct 

gtggtcatac 

gctcttcatg 

catagctagt 

cctggattcc 

gggagggcgg 

cggagggcgc 

ccgaagaggc 

cggtgagtca 

tgggggtggg 

gagggggcgg 

tggaggcggg 

gctaaggaga 

aagaggaggg 

gtgccaagct 

gacccagcta 

cctgcgcttc 

tggcagggac 

tggatgggta 

tgtctggtag 

gctgggaagt 

accctccctc 

cagagcccat 



167880 

167940 

168000 

168060 

168120 

168180 

168240 

168300 

168360 

168420 

168480 

168540 

168600 

168660 

168720 

168780 

168840 

168900 

168960 

169020 

169080 

169140. 

169200 

169260 

169320 

169380 

169440 

169500 

169560 

169620 

169680 

169740 

169800 

169860 

169920 

169980 

170040 

170100 

170160 

170220 

170280 

170340 

170400 

170460 

170520 

170580 

170640 

170700 

170760 

170820 

170880 

170940 

171000 

171060 

171120 
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cgctgccaga 
cagccaagca 
aacactgctc 
gaccacagag 
ctcccaccag 
cccaggccag 
gagtcttttt 
gcgcagtggc 
tgcctcagcc 
ttgtattttt 
ccttgtgatc 
gcccggccaa 
aggcgaggga 
tctagcctca 
cacagagttg 
tgggagggaa 
gaggtagcag 
acattcactt 
tggttcttga 
tacaggtctt 
ccgtggagac 
tggatacaga 
ggacccccaa 
tctacacttt 
caggatggaa 
gcctattttc 
gggtaaaagt 
ccctacaaaa 
gatctcaagt 
tccctgctgt 
taccgtctga 
cacctcctct 
gccagcgctc 
ttgggaaatt 
ttcctgctgg 
cagcgtggct 
atgggtgggg 
cagtgccatc 
caggctggac 
gccgcctcgg 
gccccaggtg 
gagctgcccc 
ctcagccatg 
aggcagaaac 
caacctcgtc 
ttcaccttgc 
tgagggttgg 
ggaagtcagc 
cagccccaag 
tcccaaatgc 
ccaaagacct 
gagacctcct 
tacagggtgg 
tcaggagcac 
acgccatagc 



gaaggacact 
gttcatgcgg 
aggaagggcc 
tcattgtgga 
aagagggagt 
tgtagacagt 
tttttttttt 
gtgatctcgg 
tcccgagtag 
agtagagacg 
cgcccgcctc 
gatggagagt 
gggaggcggg 
ggtttagcct 
gggccggctt 
acctctccaa 
ctgtgagcca 
cacttggcat 
gcaggggaaa 
cctaccactg 
agagagggca 
ccctaagctg 
tcctcccata 
tcactggaag 
gcaaggctgt 
ccaccctcct 
ctccattcac 
ccataatata 
gtttttctcc 
atcagattcc 
caccagccta 
gagccatcac 
agctggccgc 
tgaaggggca 
gcaacttctt 
atgaggcagc 
tgtgatggag 
ttaggtgtga 
tagtgtcccg 
tatactgccg 
gtgggcgcct 
cggggacacc 
tctagtatga 
atggctggtg 
tcagggtggg 
cctaggggag 
gggagacaca 
ccactgcatc 
ctcctgctgc 
catctcccag 
acctgcctcc 
gcgttaaagg 
gagtttgggg 
tggctgctcc 
tggagcacct 



gtgcgctggg 
gaggctgcct 
aggcctgtcc 
aagaacttca 
gggcctgcag 
gccttaagtc 
tttttttttg 
ctcactgcaa 
ctgggactac 
gggtttcatc 
ggcctcccaa 
cttatatgtg 
acaggagaag 
cagctagatc 
cttgcacatc 
gtgaggccat 
atagcagcca 
cagtattttg 
taggactatg 
agatgctcaa 
tagcctcaag 
tccaaaaggg 
cctatgggcc 
gagctaaccc 
ccccagccag 
tctcacgctg 
acccttatcc 
cttcccacag 
acccaactgg 
ctgggaacaa 
gtggagaaag 
acacccccac 
ttgtacctgt 
gatcccaggg 
ggcatatttg 
tccaaatttg 
gaggaggtgg 
cagcccacag 
agagaatgca 
caagctgcgc 
gggtgagcgc 
acccacccct 
aggggagagg 
gagcctgtct 
acatgcccta 
caaagtctga 
gactacccca 
agctgtcccc 
caggaccagg 
ggcagggaga 
gctcctccaa 
aaggccctgt 
ttcaggattg 
cagtgcccat 
ccttctaatc 



tgttgacggt 
acctggtgag 
ccatgcttgc 
gcctgctcct 
gaacagggga 
aggtgtccta 
agacagtctt 
gctctgcctc 
aggtgcccgc 
gtgttagcca 
agtgctggga 
aggtctattg 
gagctatgaa 
ccatggggag 
cgtatcggtc 
tcccattcag 
acactcacag 
agggacaccc 
aaataaaggg 
aggtctagag 
actcattgct 
atctcaagcc 
aggcatgacc 
tgagcacagt 
aacaaacacc 
ttccctggct 
ctetccccta 
ggatccccta 
cctgtcttgg 
agttctcctg 
agggttcaat 
attgtgctag 
gaaggggaac 
gttctaggtc 
gccagggcca 
tgcagcacag 
gagccccaca 
tgctgcctga 
gage age tac 
ctgcaccagc 
cgctccatcg 

ggagggtcag 

gtacccaccc 
gaggaaggga 
cctagggcag 
ggagggcagg 
ggaggaaact 
ttcactcccc 
cacccaggcc 
agecctaagt 
acccctctaa 
gtccataaat 
ccctctccca 

gg^ggtcctg 

tcactccctg 



gcctgccatc 
gaegtgeagg 
atgcacccca 
gatgggagtt 
cagagggaca 
aaagcagagc 
gctctgtcgc 
ccaggttcac 
caccacgccc 
ggatggtttc 
ttacaggcgt 
aagaaggttt 
ggatgtggtc 
ctgcagaggg 
tgtcactggt 
ctgagggcag 
tggcaggcag 
ccaccaccgc 
aggegattet 
caaggagttg 
tggtegtaac 
tgcctgtgga 
cagggecagg 
ggacatccct 
tgctccccca 
gctcaggaaa 
tctggtcaga 
ccctgaagga 
tgcctggaat 
gaagtggggt 
gagctaaaag 
agtctctgcc 
ettgetgeag 
tgcattctgt 
ttccctcccc 
aggggectga 
ggccgcggtt 
ccctgcccac 
tcatcgccct 
tcctggacct 
actccagctt 
agggtcactg 
tggaggagee 
ggcacagccc 
cccaatctca 
actccgccca 
gctggggtac 
tcatcttccc 
aaggacagct 
cctgagtccc 
ccctgatttt 
atttccccac 
gtcaggagca 
ggcaggagga 
ctgtctcctg 



tggaaacagc 
cgggcccgag 
ccacccttga 
tgtagagttg 
aaagacacag 
ccaagatgga 
ccaggctgga 
gccattctcc 
ggctaatttt 
aatctcctga 
gagccaccgc 
ctcaggagaa 
tttgctggag 
agaactgtag 
tctggggaaa 
ttatccagag 
ggcacccaga 
ccacctttgt 
agecaatgea 

gggggggctc 

cctagagcct 
caccctggag 
tctgtaaatg 
gtgeaaagat 
gcgtcccctt 
gtctggccct 
atctggggaa 
aataactcca 
ttcagcttcc 
ggcctatggc 
ggctccccac 
aataccccca 
ccccgctatc 
ctagagtctt 
agectggtea 
gaggectaac 
ccccacctca 
cacccatccc 
ggageccgag 
gagtggccgg 
ccgtcagggt 
aagccagaag 
caatttgagc 
tgcccaaggg 
gggtggagat 
gccctgagtc 
tgacaaagga 
cgacacacat 
atggtcatcc 
ctctgagact 
gccatgacct 
agctgttgga 
ggttggagtt 
tgggagttga 
acccccagct 



171180 
171240 
171300 
171360 
171420 
171480 
171540 
171600 
171660 
171720 
171780 
171840 
171900 
171960 
172020 
172Q80 
172140 
172200 
172260 
172320 
172380 
172440 
172500 
172560 
172620 
172680 
172740' 
172800 
172860 
172920 
172980 
173040 
173100 
173160 
173220 
173280 
173340 
173400 
173460 
173520 
173580 
173640 
173700 
173760 
173820 
173880 
173940 
174000 
174060 
174120 
174180 
174240 
174300 
174360 
174420 
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cgggagcagc 
gagctgtggg 
ccctaccggg 
tcctccattc 
ctcacagcgg 
agttgcagag 
aggttcaagc 
cccggggcag 
ccgcccacga 
gcggcggagg 
aggagctcta 
gggctccggc 
cctgctccac 
tgctgtgccg 
cctgggtaga 
caggggcgct 
gccacaacgt 
cagggtttgc 

gggggtctgc 
gcggggagcg 
gaactggggc 
ttctctacac 
gcaaggggag 
tcctctgagt 
caccacgtgc 
cacggacccc 
ccaccaacag 
aagccatgaa 
acctgagctt 
ctatcagtgc 
gaacacctgc 
gccaaggcaa 

tgggttctca 

tgtcttgcac 
cctccacgtc 
aggactcccg 
gtacaggcta 
tcccccgccc 
ccctgctggc 
ccgagtcagc 
tcgtggtccc 
cgccgggcgt 
gctttgtgcc 
gcaccgacgt 
tgcggcgcag 
gctgcgcggc 
tcagcctcga 
gccgcgagat 
acgtcagcac 
ccggcgcggt 
agcgggttgg 
cccgggggag 
attgggcact 
gagaagaggt 
gaacgcaaca 



tgcgaaggtc 
cagagatgca 
caccatatac 
gctgtaccca 
cgcccctaac 
cctctggaac 
acctgaagcc 
ggctggaggc 
gtgtgtgccc 
caccgtggac 
caaggcatct 
cccgccactg 
cccagggggc 
catcttcggc 
tctgaccatc 
caacatctcg 
ggagaccgct 
cgacccggga 
ctgattcatc 
gcgagtgagt 
aagacagaga 
ccccaccccg 
gcccctccca 
gagtagggtg 
agtcggtgcc 
aaactggggc 
cgtgaacttc 
cgagctcttt 
ggtcccccac 
ctagatacct 
gggatgaagt 
aaaacttctg 
aaacgttatc 
acaatgtcta 
gtggtgggag 
tgagctctca 
gggaggggag 
cttctcctct 
acggccggag 
ggtgctgcag 
gcacgacgtg 

ggtgcgggtc 

tgggcgccac 
cttcgagcgc 
ctactgcccg 
agaggatgcg 
gcttgagccc 
ccgcgccgcc 
caat cgctcc 
gccagcgccg 

ggcggggg^a 

ataaggtcat 
ggtccgctga 
ttgcaagaca 
gaagagtcct 



ccgccacagc 
agcaggtagg 
tgatgggggg 
acctggccgt 
tcccactcct 
catttctccc 
cctcacgtcc 
tgggcgaggc 
gcgctcgccg 
ctgacggtgc 
ggtgagtagc 
ccccctggcg 
ccttatggcg 
gaggacttca 
gccttcgagg 
ctgcccttct 
ctgcgcagga 
atgaccgtgc 
ccacatatac 
gccccagccc 
atcatagcgg 
ggccctgcgg 
ggacctcgta 
gagataaggg 
caggcacttc 
aagcagctgg 
gtgaagtggt 
cagcccaccg 
ccgcccctac 
ccacacatcc 
gcagtggtga 
gtgcctcagt 
acagggataa 
ggtgtttaat 
agaagtgggg 
aagaaagtgc 
gcgccggtgg 
gcccccttca 
gtgcagggtg 
cacgcggtgc 
ggcct caeca 
cgccgctcgc 
ccgcccgaaa 
ttcgtggccg 
gcgcgtcccg 
cgcttcatca 
gccgactgcg 
atgcagtttg 
gtgcgcgcgt 
tctgcccggc 
acgatagttc 
gggagagtgg 
ctgccaggct 
gagegegcag 
ggtctgaact 



cgcacgttcc 
gggaaagggg 
aagggcatgt 
cccctcacag 
ccaggggatt 
ccacaccctg 
ctcccccgac 
tggagggggc 
cegcaggaga 
accagctgga 
caggcggcgc 
gcccggcgag 
cggtgggcgt 
tcgccacctt 
ctcgcaagcg 
ccttcattga 
gcaggtgggt 
actggagggt 
actaagecag 
agcaggctcc 
ggaggeggea 
gcccatgctc 
cctggaacct 
actaacatcc 
tgcctggagg 
gccctgaccg 
cctcacaggg 
teagegggat 
atgaacaaac 
atacactgtg 
ttaagagcta 
ttccccattt 
aatgagctga 
acgtgtaaaa 
agcgtgagtg 
tcaaatggct 
ccgcctagtg 
cccgcgtccc 
tgaagctget 
aggeggeget 
tcctcaaagg 
cgctcaccta 
agctgctggt 
ccgagcagtc 
gccagcggcg 
ccgaccccgg 
gccaggacac 
gcgacaccga 
ccatcgactt 
cccgccctct 
tgcagtctgc 
gtggggacac 
gaagggaccc 
cccggcaagg 
tggecgagta. 



tggtggagtc 
gacggagtgt 
ttgeaaagee 
tcacccgcac 
ctccgcggac 
cgcccatatg 
cctgcagaca 
gcagggctga 
ccgctacgtg 
gcagccccat 
cccggtaccc 
cgctgacgcc 
ggacctggcc 
caaaaggcaa 
cactgctggc 
cttctaccgc 
cctgagcccg 
cccgggcccc 
cagggegteg 
acccacggaa 
atgectatet 
ctcggtttcc 
ggagcaggct 
cgcaggtcca 
cagaggtaga 
atggatattt 
gatgetcega 
catccagcac 
agatgeagaa 
atgagaccta 
gagggttgat 
ataaaatggg 
agtacctaga 
tgeatatect 
ttggggaggc 
actttctagt 
gtggcctcag 
cccgtcctgt 
gttcctagtg 
gggcgcccgc 
cgcggtgctg 
tggcgtgggc 
tcgcgacggc 
ggtggccctg 
cgtactcatc 
cgtgcgcaaa 
cgccggcgcg 
aattaaggtc 
tctttccaac 
ttcggttcag 
gcctttccac 
acccagagac 
gecaaggact 
ggcatgtgac 
ggggtggggg 



aggegtagga 
tatccttggc 
cgtctcttcc 
cccca.cccca. 
gctcgggtgg 
tggtggtctg 
ggccttggga 
gggtgcgagg 
gtggccgact 
ggcaccctca 
agcgcgaccc 
ctcttcgccc 
ttcgagcagc 
cggccggcag 
ccacaccgtg 
aagcagcggg 
egggctcagg 
aaggaacggt 

gggtggggcg 

tccgcagccc 
cctcccagcc 
ctgcaccaaa 
ggcaactaaa 
gtcctccaga 
gaataaggac 
gcccctttca 
atgtcttgtg 
ataggtgagc 
taattccccc 
gaatcatcta 
atgtagtctt 
gtgatagtat 
gtgagcacaa 
tatctctcgt 
gaagccctcg 
cgecaggtag 
tggctctctc 
cccgcagagg 
ggeggctteg 
ggtctgcgtg 
tteggecagg 
gtgctcaacc 
cgccgctggt 
ggegaggagg 
aacctgtact 
tgtggcgcgc 
cctcccggcc 
accgccgfccg 
tgagggcgcg 
gggectgegg 
gccctccagc 
tggctttggg 
gaacgggfcaa 
cccgaaggaa 
tgggatggca 



174480 
174540 
174600 
174660 
174720 
174780 
174840 
174900 
174960 
175020 
175080 
175140 
175200 
175260 
175320 
175380 
175440 
175500 
175560 
175620 
175680 
175740 
175800 
175860 
175920 
175980 
176040 
176100 
176160 
176220 
176280 
176340 
176400 
176460 
176520 
176580 
176640 
176700 
176760 
176820 
176880 
176940 
177000 
177060 
177120 
177180 
177240 
177300 
177360 
177420 
177480 
177540 
177600 
177660 
177720 
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ggaggagccg 
tcatcgaccg 
tgactggaag 
cacccaaata 
cattattgtg 
gggaaggccc 
gagcagctgg 
ctattgactc 
agtgtgaaat 
ccctgaggtg 
tggagaccct 
agctttcagc 
tcccaaacaa 
attcaaatta 
ctcttgtggg 
ccaagtcttt 
aaatccacct 
agtcgcttaa 
tcctactcat 
ggcagcagta 

gggagtccct 

tcctcagaca 
agagaggagg 
cagaggtgag 
ggaagccaca 
ccggggcagg 
gtattccagc 
ccggtctgca 
cccactaagg 
gaggttgaga 
gacagaaacc 
cacccatgac 
ctagcagtat 
tataacctgc 
cccctggcac 
aagaaagggg 
cacaggtctc 
ccaatgtggc 
ggctgttttc 
gcagggtagg 
attgaagggc 
cacgaaactg 
gtgagcctga 
ttatccacac 
gaaggctgga 
aggtaaccat 
gacagggagc 
gaggtgccac 
accccatgtt 
gggacgggca 
tccttgctca 
ttcaggaagc 
cccacaggcc 
agagagccca 
acagacccct 



caggaggaag 
actcagcccc 
tggactccag 
tgactgtgaa 
g t ggagg c aa 
caaggtcagc 
agggtggatg 
ggtatgcctt 
gtcaggcact 
acaataaaac 
aatagtggga 
tgtgcatagg 
ctccactgct 
attgtcttgg 
gctaggtgct 
ccctggccca 
ggcctccggt 
aaaccaagta 
ggtggacagc 
atggggccac 
gtggcttggg 
gcctcctttc 
aagtgggaga 
aggcgtgggc 
ccatcacgca 
cggggcctcg 
tcggcgccca 
aggcagggtg 
ggacccctcc 
acccctgagc 
ctcgcagtcg 
gcgggcctgc 
ctcctctttg 
ccggggacag 
tcacgggccc 
cagggatggc 
aaaaacccat 
tccaagagtg 
tcccaaaagg 
ctttgagcag 
aaggtcatga 
acaccctagc 
taggaacagc 
ggtgtgtgta 
ggcttcaaac 
cacaccatgg 
aaggcctgga 
atccccgact 
ctcactccca 
ccacgctgag 
gcctgtgcac 
caacctacag 
cacccagttg 
gaaagcccaa 
tgagggtccc 



gaggttgtgc 
aaccgggagc 
agggacaggt 
ttgtggaaag 
ttatgagggt 
ttatgaagga 
gggaggccag 
taggacattc 
tccctctaac 
atttatgctc 
ggaatgcaag 
gaactgacca 
taagacctct 
aagccagctt 
ataataccac 
tctgcaaagg 
atccactgga 
acacagacct 
agtggggact 
gacgccacag 
gtggggagtg 
gggctcaacc 
ggaggacgag 
aaggccactc 
gcatgtcggg 
ggcctcagtc 
cacacttgac 
caagtcagtg 
ccttccctca 
cgggcaagga 
agcttgtgca 
accgtcacgc 
aggacgccct 
gggcaattgg 
ctacccacca 
ccttctggag 
tcttcagtga 
aatgaagtag 
aaggcaggga 
atccgatggc 
agaggctggt 
ctctgccagg 
agcctgaaac 
gctccagcag 
cagttcctca 
gcaggtacag 
gtgtgggacc 
tgctgagctg 
cacccatcca 
gagcttgagg 
tgggggcaga 
caggagaggt 
tgccacctat 
ctccctccat 
agccctacag 



agggtctgga 
caggcagaaa 
gtggtggcac 
ggcagtagat 
agcatttctt 
ccccacttgc 
agggagcaat 
tcttaccgct 
tggcatgcaa 
aaggggaagc 
ggttcccggt 
gaagggggtg 
ctggcctaca 
ctggcctaaa 
gcggtgcccc 
agtaggcatt 
agtttatttc 
gaggggtggg 
agggaggggc 
agccagctcc 
ggggacccac 
catttcttcc 
ggaggggcag 
cacccccaca 
gggacaaggc 
aaagccgtgc 
accatccagc 
ccatgctggc 
gggatcagct 
tcccttgttc 
tccctcctcc 
gcacacaggt 
ctttgtgcgc 
tcagcacctc 
ccatgggggg 
cccagagacc 
gctggacatg 
ggtccaattc 
gaggcggtga 
aagaggtaaa 
gacggggaca 
ctgggagtgg 
ccaaggtctg 
gtttttctag 
gcagctccca 
ggaagtatga 
tgccatgctg 
cccatccacc 
ggccctgctg 
tgcaggctgg 
gtgaaggtaa 
gagggaggga 
gatcccatgc 
cactccagct 
cttggccagc 



cctgcagggc 
aaccctgtgc 
agtcctggtg 
ctctaatgtg 
tcgagacaaa 
accccacccc 
gaggggtggt 
catgggcctc 
cagccccacc 
cacagcctgc 
gctagagaga 
ctgctgtctc 
catgaggtcc 
atgccaccac 
tgcctcctga 
accccaaccc 
tttagggttc 
ggctggggac 
a ggagaggtg 
gtcctctccc 
cccaggccct 
ggcaggagac 
ggtggcagca 
cccacGGcag 

ggggtttaag 

cagtcgctgt 
agcatgggcg 
ccccggcccc 
ggaggtaggg 
agccttggtfc 
aaccaggagc 
gccagtgcca 
cgagtactca 
ccccagcctc 
actgcatgct 
cattccccct 
cctccagcct 
aggcttccaa 
tgaggagtga 
ggcctgaggg 
tgaaactgaa 
ggcatggggc 
ggactggtgg 
agggttaggg 
tcttggttac 
gctcatggac 
ccacagtgca 
ctagttggaa 
gaggagggga 
ggacaggtgc 
tctcatacct 
ggcaccttca 
gccccagcac 
gatcgacccc 
aagatcctca 



tgaagttcac 
cgtaggaaag 
tggtgctgac 
gaggtgggaa 
acacccgtct 
agccatggaa 
cccagctctg 
agtttcctaa 
tgcctgagag 
tgatatggcg 
gaagggagaa 
ccatcaagca 
ctctctcctc 
ctgtgcatac 
gtgagtctac 
cagagaacaa 
tatcccaacc 
tgcacctccc 
gctgaagcaa 
agaccctggt 
ccctctccct 
tgaggcacac 
caaatgaagg 
agaggggcga 
gctgaggggc 
gctctgagtc 
tgccgtggtg 
acccatgcgg 
acctgccaag 
ccctgaggag 
ctcacgctct 
ccttcgcagg 
cacttgacac 
cccgatcctg 
aagccccccc 
agcaggggtg 
gtgggccacc 
aagaaggt ct 

ggggggcagg 

gtgtcaatcc 
gctggagctc 
agggcctgaa 
gtgctaggag 
aggcaggagg 
tgccccacgg 
ttctgttcag 
agctcacaaa 
gtaagggagc 
cgcaccttca 
ctcgcggaca 
gtgcaggatc 
cttcctgctt 
ctgcttcccc 
cagccctttc 
gcatctgtct 



177780 
177840 
177900 
177960 
178020 
178080 
178140 
178200 
178260 
178320 
178380 
178440 
178500 
178560 
178620 
178680 
178740 
178800 
178860 
178920 
178980 
179040 
179100 
179160 
179220 
179280 
179340 
179400 
179460 
179520 
179580 
179640 
179700 
179760 
179820 
179880 
179940 
180000 
180060 
180120 
180180 
180240 
180300 
180360 
180420 
180480 
180540 
180600 
180660 
180720 
180780 
180840 
180900 
180960 
181020 
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cacgggcccc 
gactggccaa 
tcctgctatg 
tcccatgggt 
cctccattcg 
aaattagaga 
accagccact 
tgcccactca 
cattcacagt 
tagtatgaag 
gccagcaggc 
cacagttact 
gttttagcca 
cccccctcat 
gctaactgtg 
atgtgcagaa 
acacccagga 
gttgtaaaat 
gcaagcaaaa 
ttaaaataag 
acattttaaa 
tgggaggccg 
gtgaaacccc 
taatcccagc 
tgcagtgagc 
aaaagaaaaa 
tagtatgact 
gtggatatga 
agcatgagtc 
gaaaacttat 
ggccgacacg 
accccgtctc 
ccagctactc 
t gage t gaga 
aaaaaaaaaa 
tgactgtggg 
tattaagaca 
ttccatttac 
tggacagaaa 
cagagtcttt 
cagcctcctg 
cgtgcgccac 
ggecaggcta 
teaatgggee 
tattgaccaa 
aacagggggt 
ggttaaaaaa 
catacattct 
aagaaactca 
tcaggccact 
gatgacagca 
ttgaaatctc 
gaeatgecta 
eggggagggg 
tgtcactctc 



aaagtcctca 
aggctatgta 
catggcaccc 
cattcacctg 
ctcataggat 
tatgacagta 
gaactggggc 
aaggctgtgt 
gcttggtgac 
ggtcacaagc 
aaagggtgcc 
ggggctgggc 
ttcaatggga 
getctgetgg 
accctgtgac 
aaatactctt 
acagggagee 
ctcgtggaaa 
accagtgtgg 
ctggatccat 
aagctggaat 
acaeggaegg 
gtctctacta 
tactctggag 
tgagatggcg 
aaaaagctga 
gaccgtacgc 
aaactggaat 
ttaccataca 
atccagggcc 
gaeggatcac 
tactaaaaat 
tggaggctga 
tggcgccact 
gctgaaataa 
tacttttaaa 
tgagtggatt 
tctcttgggg 
tggatacaaa 
ctgttgccca 
ggttcaagca 
cacgctcagc 
aacctgggcc 
agacctcccc 
cctgaaggcc 
gataatacta 
aaagttgcac 
tccattctca 
ggtgaccttc 
ggcaccacag 
gtggctctgc 
actgcctccc 
taagcccggc 
aagaagctgc 
ctgggtgacc 



gtggctcatt 
gtgtggccag 
agcaaggcac 
tccccatcac 
gacaccaggc 
gaccagctgt 
tcctggctgc 
ccaatggccc 
aaccaccctc 
cagggtggtt 
tgctgtactc 
agccccatcc 
ccaggttgat 
tagggggcac 
ccaggggtac 
cgctgcccta 
agctaagtca 
attatttact 
acaeggtgag 
caccatagtg 
aagccaggtg 
atcacgaggt 
aaaatataaa 
gctgaggcag 
ccactgtact 
aataacacac 
tgaagaggat 
ggacattget 
atctagcaat 
gagcacagtg 
gaggtcagga 
ataaaaaatt 
ggcaggagaa 
gtactccagc 
cacacatcag 
tttcttcata 
ttgeaatgag 
aagctgtcag 
cctgggcccc 
ggctggagtg 
attcttctgc 
taacttttgt 
ccttttgagc 
cgacccaagg 
tcagtctcca 
cttcaaagag 
cttgttattg 
caccaactct 
ccaaggtcac 
ccacgacact 
tcatcttgat 
atctgacaga 
tgagaggaac 
tgatggggta 
atgaccaccg 



tcatagacag 
ggtcaagtcc 
cacccactgc 
tgecagtate 
cacagatagc 
gtcagcccag 
cctgctggca 
atactcacga 
tgggacaaat 
gctgagcagg 
ccagcttgca 
ctgggggcca 
attgetacat 
atccctcaag 
tacttataga 
tttataacaa 
atgatgacac 
gacacaggta 
gecagctttt 
ttttcagaaa 
tggtggctca 
caggagatcg 
aaattagctg 
gagaattget 
ccagcctggt 
atcagaatat 
gegtattgae 
atgaegggat 
ttcactccta 
gctcacgcct 
gatcgagacc 
agctgagagt 
ttgettgaac 
ctggtgacag 
aatattaacc 
ttttctctgt 

gggagagagc 

ctgtaggaca 
ccaggttctt 
cagtggcaca 
ctcagcctcc 
atttttagta 
agegcagcag 
gcactcctgt 
ttttccccaa 
cctaactgag 
ccatagttag 
aggggatgtg 
agagecagaa 
gccttccatt 
ctggatgaca 

tggggcaacc 

ggggataggc 

ggtgtgcatg 

agtcatgcag 



gaaaatggaa 
tccctagact 
ctacaaggga 
ccagggtcag 
gtgggatgag 
agctggccca 
ctggactggg 
cccaagcaat 
gecattcttg 

ggggctggtg 

tgggcacaca 
actgggactg 
gacaaacttc 
aaggctttct 
attttccagg 
tgaaaacaaa 
atatcatgga 
ggagacagtt 
gtttttggaa 
aacaaataaa 
cgcctgtaat 
agaccatcct 
agagtggtgg 
tgaacctggg 
gacagcaaga 
taaccatggt 
tgtgctttcc 
ggagctttct 
gatgactacc 
gtaatcccag 
atcctggcca 
ggtggcacgc 
ctgggagctg 
caagactccg 
atggttattt 
cttccaaatt 
tatttttaag 
aagagtggga 
ttttgtttgt 
atctcagctc 
caagtagctg 
gagaegggtt 
accccccact 
tcaactctgg 
tccagctcca 
ccagagattt 
cttcacacct 
attgtctcca 
tggctggcct 
gctattgtct 
ttctaaatgc 
aaggcacagg 
acccagaagc 
taccttgacc 
cttctcatca 



accacaaaat 
tettggecat 
aagttcaaag 
ggatgctggt 
ggatgggatc 
cagacctggc 
gatgcctgcc 
ttctgttgtt 
gaaactccag 

ggggtgccac 

gagtcctget 
getgeagaga 
aatccatgtg 
ggctgtacct 
ggaaatagag 
aacagcegta 
ccagtaggca 
cacaacagaa 
acaaagcttt 
taaataaata 
cccagcactt 
ggecaacaga 
cacgcgcctg 
agctggaggt 
ctccgtctca 
tattttggga 
ttctttgctg 
tacaaagctg 
caagtgaagt 
cactttggga 
acagagtgaa 
gectgtaate 
gaggttgcag 
tctcaaaaag 
tgggatagta 
ttctcatgtc 
tgttggtttg 
acttcagagt 
ttgtttgaga 
actgcaacct 
ggactacagg 
tcgacccatt 
cagggccatc 
acccctgatt 
ccaagcagaa 
ggcaaagaag 
gtatcacata 
cttgagagaa 
agacctgaac 
gggtgaagca 
tctcatctca 
gaaaggcaca 
caggcagagc 
agaaagctgc 
aagtggacgt 



181080 
181140 
181200 
181260 
181320 
181380 
181440 
181500 
181560 
181620 
181680 
181740 
181800 
181860 
181920 
181980 
182040 
182100 
182160 
182220 
182280 
182340 
182400 
182460 
182520 
182580 
182640 
182700 
182760 
182820 
182880 
182940 
183000 
183060 
183120 
183180 
183240 
183300 
183360 
183420 
183480 
183540 
183600 
183660 
183720 
183780 
183840 
183900 
183960 
184020 
184080 
184140 
184200 
184260 
184320 
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ggctgtggga 
tgttgcctag 
cccccttgtc 
gggctctttt 
cccgacctgc 
actggctaat 
agtcttccag 
cctgcccagt 
tgtgtcagtg 
gacaggaaca 
acactcaggc 
aagaagagag 
ctagcccacc 
aagggcagac 
tgaatgctca 
taggccctcc 
ttgagccggc 
ctggtgagcc 
tatcccttct 
cacatcacca 
acggcaggga 
gcaggggtcc 
ccaatcacat 
tgcgggccct 
gtcctaccac 
ccacgcagct 
accagcggga 
ccatggctca 
cagcagccca 
ctgcccacat 
ccagcatgtg 
cagcaccccc 
ccagggacca 
gtcacacaag 
gaacagggga 
gggccatatg 
accctgccct 
ggggcagtga 
caaaaagaat 
acagagatta 
cctcatttgt 
gaggaagaat 
ggaaagctct 
taagtcattc 
taaaccagag 
agaggacctg 
acccaggctg 
gagcgactct 
ctagctaatt 
cgaactcctg 
tgtgagccac 
tggaggtatg 
gcctgtcaca 
ggattcctgc 
tttgctaaaa 



tccttctgca 
gaagagtggg 
ccctcgtcac 
cagggctaga 
ccctcccaca 
atgacccttg 
actcccacca 
gctctccttc 
catgaggccc 
ggctctgggg 
cacctgggcg 
aatccaagga 
atcaaggccc 
ggatgagaca 
ggctgtagag 
caccaagagc 
ctcttggata 
ccacagaacc 
ggactctgaa 
gtctcagtgc 
cacaagagca 
tgctagggct 
tcgcacacca 
acctgcagca 
gccaggaagg 
cctgccagac 
gcctgaagga 
gggacaccag 
agtggtgcct 
tcaccaagag 
ctgagctgca 
agagactggc 
gttctggtgc 
caatgcagct 
ccacataaag 
caaaacagat 
gagtctcagg 
gaggcacccc 
aatgagtgca 
ttagaaaaga 
caccactgac 
tcagcctgtc 
tctttcggga 
tgcaacccct 
gggaaggttg 
gtatggtttt 
ggatgcaatg 
cccgcctcag 
tttgtatttt 
acctcaagca 
catgcctggc 
tgacttctcc 
ctgatgcctc 
caaaaaggtt 
acacagagga 



tcgtggcctg 
agagggctct 
ccccaagact 
tggacactag 
aaaatccatc 
cttggagggg 
agccagggca 
acagtgcaaa 
ctgcctgccc 
gatctgcctg 
aggctgcagc 
agcttccaga 
tcagctgtct 
ggcagggtag 
gccgagagca 
cctgcccagg 
ctcggaggcc 
ctcaaatctc 
agaaacccaa 
tcaccataag 
cacacctcca 
tgcgtgagcc 
gcacacacgc 
atggccctcc 
cacttaccct 
accgccactc 
tagggagtgg 
aggagctccc 
gtactctcta 
ccctcacccc 
gcttctaggt 
aggaggggca 
tgctgacccc 
gggtgcggtc 
taagcgtgcc 
gcctgtggag 
cctgcctccg 
aggaagcctc 
actgatggaa 
agagagaaag 
ggtagcagtg 
tcttggcctt 
agaattcttg 
aaaggaacaa 
acaggaagca 
tgtttttttt 
gcacgatctc 
catcccgagt 
tagtagagac 
atctgctcac 
cgaggacctg 
cagtcacaca 
ctgaagtaag 
taacctggaa 
gaggggaaca 



ccgcaaagcg 
gagggctttc 
gcccttgaca 
gatcatgagg 
ccctgagagt 
gtggggctgg 
gcgtgggact 
gctccccatc 
atcctcatga 
gggtggcaga 
tgccactaaa 
cccacctcaa 
ctgagctggg 
gcgggagccg 
cggccctagt 
gtcctcttgg 
caggggacca 
accagatcac 
actcccttcc 
cccctcggct 
cagaccagcc 
ccagaacctg 
actcacacat 
taccagggcc 
tgttggctgc 
acgctcagca 
ggagggcagg 
tttaggaagg 
gcacgggggc 
ttcctgatac 
gccaaagcaa 
gcctggaaag 
aaaggctggg 
agaggcaaca 
cttagagctt 
ggggtgtgcc 
cctcagctcc 
caatgggccg 
acagatggaa 
caaaacaaac 
ccctctttac 
tcttgtaacc 
ctaataaata 
atgctggagg 
gatactcaca 
ttttttctga 
ggctaactgc 
agctgggact 
agggtttcac 
ttcggcctcc 
gtgtttaaga 
ctggtctatc 
gcagcccaaa 
tctaaaccag 
aatttaatga 



gatactccgg 

cctgtcccca 

tcatccagct 

cttgaggcct 

gagcgtgggg 

tgccagagcc 

cagcccagtc 

cctggggcct 

cccaaacctg 

aacagggggg 

Gctcactggc 

gacccccttc 

tcgggttctc 

agctgaaggc. 

cctctctgac 

cgggaggacc 

gcctggcaca 

cctggcctaa 

acagcctgct 

cccccaaccc 

taaggagtca 

gcacaaacac 

gcacacacac 

caccggaaga 

agccatggac 

gcctcccatg 

ggtcagggcc 

actcaaaccc 

ttctgcccac 

cccagtgact 

cagcatagga 

gaggacttta 

cctggagcta 

gggtgctcga 

tccctcctgg 

cccacctgaa 

ccaacggcag 

agctgggacc 

cacataaaag 

aaacccaagc 

tctgaaagct 

aaattgccct 

acaaagaaat 

caacaagaac 

ggcgcccaag 

gacggagttt 

aacatctgcc 

acaggcgccc 

catgttggcc 

caaagtgttg 

ggaatggcac 

tctctactct 

gtcacagcat 

cctctaactt 

ctccatgaga 



actctgggct 
ggaaactcct 
cccactggct 
ccccgccggc 
ggactcaggg 
aggaagggac 
ctctggatac 
acccctaagc 
aaagggcagg 
atcaaaaaac 
ccctggcagc 
cttcctctgt 
caaggccaga 
ctggactgga 
ccctggcccc 
tgatccagct 
tccacaatcc 
aaccctccct 
gtggccctcc 
ccacagtgga 
gcagggctct 
tgacatagtc 
acacaggtgg 
gcttgtgcaa 
gccctccctg 
ctccagggac 
acccatggac 
ctaccgctct 
cctctaccga 
cagggtcctg 
ctcctatccc 
ttggtatttt 
ccttattcag 
tggtgggcaa 
tgatcggtca 
ccccattcag 
cctcagcaca 
atctgagcat 
cagtgcattc 
cctcaaaagc 
gaggattaaa 
ggtggttgat 
gaccaaatgc 
caghggatgc 
cgtcactgac 
cactcttgtc 
tcctgggttc 
gccaccacgc 
aggctggtct 
ggattacagg 
aagtttgcaa 
ctgcaggaca 
cactgacaag 
caacttctca 
aagcaatgag 



184380 
184440 
184500 
184560 
184620 
184680 
184740 
184800 
184860 
184920 
184980 
185040 
185100 
185160 
185220 
185280 
185340 
185400 
185460 
185520 
185580 
185640 
185700 
185760 
185820 
185880 
185940 
186000 
186060 
186120 
186180 
186240 
186300 
186360 
186420 
186480 
186540 
186600 
186660 
186720 
186780 
186840 
186900 
186960 
187020 
187080 
187140 
187200 
187260 
187320. 
187380 
187440 
187500 
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acgagcccag 
gttttcaaat 
cccagcactt 
aacccagtct 
ccagctagtt 
tgagccaaga 
aaaaaaaaaa 
caaactacag 
gtttttgaga 
taatctccca 
cacgcaccac 
ctccggagct 
agccgccagg 
ttcattcact 
gatagagacc 
agtttgccaa 
caggacaact 
gtgtttcgaa 
cctgtaatcc 
ggaccagcct 
gcatggtggt 
gaacctggga 
gacagagcga 
cgcctgtaat 
caagaccagc 
aggcgtggtg 
tgaacccagg 
caaaaggagc 
aaatgtaaat 
tatgtaaaga 
tgacactggc 
ggctgggcct 
agatgtggcc 
ccctgcctca 
tggaccaggc 
gaagtggagc 
cctggaaagc 
ccccccgact 
acaagagtgt 
gaggcagtga 
ctcaggttaa 
ctctgaactg 
tccccccgcc 
agccacacct 
aaatcccagg 
tgatatccct 
cccggcactc 
gaatgaccac 
gatatggtca 
ctcttttgcc 
cctcccactg 
ttgagaggga 
actcagtaaa 
aggaagcaag 
tggagaacct 



aaggcaggat 
gtcaagaaga 
tcacggatca 
ctaccaaaag 

gggaggttga 

tcgcgccacg 
aagaaagaag 
ctgattacac 
cagggtctca 
ggctcaagca 
catgcccggc 
taagcgatcc 
cctggcaaat 
tacatgttgt 
atatggcctg 
cccttgttct 
gacctgtttt 
ctacctatca 
cagcactttg 
ggccaacatg 
gtgcacctgt 
ggcggaggtt 
gactctgtct 
cacagcactg 
ctggccaaca 
gcgcatgcct 
aggcggaggt 
aaaactccgt 
ggcctggata 
attatgtcct 
tcatcagtcc 
gtggagggca 
tcacctcgct 
tgggcagggt 
cagcaacccc 
tacacccatt 
tcagcttgtc 
acagctgccc 
gaggccctcc 
cagcccagtg 
actgcaccag 
gtccatgcag 
ctgcctttgc 
tcgacagggt 
acaggactac 
cttcctcacc 
agcgcagcac 
atcacatttg 
gggccagtga 
tcacccctgc 
ggactcctca 
catgccccat 
gggcaaacac 
ggacttcagg 
gctggtcgct 



attctacagg 
aaaagaaagc 
caaggtaagg 
tacaaagatt 
ggcaggagaa 
gcacttcagc 
aaaaataaaa 

agctgatggt 

ctctgtccag 
atcctcccac 
taatttttgt 
acccacctca 
aaatttttgc 
ctgtttttgc 
caaagtctaa 
agatcaacac 
tttcaccaca 
aagaagacat 
ggaagccaag 
gcgaaacccc 
aatcccagct 
gcagtgagcc 
caaaaaaaaa 
tgggaggctg 
tggagaaacc 
gtaatccagc 
tgcggtgagc 
ctcaaaaaaa 
tgaggtaaca 
tattttgtca 
agggacagaa 
tgatctcaag 
ccccacttct 
gttcacagtt 
ctcttggagg 
cacctctggc 
cccagggaac 
ccaaagctgt 
tatcctgcag 
tcctcaaggt 
tggctggtga 
ggcaccctcc 
ccaggtcatc 
ctcagcccca 
agctcttttt 
ttacaagggt 
cagacaggga 
cttggagggc 
ctgcactggg 
cctggggggc 
ggccttgctg 
gcccactgag 
tgagggcctg 
aacccaaagg 
gtcctcgggg 



atgactgacc 
aggctgggaa 
agttcaagac 
agccgggtgt 
tcacttgaat 
ctgggtgaca 
cgagggagat 
aaggttgctg 
cccaggctgg 
cccagcctgc 
agagacagag 
gcctcccaaa 
aaataaagtt 
actaaaacaa 
accatttatt 
acacttaaca 
tcagctgcaa 
tttggggggt 
gcgggtagat 
atctctacta 
actcaggagg 
aagatcccgc 
aaaaaaaaaa 
aggcgggcgg 
ctgtctctac 
tactcgggag 
cgagattgcg 
aaaaaaaaaa 
ctaagaaatg 
gggaaacata 
gaggaactac 
gtccctctgg 
caatgtgggc 
tgggtaccac 
tgagcttcca 

cagggccact 

ttggtcttgg 
gaactcagag 
aggctgctcc 
ccccccatct 
gcaaaggctg 
ctgtcttcct 
ccttcagcca 
acatttcttt 
ggaatcccct 
agtgactgtg 
ggagctgtgt 
ctggccaagg 
ggctctgagg 
acttccctct 
cagtcagctg 
ggtaggaccc 
taaccctctg 
aaactgggaa 
agctagactc 



tgttttttgt 
cggtggctca 
cagcctggcc 
ggtggcgggc 
ccagaaggcg 
cagcgagact 
tattctagat 
cctgttgtgt 
agtccagctc 
caagtagctg 
ttgcccaggc 
gtgctgggat 
ttactaatag 
agttacttcc 
atttggccct 
gaccaaaaaa 
gaagaaaaat 
gccgggcgct 
cacctgaggt 
aaaatacaaa 
ctgaggcagg 
cactgcactc 
aggccgggtg 
atcacctgag 
taaaaaatac 
gctgaggcag 
ccactgcact 
agacatttgg 
agttgatcat 
ctgaacaact 
agggtggcct 
tgccaagctg 
aaagtccacc 
caggagcccc 
ggccccagcc 
gtggggtcag 
agagcagcct 
tatatgacaa 
agccacacct 
cccaggatgc 
caggcccttt 
cccaacctac 
ggaacaatcc 
tttaaggatg 
ttctcatgtc 
tttttgtcaa 
ttggtttatg 
cccgtacttc 
aagcagaact 
ccctggcccc 
gttacctgtc 
acatctcatg 
gatagtgaca 
aaccaaacct 
ttgtgcacac 



ttttgttttt 
cgcctgtaat 
aacatagtga 
agctgtagtc 
gaggttgcag 
ctgtctcaag 
aggaaagcag 
ttttgttttt 
actgccaccg 
agaccacagc 
tggtctcaaa 
tacagacgtg 
aacactacac 
aagtagttat 
tcagagcaaa 
ggataatctt 
aaaggagagg 
gcggctcaca 
caggagtttg 
aatcagctag 
agaatcgctt 
cagcctgggt 
aggtggctca 
gtcgggagtt 
aaaattagcc 
gagaatcgct 
ctagcctggg 
gggatctgga 
ggaattgtat 
tagagatgag 
catagtttag 
ctctgggagc 
caggcccaag 
ctttggccaa 
ca gggtgcag 
ttgcctcagc 
gccgtaaccg 
atggccaagc 
ctgcagccag 
ttcctgaagg 
tcacaaaaca 
tgcctgaccg 
cgtccctccc 
cacaccctgc 
tgggatcatt 
accctgtgtc 
tttgttgaat 
agccctaaat 
ccccacccca 
caccaagtgg 
catgcctgct 
cagtgccagt 
acatagaggc 
cctttctcaa 
acaattcctc 



187680 
187740 
187800 
187860 
187920 
187980 
188040 
188100 
188160 
188220 
188280 
188340 
188400 
188460 
188520 
188580 
188640 
188700 
188760 
188820 
188880 
188940 
189000 
189060 
189120 
189180 
189240 
189300 
189360 
189420 
189480 
189540 
189600 
189660 
189720 
189780 
189840 
189900 
189960 
190020 
190080 
190140 
190200 
190260 
190320 
190380 
190440 
190500 
190560 
190620 
190680 
190740 
190800 
190860 
190920 



64/189 



WO 02/083077 



PCT/US02/12063 



atcagaaaca 
agctaggcca 
aaggcctcac 
aggatctgcc 
atgtgtaaca 
gaagctctgt 
ctaaagcaac 
agccaggagt 
aggggtgagg 
tccctaggga 
tcacaaggaa 
atggtgcctc 
agcttccagg 
gcatgctccc 
tggttcgagg 
tcctctccta 
ctgtgtagcc 

tggggccaca 

gaacaaagcc 
agctttaatg 
tggctgttct 
aagggaccct 
agcattctcc 
cctaggccct 
ccagggggtc 
gctgcctcgt 
acaaaagacc 
gcagaaaaat 
gacttccctt 
tccctcctcc 
tccaggtgta 
cacaccccat 
ccaatccatc 
cccaccttca 
ctgccctcct 
catgcagcga 
acgcccccgc 
cagctgccac 
gctggggagt 
tcacggccgc 
gacctcggcg 
gggtgcaccc 
gctcaacgcc 
ccccgcctcc 
agcgggctcc 
gaggcagcgc 
cggcccaact 
ataggccccg 
gcttaggacg 
ggctgggtgg 
tcgatacggg 
gaccccaggg 
cccgtgcggg 
aagaaggcac 
tctgctcccg 



ggcaaagagg 
ccacacaacc 
catggctcaa 
cttgctagct 
caggaataag 
gacaatgcac 
agcagtttca 
gacccttaga 
gggcctccct 
gggtcagaga 
gatgtgcaga 
ccccccagca 
ccccagccca 
aacgtctgag 
tctttctctc 
ccccagcctg 
tcccattaaa 
ctgacccgct 
tggaaaggca 
tgatttaatt 
ctctccttct 
caccacccac 
ccactttgcc 
gccctgggcc 
agccaaggtg 
agctcaggat 
agaatgggag 
gccctacctc 
tctcagtccc 
ttcataccac 
ggacccttta 
tcttgtcatc 
ctgagctgcg 
ctcacctcgg 
tcctgagggt 
ggcagcggca 
cctcactacc 
cagttaacgg 
cttccaggcc 
tggggtccca 
aacgggaagg 
cggggccggg 
gagcacgagg 
gcccattggt 
aaagaacggc 
cctgctagtc 
aggccagggg 
cccccgctcc 
cacctccctt 
agctcccgaa 
ccgtgttggt 
tttggacctc 
cccgagacgc 
gcgggctggg 
gtctccatgg 



gcactgggcc 
ccaagcctgg 
tccacatggc 
gcatgggtga 
cagtggcctc 
cccccatctc 
gcagcaccgt 
gtgggcccca 
cctcctgcct 
acagatgctg 
atgaacatcc 
aaaaaaatta 
gggtgcagga 
acctcatttc 
tgagccagga 
gggctcagtc 
gctcacgcct 
gactctcagg 
aggctggagg 
tccttctccc 
aacagtgagg 
accttcctgc 
ctcagctagt 
tgtgatccag 
agggtgggag 
ggatagaatg 
ggggatgggg 
cactttccag 
tgcttactaa 
cgggagggag 
tcctctccgg 
tcagggggag 
aggcggcggc 
ctgacgcagg 
gctgaggctc 
atgaacaggg 
gcactcatgc 
tcactgcgcc 
cggcccccaa 
ggcactgact 
aaccgggagg 
ga-gggcaggg 
aagggagccc 
tgatctggga 
caactggcga 
cgcgcctgcc 
ccagaaccga 
ttcttccgcc 

gggcccttcg 
ggtggccttt 

ctcgttcagg 
ttccccttcc 
gtcccgggtg 
ggctgggaga 
aggcggtcgc 



tcccctgctg 
ccctgcctcc 
ccccagaggc 
gttctttcac 
ttccttccag 
tgcaaaagca 
cagagaggca 
gggacgtcag 

GCGGCttctC 

ggagtttcca 
tgggaaactg 
tacccttccc 
agaggagcta 
tcattccttc 
actccgcaac 
ctggctcaag 
gtgaaaagaa 
actggacaca 
tcagtagttg 
tccagtgggc 
tgctgggggt 
ctccccaaca 
gggtgcttag 
aggtcccaag 
cacacaggat 

tggggggata 

tgctgccccg 
tgccaagatt 
tggaaacacg 
gtgcagccca 
cagccatccc 
gggggaaatg 
tctgtccctc 
agtctccgga 
ggaggctcag 
tcgtggggag 
ccccggacaa 
ccgggaacct 
cagcggagcc 
gctcaggaca 
cagggagcag 
acgaccactg 
cgcgccgcgg 
gggtggggcg 
gggccgccta 
gggcgagctc 
ccactcgaag 
tgggggatag 
ctctcgggag 
ctccctgggc 
agctgaggaa 
accccatccc 
cctggccggg 
gcctgtgacg 
catggcagcg 



cagctgtgtc 
agcctcctcc 
agcagtcctg 
tt ctctgtcc 
ggtttgctgc 
aacccccaaa 
cttcaggcca 
ccttcttgga 
ccactcccaa 
gtccccctaa 
ggaaatgact 
catgagatgg 
cacccattca 
ttcagtcccc 
cttccaccct 
cactcagtcc 
cacccattga 
gcagaggcca 
tggcagcttc 
caaaggtgca 
gggggtgggg 
agtgctgccc 
cctccagata 
aagcagaggc 
tatctcccag 
tccagctaca 
actttcttca 
caagaagaaa 
ggtccagaac 
agcccccgag 
tgtgggtgtg 
taatcggaca 
ggagataatc 
gcccgcactc 
agatgctact 
gagggggcgc 
tcgcttcgcg 
gacatcactc 
cccccacccc 
gggccgggac 
aggggtggcc 
cagcggcggc 
ccgccctccc 
agggacgctc 
cgtcacgtgc 
tcgcgaggaa 
agggagaagg 
cgcctctagc 
ggctgtcggg 
ttccacgccg 
ccctccatca 
ctgtcttgaa 
cctggagaag 
cgcccccggg 
agatgcgcct 



cacagagaac 
tgtgccaacc 
gttcaaattc 
ctgtttccta 
aactgtgcct 
ggcctggttt 
atcctggagg 
aaacagctca 
agcagccagg 
ccagaggggg 
agggaggaac 
agtgtcagca 
cctctggtca 
attctcattg 
ccctacctct 
agcagaagca 
ggccttgaga 
cacatactca 
acatcaactc 
aagataagta 
gaatatggag 
tcctctgccc 
gcatgcccca 
caggctggat 
ggacagggct 
ttttccctcc 
actccccgga 
ggcaagcgga 
ctaaatccag 
gccccaaggg 
gcacccccgc 
tcccccccat 
ctgtgcactc 
ccagacatca 
ggtccaaggt 
tgaccccatt 
gaaaacaccc 
tagttcaggt 
agccctgcca 
cggggagggg 
tcaccttgcg 
ggctgcagga 
gtcggcacgc 
cggaccaatg 
cagggtcgcc 
gacgggcagg 
agggcctcgg 
cttgaacctt 
cgcgtctcgg 
gcttcggcca 
ctcctgtttc 
agaagcaacc 
catcagaaca 
gaccgcagcc 
cgctcagcac 



190980 
191040 
191100 
191160 
191220 
191280 
191340 
191400 
191460 
191520 
191580 
191640 
191700 
191760 
191820 
191880 
191940 
192000 
192060 
192120 
192180 
192240 
192300 
192360 
192420 
192480 
192540 
192600 
192660 
192720 
192780 
192840 
192900 
192960 
193020 
193080 
193140 
193200 
193260 
193320 
193380 
193440 
193500 
193560 
.193620 
193680 
193740 
193800 
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193980 
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194100 
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194220 
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cgcggggtgg gatgtgggcg cctgcaatga gccgaggagc gagaggcgtg gccctccggt 194280 

ctgcgggggt tcttgccggt gctctccgcc cgccggttcg cgaacacccc acctatactt 194340 

cgcccgtggg gacggattcc ccaaagtgcc ctcagtaagc cgtccggagc acgcagcgcc 19440 0 

ttgcttccaa cggaactaga gagacggcct gggcggccga aggccagcct cccttcagca 19446 0 

gggccggggt cgctgcctta aaggagccct caagtctgcc accctgtggc ccataacctg 19452 0 

tctgctgatc tccagtctgc acactgttgg caaattaatc tttctgagct cttgttttca 19458 0 

tcgcgtccct ctcctgctcc aaagccctct gggactgcct ccagtagcgc ttcacaaact 19464 0 

tcagcagcac tttgggtgac tcatgtgccc tcgcgtttga gagacagcgc ttacattgag 1947 0 0 

agcttttcac attctgatct cagcccatcc actcctgccc actctaccca ctcccaccac 194760 

actaccttta gaccttgtct tgaaagtcta aagttggcag acggtgggcc caaagtggcc 194820 

cgtaggtgta 'ttttgtttag ctcacatggg gctttttaaa agaagtcctt ccccctcctg 194880 

cctcctccgc ctctttctgg aatcaggaat ctggtttcgt ggcttttgaa atatcagaag 194940 

atctgacaac gtgggctcac tttccagcct ggcattacaa gcccctttaa gagcacataa 195 0 00 

atttaccata gtccccgcta atccctgttt ctcacggtta gttaccttct gggtctgtgc 195060 

agacatttgt gatcccctcc ccctcattat cccaccttcc aacctggaag gccctccccg 195120 

ctctcttctc atagccaaaa ttcaagccct cttcaaccaa gataagtttt gcctatttat 195180 

ttatttattt atttttgaga cagagtctca ctctgtcgcc caggcttgag ttcagtggtg 195240 

ccatctcggc tgactacagc ctccaccccc tcccccgccg ggttcaagcc attctcctgc 195300 

ctcagcctcc caagtagctg ggattctagg tgcgcgccac cacacccagg gctaattttt 195360 

gtatttttag tggagacagg gtttcaccat gttggtcagg tgggtctcaa actcctagcc 195420 

tcaagtgatc tgcctgcctc agcctcccaa agtgctggga ttacaggcat gagccaccgc 195480 

gcctggccca agcctgcctt tttcaaaagt ctttctagcc aacttctcaa aaccatctct 195540 

ggtgtgggct ctctcaaaca caaactggct gctaactgca cagcccgggg ttatttccga 195600 

attgtcaact cctttcagca aacatttact gaatgtatct gatgagtaag ttattgtgct 195660 

aggctctgtg gagtggaagg cactcacagt ttagtgtagg agggacacca cacacaaaga 19572 0 

actgtggtct cctgtccttg ccaaatgttc ctcacactcc caggcatcgg gcagtctggc 195780 

ccactgggag gagctgaaat acagagctgc ccatgcctag ggatactaca tgctctcagg 195840 

gaaggccaaa gatggttctg gaaagcctcc aggagaaaat tagactcttt gtgtcaggag 1959 0 0 

gaagtcaggg caggagttac ctggctgccc catcactgcc caatgtgtct gtgttaccac 195960 

acgagtagtg cccaggctct ggggcttcct gtgcggaggg ccttaccaaa gatagatggc 19602 0 

tctgcaaggg aaaccacagg cccctaccag gccccctgga gacagaggca ggacaagaag 196 0 80 

aatcaggaac aactccaggt taggagaaca gatggagcac gtataaaaca tccactcacg 196140 

gccgggcgtg gtggctcacg cctgtaatcc cagcactttg ggaggctgag atgggcggat 19 620 0 

catgaggtca ggagatcaag accatcctgg ctaacacggt gaaaccccgt ctctactaaa 196260 

aatacaaaaa attagccggg catggtggct gatgcctgta atcccaacta ctccggaagc 196320 

tgaggcagga gaatggcgtg aaccccggga ggcggagctt gcagtgagcc cgagattgcg 1963 8 0 

ccactgcact tcagctttgg gcgactgagc cagactccat ctccaaaaaa aaaaaaaaaa 196440 

aaatccactc accatgagaa cctggtgact tttaaaaaaa aacaaaaaaa cacaacactt 196500 

gctacaactc caaacacacg gttttttgtt ttttttttta tttttcagag acagggtctc 196560 

actatgttgc ccaggctggt ctcaaactct tggcctcaag cgatcccccg accttgggct 196620 

cctaatgtgc tgggaccaga ccagaggcac aagccactgt gcccagccct ctattctttt 19668 0 

taattaaaaa ttaaacaact ataacctcat tggaaaactg cttggcagta cctaataaaa 196740 

atgataaatg aaatgagtaa ttccactttt tttttttttt tgagacagag tcttactctg 19680 0 

tcacccaggc tggaatgcag tggtgcgatc tcagctctct acaacctctg cctcctgggc 196860 

tcaagcactc ctcccacctc agcctcccca gtagccagga ctacaggcac atgccaccac 19692 0 

acccagctaa tttttgtatt tttttttttt ttttgagatt acaggcatgt gccaccacgc 19698 0 

cggctaattt tgtattttta gtagagacgg gtttctccat gttggtcacg ctagtctcga 197040 

actcccaacc tcaggtaaac tgcctgcctc agcctcccaa agtgctggga ttacaggcat 197100 

gagccaccgt gcctggccaa tttttttgta tttttaatag agacggggtt tcaccatgtt 197160 

ggccaggctg gtctcgaact cctgacctca ggtgaccgac ccacctcaga ctcccaaagt 197220 

gctgggatta caagcttgag ccaccatgcc cagccatggc atgtgtttaa ttttaaaaga 197280 

aactggcaaa ttgttttcca aaatgactgt acttacttac actcccatta gcagagtata 197340 

agagtttctg ttgcttcaca tcctcatcaa cataaaaatt tgcaattttg taaaaaacgc 197400 

tacaccatgc taaccaaaca aagcagggca acggactacc agtcttcagc ctctcctctg 197460 

agccatgccc ttgtagagga caatttcaca acagctatca agattacaaa tgcacacaca 197520 
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aaaaaagaga 
ccccatctct 
cagctaccca 
gaggcaagat 
aagaaaaaag 
ctaggaaccc 
gagtgtatga 
aatgtccata 
gctttaaaaa 
acaggcgtga 
catgttatac 
caaaatgcta 
attagctagg 
atattataag 
tgggttggca 
gcctgagagc 
caaggaagaa 
tcaaagacca 
tatttaggtc 
ctcatctgct 
cccaggctgg 
agcgattctc 
cggctaattt 
gaatgcctga 
gtgagccacc 
ggtttacccc 
ggtgttcggg 

tggtggcatg 

cccaggaggt 
aaaacaagac 
acataaagtc 
gtgctgggaa 
ttatctattc 
tagggttgtg 
atttagtaca 
tagcagattt 
aatcccagca 
agcctgggta 
aaaagcccag 
gtggatcacc 
ctactgaaga 
aggaggctga 
tcatgccgct 
agctggatat 
ttgcttgaac 
ctgggcgaca 
gctcacggct 
gagtttgaga 
tagccgggtg 
atcacttgaa 
cctgggtgac 
ttaaaaatta 
acatcctatc 
attcaaggaa 
gttaggagct 



aaacacacac 
actaaaaata 
ggaggttgag 
agtgtcattg 
agagaggaaa 
cagttctgga 
ggttatttat 
aatagaagac 
gaaaaaaaaa 
gccaccatgc 
aagctggtct 
ggattacagg 
attctccaga 
gaattgactc 
agttggagac 
caggaaagtc 
tcaatgtttc 
ccaggcaata 
tcaactgact 
gatttctttc 
agtgccgtgg 
ctgcctcagc 
ttgtattttt 
cctcaggtga 
gtgcccggcc 
tgttttccca 
tcaacatggt 
agcctgtagt 
ggagggtgca 
tcttctctca 
attccggaag 
taagaagact 
aaaacattaa 
aacactggag 
tagtaggtac 
agtgtcggtt 
ttttgggagg 
acaaagcgaa 
gagcagtggc 

tgaggttagc 

tacaaaaatt 
ggcaggagaa 
gcactccagc 
ggtggtatgc 
ttgggaggct 
gagtaaaacc 
gtaatcccag 
ccagcctgac 
tggtggcgtg 
ccagggaagt 
agagcaatac 
aattaaaaat 
acatttgctg 
agggtacaca 
gtctactaca 



acacacacac 
caaaaaatta 
gcaggagaat 
cactccagcc 
aaaaacacaa 
atttatcatt 
tgttgcattc 
ttcttaaata 
aaagaggaag 
cgggataatt 
cgaactcctg 
catgagctac 
gaaacagaac 
acacaattat 
ccaggagagc 
aatgatgtat 
agtttgatcc 
aaaattctct 
ggatgtggcc 
ttttttcctt 
catgatcttg 
ttcctgagta 
agtagagacg 
ttagcccgcc 
tcatctgctg 
gcactttggg 
gaaacctgcc 
cccagctact 
gttaactgaa 
aaaataagag 
gatacataaa 
tttcactttt 
gtaacaaagt 
ggaacaaatc 
tcaataaata 
agagagagga 
ctgaggcaag 
atcctgtctc 
tcacacctgt 
agttcgagac 
agttgggcgt 
tagcttgaac 
ctggatgaca 
acctgtcctc 
ggggttgcag 
ctgtctctaa 
cactttagga 
caatatggtg 
tgcctgtaat 
ggagagtgca 
tctgtctcaa 
aagggagtta 
tattctactt 
tgggggtgac 
aggaggaaaa 



acacacacac 
gctgggcgtg 
cgcttgaacc 
tgggccacag 
caaagattac 
tagatatact 
ttgtgatagc 
aatttggtac 
ttttcaatgt 
tttgtatttt 
gcctcaagcg 
cgtgcccagc 
t aggaaagag 
ggagtctgac 
caatggttta 
ctccagtcca 
caaaggcgag 
cttatttggg 
cactcacatt 
ttgtttttga 
gttccctgca 
gctggggtta 
gggtttcgcc 
tcggcctccc 
atttaaatgt 
aggccgaggc 
tctactaaat 
caggaggccg 
attgtgccac 
taagacaaat 
aactaaagat 
tatgtgtttt 
ttaaataccc 
atgttttata 
gaggaaagaa 
gaggggccag 
aggctcactt 
tccaaaaaaa 
aatcccagca 
caactaggcc 
agtggcaggt 
ctgggaggca 
gagtgagaca 
ccacctattc 
tgagccaaga 
aaaagaaggg 
ggccgaggct 
gaaccccgtc 
cccagctact 
gtgagccgag 
aacaaaaaaa 
taatcttatc 
atcagaagca 
gtccaggagg 
agcaatgaaa 



acacagccaa 
gtggcaggtg 

cgggaggcag 

cgcgagactc 
agatgcatat 
tccccatgat 
aaaagattgg 
agacaatata 
cctcccaagt 
tagtagagat 
atccacccac 
ctgctaagga 
ggcagatagg 
cagtcccaag 
gttccagtct 
aagggcagca 
aaaaaagctg 
gagggccagc 
agagagggaa 
aatggattct 
acctccacct 
caggcaatca 
atgttgacca 
aaagtgctgg 
taaccttagc 
gggcggctca 
acataaaaat 
aggcacgaga 
tacactccag 
gtgcatgttt 

agaggttccc 

ttttaaatca 
ttgcatgcct 
ctcacttttg 
cgaatgaatg 
gcacagtggc 
gaacctagga 
aaaaaaaaaa 
ctttgggagg 
aacatggtga 
gcctgtaatt 
gaggttgcag 
tctcataaaa 
tagaggctga 
ttgtgccact 
agttagggct 
ggtggatcac 
tctactaaaa 

tggga-ggctg 

atcataccac 
aagaaagaaa 
taatgtaatc 
agtcactaag 
cagggattac 
ggatgaaatc 



catggtgaaa 
cctgtaatgc 
aggatgcagt 
tgactcaaaa 
attctttgat 
actaaatcat 
aaataactca 
atgttatatg 
agccaggatt 

ggggttttgc 

ctcagcctcc 
aagattttgt 
aaagactgac 
agctgcaggt 
gagtcccaat 
ggtttgagac 
atgttccagt 
ctttttcttc 
atctgtttca 
cactctgttg 
cccgggttca 
ccaccacgcc 
ggctggtctt 
gattacaggc 
c gggtgcggt 
ctcgagctca 
tagctgggca 
atcacttgaa 
ccagggtgac 
gtttatagtc 
tattagagat 
tatatatgtt 
tcctctcctg 
tattcctggt 
aaccagaaga 
tcacacatgt 
gttggagacc 
aaaaaaaaaa 
ccgaggtggg 
aacaccatct 
ccagcaactc 
tgagccgaga 
aaaaaaaatt 
ggtgtaaaga 
gcactccagc 
gggtgcggtg 
ctgaggtcag 
atacaaaaag 
agacagagga 
tgcactccag 
gaaaaatttt 
acaaaagttg 
tccagcctac 
tgggagccac 
taggatggtg 
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gttatctctg 
tttgtaataa 
attatggaca 
aatatttaaa 
ctgagcccca 
cacttgcccc 
atctctccat 
aacctcacct 
ggaccaggag 
gacaggtttg 
ggccaggtcc 
aagtctgtct 
taatagactc 
gatggctatg 
gctttcagtt 
gcctggatcc 
tgcagccttg 
cccacagaaa 
ataatttgct 
caaagagtgt 
aggccaaggc 
gaaacctgtc 
tctcacatac 
agtgagccga 
aaaaaaacaa 
gaacacagaa 
tggaattgtt 
taggctggag 
caattctcct 
gctaattgtt 
ctcctggcct 
agccaccatg 
aaaccattaa 
gcctggcctc 
ggccgagtta 
aacaatttga 
gggtttgggg 
tgacccctgg 
aagatgagat 
gaggacaagt 
aaatagaaca 
gcctcagtac 
ctccctactc 
cgccgaggtg 
gacagattcc 
tcggagctgg 
aggcccagcg 
aggctaggtc 
atctccggag 
atctcctccc 
gagagcagcg 
gtcggggctc 
cttacgct cc 
gagcaggtgc 
tcaagcctgg 



agtggaagca 
ttccattctt 
aatggggaga 
atattcaata 
tccatgcact 
aaactaaccg 
agtctgcaat 
gtaggctgct 
acagggcagg 
atggctggta 
tgatggttct 
ctgagaagtg 
tatcaactct 
tgtcaaggaa 
tgacagccca 
ttcctcagtt 
taggaaacat 
atgtgagata 
acacagtaag 
ttctgtgggc 
aggtggatca 
tctaccaaaa 
ttgagaggct 
gatcgtgcaa 
aaccaaacaa 
cacaagagcc 
agate tcagg 
tgcagtggtg 
gcctcagctt 
cgtattttta 
caagtgaccc 
cccagctgga 
agttagggtc 
aatccatgga 
ggggtcagtc 
agtctggttt 
gtcaatctaa 
tgatatcatc 
ccactctggg 
gaagaagaaa 
caacactgac 
cccgtgcgcg 
ctggcgatcc 
ggtcccaggt 
cccatggctg 
geggggaect 
ccgggagatg 
tetgeaegtg 
tggtaggagg 
acgctcaccc 
ggctccgccc 
tggegggtae 
gcctgctgga 
tccaggcgcc 
cggtctgctc 



agggggagag 
ggggcaggtg 

gagtgtcatg 
ataaatgaaa 
ccacctggtc 
ttagcccttc 
aggcccccgg 
atatgtaaag 
ctaacagttc 
tcacgacctc 
tcctctttga 
cacctctgcc 
ctcctttcct 
atgagagtgg 
ctaggaagta 
gtgccttcag 
tgaagtggag 
taataataag 
agataatgta 
caggcatggt 
cttgaggtca 
atataaaaaa 
caggcaggag 
ttgcactcca 
aaagagtgtt 
taggtttccc 
atgtttgttt 
cgatcttggc 
cccaagtagc 
atagagatgg 
acctgccttg 
tctcaggatt 
atgccatggc 
agttgggatt 
ettaegtagg 

ggggttgggt 

ggccagggtt 
aacctaccaa 
tccagaaagc 
ctggctcccc 
tttaatgggg 
aggagggagg 
cacgcctcac 
aagctcccag 
agggcatggg 
gattctgggg 
caaggceggg 
gcttctgggc 
cggctcagtc 
atcccaagga 
tcccaatggt 
ccgggcgtcc 
gccgccggga 
ttaccttcat 
cctcctgccc 



atgcaagaag 
gtggatttat 
aaccaaggat 
taaaaattgc 
ttagtttaac 
acacacactg 
agttctattc 
atagtaggaa 
tgecttagea 
acctattggc 
ggcttctget 
acagcaccag 
gtctttgatg 
etattggeca 
aattcagtca 
atgagactgc 
gacccagcca 
tatttgttgt 
tacaggttct 
ggctcacgcc 
ggagttcgag 
ttagecagat 
aatcgcttga 
gectggaega 
gctatgaacg 
ttgggtatgt 
tgttttgaga 
tcactgcaac 
tgggactaca 
gttttcacca 
gcctcccaaa 
tacaaatgtt 
tgtagtgtgt 
gtctgtgggc 
gttgagggtg 
ttggacaagg 
agaagttagg 
gctgtggccg 
tctttcactc 
teagatgeaa 
cagccctgag 
ggegactget 
cccttcctct 
gaggggcagg 
gaactagect 
gtgtggacag 
cccaaggtgt 
tctggaaagc 

ccgggcctgc 
aggagggcac 

ctatccatcg 
ccgcgcggcc 
gaggtcttcg 
ctgcaggacc 
ggcctctgct 



cacatggccg 
aggtgttcat 
tatatttagt 
caactactcc 
actggctcca 
acccttgcta 
cctgaaactt 
tggctgatgg 
gtagcactgt 
ctcctacatg 
cctggtatcc 
gectgtaact 
aagtaagcta 
acaatgagta 
gcaaccaagt 
agttctattg 
agcctggcct 
tetaagctge 
tttcatttct 
tgtaatccca 
gccagcctag 
gtggtggcat 
actcaggagg 
cagagcaaga 
ttgttgtaac 
gctgggtata 
cagagtgtcc 
ctctgccccc 
ggtgcccgcc 
ttggccaggc 
gtggtgggat 
ccacattact 
tggccagggt 
agggttggag 
agtgtgtggc 
ttggagaaca 
ctgtcatcct 
cacaggaccc 
aaggcggggg 
aaagagaagg 
ccgtaacctt 
aegggcacat 
ccagctgctg 
ggagcccctc 
ggatggagac 
agecaaggga 
tggacactgc 
ggtccattct 
gctcctagag 
tcgggcccca 
gtgggtgggt 
cgggccgccg 
atccgcacat 
taagccgcac 
ggccgcgagc 



gtgaaagtta 
aagtgagttg 
ccaattcagt 
tgaccttgtt 
cctgaccctc 
tagcaccacg 
ttatccatgt 
agtggcccga 
gggcccagaa 
gcaaaggctg 
agatatagct 
cccttgttgg 
ccatactgga 
aaacactgaa 
aagcctggaa 
gccctttgct 
ggactcttga 
tacattggtg 
tttgetatta 
gcacttttgg 
ccaacatggt 
gcacctgtaa 
tggaggttgc 
ctccatttta 
atgtctccca 
tgcctaggaa 
ttctgttgcc 
cagcttcaag 
accacgcctg 
tggtcttgaa 
tacaggectg 
ggtagtcaga 
tgggagtaaa 
tegtacagag 
cacatttggg 
ctgtgagggt 
gtgeaaaace 
agccacccac 
tggggagatg 
caggcacagg 
caaaccttcc 
cgtcgatgtc 
cggtctccgt 
agcggcccgg 
gccgccgtcc 
ccgcccccca 
tttgggggac 
cctgacccgg 
ttcctgtccc 
geaggctegt 
ccggcgcggc 
ctcaccgctg 
tcttgagctg 
cgcgcaggcg 
ccccacccgc 
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agccttcata 
tcagcgatct 
cgaggcgccg 

cgggccctac 

ccctttcctg 
agagatcacc 
cccggggggt 
gtggaggaat 
cacaagtgac 
gccaccttac 
cccgggggtc 

c gggccccca 

tggcctttct 
ggagcggggt 
caggcgctgc 
cttgcgcatc 
tagacattgg 
aagggggcat 
atctgagctt 
tcttgatttt 
acagtctcag 
tacctgttca 
ttgtgcatct 
ggagagggga 
gtgggggtgg 
ggaaagggga 
ctaagtctcc 
atactgagtc 
taatcccaac 
cagcctggcc 

ggtggtgcat 

gaggtggagg 
tgagactctg 
ttcaggagcc 
cacacacaca 
catgtgataa 
aaacacatgc 
agccacatat 
acatgaaaag 
acacaaacac 
actaaacatg 
catagcaatg 
tcattaatgt 
atgtagacat 
acacacatct 
aaacacaaag 
acacacaagg 
acagagggat 
tcgccattgg 
acgacccttc 
ctaatccaga 
accattcccg 
gctacctcct 
agaaagcatc 
cagccaaagc 



gacccgtcac 
ggaggacgaa 
gcggccggtc 
actcaccctt 
cctcccgtct 
agcctggtga 
ccgggacacc 
gactgtgctg 
aggaagaagg 
ccacatccat 
cgctgtaagc 
ggccaggctc 
gtggtctggg 
ggcacctgta 
ctcagcggga 
acgtcatccg 
gatagcagat 
ggtggttgct 
ccttgtgtgg 
gtaatcggac 
cctgggcaac 
gatgacccca 
ccaccatctt 
gcagacatgg 
gagaatcagg 
aggataagga 
agcatacctc 
tctggtcctc 
actttgggag 
aacatggtga 
gtaaacccag 
ttgcagtgag 
tctcaaaaaa 
cagggaattc 
ctccaacacc 
gatcccccta 
tagacacaca 
gcacacagcc 
atgacagata 
cttcacaatc 
tcaacacata 
agtgagcaca 
ccctggaggg 
acatatatgg 
aaaatttgaa 
agtttgcaca 
gcacacacct 
gttgtctacc 
cgtcctcacg 
aggcgcttcc 
gactcacgtc 
actctcctgg 
ctcaggcccg 
acttctgtgc 
caaccaggac 



ttgcacggtc 

gctggggtcg 

ccagctgggg 
ccagggccct 
caggctcagc 
agtctggaac 
ttcacctcgg 
gagtctcatt 
acagaaggaa 
gtagccactg 
ccgggctcag 
tattagccaa 
gagcgccccc 
aggagccggc 
tgagcaccag 
gtactgaaaa 
tgggtcctgg 
gttctgagtg 
gtgttccgag 
gaattctgtc 
caagaccata 
gttgctgagc 
ggggaagtaa 
aaagtggggg 
catggcagag 
gggagacaaa 
agaggagctg 
cagtagagaa 
gctgaggcgg 
aaccccatct 
ctacttgggt 
ctgagatcat 
aaaaaaaaaa 
acatggatgc 
aaagttccag 
cacatagata 
taggcacaca 
tccccaaata 
agtatttctg 
acaaagaaac 
tgaacacaca 
tacagaaatg 
agttagaggc 
acacacatcg 
gtcccaggag 
gagacccgtt 
ccatcgctaa 
cacaggtaca 
gcctggccgc 
tttctcgcca 
ccagctggga 
gtacaaagac 
agctttgaaa 
ccacgccacc 
cccagaattc 



tcttgagagg 
ccctgcaacg 
tgggaagcag 
agaccaggcc 
gtaccgtgcc 
tcccagcttc 
gccttctggg 
ctcctgcttc 
agctccaacc 
ccatcctggg 
tcatccccaa 
gcacccctcc 
cacccagccc 
gccctgcttc 
ctccaccacg 
gttcagcctc 
ggaaatgaag 
tacagccctg 
gttccttgcc 
tgcgttgacc 
ggagatgatg 
ttctgctgga 
aacttgatga 
catgggtaag 
tggtggaacc 
gacattaaga 
catcccttcc 
tgagtcagcc 
gtagatcacc 
ctactaaaaa 
ggctgaggca 
gccactgcac 
aaaaaagaga 
atgatgcaca 
gagagtaaaa 
catagggcag 
tgtagaacac 
tacagataca 
tcattgagat 
acatgcacac 
catgctcaga 
cataaataca 
acccccccca 
ttcacacaca 
gcagattcct 
acacacgaac 
agtcccggga 
gctggtgcag 
cccagcggtg 
ctgcagaagc 
gcggccatat 
ttctgggagt 
aggcggttag 
tagtgaccag 
cgcggatcct 



ccaacagctc 
cccggttata 
cttagcggag 
ttcccttctc 
ccccaacccc 
tttacctgag 
atacacctga 
cccctcccta 
gggagggaga 
gage cage tc 
ggagccccat 
cccacactta 
ccagagcagt 
ctgcgcctct 
cctggggcgc 
ttcagtacct 
cgaggggatg 
ggggagtgaa 
tccttcctaa 
ctccccaccc 
gggctgeaca 
gagagttggc 
cctctgcaac 
gaaggaggag 
caatggaggg 
ttagtgatag 
ccactaattt 
agtgcagggg 
tgaggecagg 
tacaaaagtt 
gaagaatege 
tccagcctgg 
atgagtctcc 
cacacacaca 
gaetcaegta 
aatgtgtaga 
acatagacag 
gagatgeata 
cccaggagac 
aagacccccc 
ctcccagaag 
tatgeagaca 
acatgcacac 
gatgeaaata 
ccccactccc 
atacaggaca 
gaggtttege 
cgcctcctcg 
ccgggtcccg 
ccataactgc 
tgttttctga 
tgtagtttta 
ctgcccttgt 
ccactcatcc 
tctatagtgt 



ctgctccttt 
cgctggaggc 
gcttggacct 
cccctctagc 
agctgtgtgc 
ctgaccccct 
gacaaggcaa 
cttccactct 
aatggaaaag 
ctgaaagaac 
gtcccttcca 
tcacccaggc 
gggagaagct 
gcctctcctc 
actgegegat 
tcctgagggg 
ggggagaggg 
agcgcatcgc 
tagaagagtc 
tgccaccaaa 
ggtattctgc 
ggggacataa 
aaggactgag 
gtgagagaag 
gtggagatgc 
gtagaggaga 
aggaatggga 
ctcacacctg 
agttcgagac 
agccaggtgt 
ttgaactcag 
gcgacaagag 
aagagaagct 
cacacacaca 
agcagagaca 
cacatagaac 
aagcacaaaa 
gaagtacttg 
attcctagaa 
caaatacaag 
acatagaaca 
catatcttaa 
aggatagaac 
gaaaacaagc 
acaagcacac 
gaacatgcag 
ttgggccggg 
tccacgctgc 
ccccagcctc 
ctctgcctgc 
aaccatggaa 
tcccctcccc 
cttcttccca 
attttgggac 
cacctaaatg 
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tcgacggcca ggc 2 0 7433 



<210> 6 
<211> 23574 
<212> DNA 

<213> Homo sapiens 
<400> 6 

gctggacctt caacagagaa ggatggctgg cagagtccta gtctgatagg cgcccccttc 6 0 
ctagtgcctc tggagcagga gcacagtgaa aacccagctg tctgcgctga tacccctggg 120 
aggagccttt gctgcagcaa ctccccgccc tcattaagtc tccaccccag gctgcaccgt 180 
caggtaaacc tgggagccac gggattcggc gcctgaatgt gcctagacgg accccaacac 24 0 
actagccctg ccacgcacac ccagcaacac acagacaacc acccagggag acctggtctc 3 00 
tcccatgcta aactaggaac gctatgaaac agcatttttc ttttcctttt ttatataatt 360 
ctattttaaa cttaaaaaaa tttttttgaa tagagttggg gtctcgctct attgcgcagg 42 0 
ctggtcttga actcctgggc tcaaaccatc ctcccacctc ggcctctcaa agtgctagga 480 
tgacaggtgt gagccaccat gcctggccta ttttaaactt cttattgttg actaatttta 540 
gacttgcaga aaagttggaa aatgtaagca ttttccttac acccttcacc cagcttcccc 600 
tgatggtcca tcttacttaa tcatagtcca atcattacag ctaggaaact aaccttggta 660 
caatcctgtt aacgaaactg tagactgttt tgcatttctc atttttccac taatgtcctt 720 
tttctgttcc aagatccatt ctaggatccc acatcacagt tgtctcctta ttctcctcaa 780 
tctgggagtt ccttcatctt tccttatctt tacccttgac acttttgaag aatcctggcc 840 
agttattttg cagaatgttt ctcttgagtt gtctgatgtt ttattctgat cagaatgaga 900 
cacagcattg ttttgactaa ccaaaaagtt attctataag taaatattga ggttaaaaat 960 
ctcatccaac ctgggcaaca gagtaaggcc ctgtctcaaa taaagtctca acactaagat 102 0 
ttaaaaagtg accagaaaag cccccactat gatttgtctt gacttttttt taaaaaacaa 10 8 0 
acaaacaaac aaacaaacaa acaaaaacga tgtcttgctc tctctctccc tcaggctgga 114 0 
gtgcagtggc acgatcttgg ctcactgcaa cctccgcctc ccaggttcaa gcgattcttc 1200 
tgcctcagcc tcccaggtag ctgggattac agacgcccgc caccatgccc ggctaatttt 1260 
tgtattttta gtagagacaa ggtttcacca tgttggccag gctggtctcg aacttctaac 1320 
ctcaagtgac ccgcccacct ctgcctccca aagtgctgga attacagacg tgagcaactg 1380 
cgcttggcct cattagcatc ttaaatctcc acacaggggt gtgttcctta ctgttataag 144 0 
gagcaaagga tcagtttgag gacaggtaaa ataaaaatgc gcttgctgcc tagagggaga 15 00 
agtccctgct gaagatagct ttgcttgaat gagctcaatt gcaatgccag tgctgaggct 1560 
tgttgactgt acggtcacca cagttgctgc tgcgcgccta gaacatggtc actttcttga 162 0 
ctacctatcc tgtctcagta catctgtctg tggtttgtgg tggtccattt cctaattttt 168 0 
ttaatgaatc agaagactgt gatgtgcttt ccgctgtgct aaccatggcc gctgaagcaa 174 0 
aatgtaaacc aagatgcccc tgcagtggtt gtgcttcact ctacgacatc tgttaccgga 180 0 
aaggggtcca gattcagacc ccaggagagg gttcttggat ctcgtgcaag aaagaatttg 186 0 
agacgagtcc ataaagtgaa agcacattta ttaagaaagt aaaagaataa aagaatggct 192 0 
actccataga gagcgcagcc ctgagggctg ctggttgccc atttttatgg ttgtttctgg 1980 
atgatctgct aaacaggggt ggattgttca tgtctcccct ttttagacca tatatggtaa 2 040 
cttcctgata ctgccatggc atctgtaacc tgtcatggtg ctggtgggag tgtagcagtg 2100 
gggaccgacc agaggtcact ctcatcacca tcttggtttg ggtgggtttt agccagcttc 2160 
tatattgcaa gctgattttt ttggttggtt tggtttttga gacggagtct cgctccaggc 2220 
tggagtgcag tgacatgatc tcagctcact gcaacctcca cctcctgggt tcaagcgatt 22 80 
ctcctgcctc agcctcccaa gtagctggga ttacaggcac acaccaccat gcctggctag 2340 
tttttgtatt tttagtagag atgaggtttc accttgttgg ccaggctggt ctcgaacttc 2400 
tgacctcagg tgatccgccc acctcagcct cccaaagtgc tgggattaca ggcgtgagct 2460 
accgcgcctg gcctactgca aactatttta tcagcaaggt ctttatgacc tgtatctcct 252 0 
atctcatcct gtgacgcaga atgctgtaac tgtctggaaa cgcagcccag taggtctcag 2 580 
ccttattttg ctcagcccct attcaggatg gagttgctct ggttcacagg cctctgacac 2640 
atcctcttgt gttttggcgt gtgggaggaa agaggggtga gggaaggaaa ctcaaaacca 2 70 0 
agctctgacc acacagggca ggtacactct cccacctgtc tgtgggtgcc acaagtcaag 2 760 
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ggaggggcag agagagaaga aggtgtgaca 
agcccagttc ctcccggggc agcccaagta 
tcacagctga gctgggctcg ctcgctaccc 
cctcctgctc agcctcaggg tccatgccta 
ctcctgcttt tctgctgttg gccccatgcc 
gttctgcagc acagccaggt gatcctgggc 
ctcccctctt agagccctct ttccttccca 
gagaaggggt ggagggggtg ctgggcagtc 
ctccccaggg tcccttgtag ctggagtccc 
aagagcgggg gaggtatatt aacgtgtatt 
tttgagacgg agtcttgctc ttgttgccca 
actgcaacct ccacctccca tgttcaaaca 
ggattatagg cacgtgccac cacaccgggc 
ttcaccacat tggccaggct ggtctcgaac 
cctcccaaag tgctgggatt acaggcgtga 
ttatgatggt ttaacatctc ggggtggggg 
ctggagatca gaaacaactc atgcctttca 
accaaccacc cccttcaagg aactctcaca 
cagctcaggg ccaggcacct acccagacaa 
ccagaatgat tcaaaatgcc aatcctaccc 
ggaaacggca ctgtgggctg tggcctgtgc 
ccaaacctag tgcctcccca ctgtggccct 
tttcaacagc attggcctct gtgtcatcag 
caactgaggc aggagggggc tttagagagc 
tggagcccga ggtgaagttg cctccctgca 
aacagctggc tgatataggc cactcagcag 
cgcccaaggg aggcaggagt gacagagcaa 
tgtgcccaga cacatgcctg gggctctggc 
ccccgtcagc caccacccca ccaaccacca 
agggcacccc atgggttgaa tttacataca 
gtctttttaa aatttatttt agagacaggg 
tggtgccacc atagctcact gcagtgttga 
cctcctgagc agctaggcct acaggtacac 
tctgtagatg cggtgtcttg ctgtgttgcc 
gatcctccta ccccggcttc ccaaagtgct 
agccaacatg tctttctttt gacttctact 
tccccccgac acacacagaa tgggggagag 
gtgtagcact gggattgggg gtgggggctc 
gaagaaacag gaaggaaggt ccccaaaatt 
aggggagtgt ggactcagtc gaaccatagg 
tcatgccctc cacagagctg ggtccgtgga 
aggacagagc ccttttggga tggccagcac 
gggctgtgtg acctttgctt ctgggactga 
agtgttctcc agccctcagg aacttctaat 
gaagctccac tcggggaaga aacttccaag 
cactggctct gcccctgcag ttcaagttcc 
aaatctgttc attttaggag ctacctctca 
atctgctgag aatgaggagg atatgttgtc 
tgtgtggcag agagcccatc ctcactcagt 
ttctccctcc ccagcccagg ggcttccagc 
taaactcatt tccataataa cctgagccca 
ggtgggagag gtgggagggt gggcgatgag 
taccttgggg gtcaggcgag actgctggca 
gagagttctc agggtctggg agaaatggtg 
gagaccctgg ggctggggag gggcataggg 



gatggccgca ggccacagaa tgtcagagga 2 82 0 
gctggtagtt gggtggccaa acagagggcg 2 88 0 
ccagctcagc gtccactctg cccctcagta 2940 
ccctcctgct tcccagtcac ttctgcgtgc 3 000 
agctcctttc tgctgagctt ctcttctcca 3 060 
tccagacagg cctctccccc agtctggggc 312 0 
cgtggcctcc ccagggttcg ccactgaatg 318 0 
ttggggatta gccaagaggg cagagttggc 3 24 0 
gcggggtcta gacaccccct cctgaagggt 3 30 0 
tttagagtct ctcttttttt tttttttttt 3360 
ggctggagtg cagtggcacg atctcagctc 3 42 0 
attctcctgc cgcagcctcc cgagtagctg 3 480 
taatttttgt atttttagta gagatggggt 3 54 0 
ttctgacctc aaatgatcct cccgccttgg 3 60 0 
gccaccatgc ctggccttag attctctatt 3 660 
cttgttggct ggagagaaac tgcttgattc 3 72 0 
tatgcaaacc gaccagtctt gagtccatac 3 78 0 
tacgaaacca gtatttcccc tgccctaaac 3 840 
ttagagccca accccacgcc ccaaacccca 3 900 
tcttcccctg gcctgccttg cctccccagt 3 960 
cttccactcg ctcctgattc tgtccctgga 4020 
gcatggtggg aaactgtgag taataactta 4 08 0 
tcaccttcat aaattaaaat tttgcaacta 4140 
actctcacct ggccccttgg gcttgtggag 4200 
ctcagttttg ggatggtttt gttatgcggc 4260 
ctcttgcatg aagcaatggg agaatgtgaa 43 2 0 
agagggtgtt caaactaggc aacaccgtcc 43 8 0 
tgccatctag gctcggcctg ccagccacca 4440 
ccccgccaac caccaccccg tgtggtcctc 4500 
gaaaagacta accaaggtcc aagttaatat 456 0 
tcttgctctg tcacccaggc tggagtgcag 4620 
actcaggctc aagtgatcct cctgcttcag 4680 
accaccacgc ccagctaatt ttaaaatttt 4740 
caggctggtg tggacctgct ggcctctggt 48 0 0 
gggattatag gcatgagcca ccacgcctat 4 8 60 
ttggtatctt ttcttaaatg gttccctctg 4920 
gctgtcagat tctgagctcc agaacctcag 4 98 0 
aggaaccacc taggggagaa gacagggtgg 5 040 
atgtttgttt gcagaggcca gccaggctgc 5100 
gccccaggac cactagcttc tggccagcag 5160 
aattgcatgt aggagacaca ccagactccc 5 220 
tacccagcct ccactggtga gggaggtcag 5 280 
tggtttattg agctggagag tgtgcccagc 5340 
gtggctctgg gttcctggag tgggtgggtc 5400 
ctgcctgcag gtgctggagg tccggtgatt 5460 
tggagtggct gtcagtggcc acctgtcttt 552 0 
ccagaggcag gatcttggca tctggacttg 5580 
ccctaaggac tggggcccca ggctgcaagc 5640 
gaggaccagt gatccaggaa aagccacagc 5 700 
atcctggtct ccatgataac caagaggtca 5 760 
gaaacctgat tagggggcag caaactgagg 5 82 0 
agggggaggc tttgaatcca ggtccctgcc 5 88 0 
gaggcttctc agggtggctg ctgggctcat 594 0 
gagggtaaat gttgtgaata tggtcagcag 600 0 
gactcaaggt gactgggtgc tgcccatctg 6 06 0 
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gaaggaggca ggaggcatga gcccttccct 
cctcactcac ccaggggcca gggctgtcca 
acgccgccca gggggtggtc cctgtgtggg 
gtccaggagg ttggcacagt gatcctgagt 
agcccaggga gttctgcgtt tactgagttt 
tctcagctcc cttcctgctc cctcttcagg 
ctcatgccct cggctggcac ctcctctctc 
ccacccagca gcctggcact ctccagatgc 
atgtccgccc cttcggcagc tatgtctgct 
gatgattcca agccatcttt gactgtcccc 
gtgatgttcc ttcaaaacat tctcccctgc 
gccgccaccc tctgcagcgc cccctccgtg 
ctctgcccac agaccctcaa tcacaactcg 
gggcccaggt ggtgccctgc ccctgtctgt 
tcatgccatc tcccttggct tgcccggagc 
ccacagagtg cctgccctgc ctagggccct 
ttagtattat gtgactcccc actccgccac 
agggccctgg cacaggggca gccagcaccg 
gtgctgggcc cagcatttgg gaacttcaag 
gtggcaaagg gccgaaaact ggtgagacag 
tacagggact gcaatctgag ccagggaaaa 
ctggtgggag aagcaggaga gtggacaaga 
cttgcggtgc tggtggtggg gcagggatcc 
acctcctgtt caaagctact tctttccatg 
caaggctgag ggcctgcccc agctcccaca 
caggagtagg ctcaggaagc aggcgctcgg 
gcagccccag ctgcatcgct gcagatgggc 
gccggcccct gggagcagag gcagcaggac 
atggtctgcg gggattgggg gaaggggcgc 
cctgcccaca tgccccccac tggcatactt 
ccaccaaagc ctggcttgtc acagaagggt 
tggttgctat tgcaaacctg cagagaagag 
caaggtcacc cccactcctc gggctctcac 
ctgaagctcc tggaaggcat tcttcctgca 
caccaggcag tgagggggac actggcctgc 
ggcagagctc accattctag gtccacactg 
caggtccagc tgggcactgg ggagtgccaa 
taggtgaacg gtagagtcca ctggcaccat 
ctggcactgc agcttcccac acagggcatc 
ctgcagtacc ccccttcccc aaacccactc 
ctccctacct ccctgcacag ggcaggaagt 
gagcatctcc cgcagagttc accacctgga 
aggaagcatg ggccaggctg ggggcagctc 
ttcccctgcc catcttcccc acagtagaaa 
tgcaagggtg ctcgtgtcct ctcaccaggc 
gtgggacatg cgccatccca gcagtagcca 
aggtaaacgt ctgggggaca gtgggaggag 
tcacccatgg cctggcggca cagcgctcca 
tggggaaggc agagagaggc cacgtgcagt 
gctgggggag ccaggcctac ccaagcccag 
taactaaggc caacaggccg gctcccaagc 
gccctcacca ggcagcgcac gcagcagtcc 
gagcagttgt gagcaaagca gcagaggtcg 
ggcccaggtg agggcgcagc gccccagacc 
agccgactta acctggccag ggccgcagtc 



tctcccttcc ctctccacct ccccctggtg 6120 
gtggctgtgg ggcccaactc catggggtga 6180 
ccatctttgg ggctgagcaa cgtgataaga 6240 
gggttattgc ctccccgcag catggtgtcc 63 0 0 
cttggggcac ccatctgctc caagtcaccc 636 0 
ggagccttgg gatccaggct ccaagtgagc 64 2 0 
tagtcctaac atttcctcca ggctctgaca 6480 
tggcatcgct cagcttccaa agaaccttgg 6540 
ctccttgccc ctgggtgccc tgctgccctt 6600 
atcccatccc ccaaggcctt gtcatttcct 6660 
cctgagactc ccgcctgggg atgagaagca 672 0 
ctgacacgcc aggctctggc caccttgctc 6780 
ctttgtcagg gtcctcttag ctgccacccg 6840 
tatgccctct gcccccatct ctggcccaaa 6900 
actcccaaga ccaggctatg tcagacatgg 69 60 
ggtgcagggt gagtcctagg acagccatgc 7 02 0 
cacccaggtc acagagaact gggttaaggc 7 0 80 
cagctgacca gtggtatgga gtgaaaagat 7140 
ggggtgacag aggtgatttg tgcagaggaa 72 00 
aggctggaca ggcctccggg ggcagcatgg 72 60 
acagggcgaa gtcaagggtg aggcagccag 73 20 
ggagctgtac tgggaggtag agggccatgc 73 8 0 
accccctcgc ttgactggga ggccactgga 744 0 
gcctctgggg ctgctctctg cacctggggg 75 0 0 
gccccagcag agctctgagg aggggaaccg 75 60 
agcctaccca ctgcacgcag ggtcccttct 7620 
tcctgggagt cggtagcaac accaggccag 76 80 
gctgaggagc atggccagca ggaaggtgtc 7740 
tgagtcctga gcaggtgcac caccccagct 7800 
tcagcctgca cagggccact gtccatgctg 7860 
ggagcccagc ctggagcaca gtggcagtta 7 92 0 
aagaggaggg tcacgtagga ttaggaaccc 7 980 
cccgtggctg tggcaggcag tcaggcagcg 8 04 0 
gcgcctgctc tggcacacct acgggcagtg 810 0 
gggattcaaa cggcaaggag gggtcgggtg 816 0 
ggtgcctggc tctaccaggc ccaggccaag 822 0 
ggctccccga caagtcactt cctggccatc 828 0 
gtgcggtgcg agcaggctgg gctttccacc 83 4 0 
cctggggagg aagtagaggg gggtcaacag 8400 
catagcttct gctccctcca ctcagctcca 8460 
ggccctcgct gtcctggccg cagtttccat 8520 
aacaggcctc gggagctggg tgggagcctg 8 5 80 
ggagaggggc tgcgctcagc gggcctcagt 8 640 
actggcccca ccagaggatg gggggcaggg 8 70 0 
ccccagagct gctggcactg ctgctccagc 8 760 
ctgcccctgg cacagggtga gccgtccagt 8 82 0 
gtgcccgtgc aaaactcagg gaggtcacag 8880 
gccggcttca gctgcgcagg tgacgggtgg 8940 
gagaggtcca tgccgagagc gcggctcgga 9 00 0 
cacccaacgg gggaacctga gggcaccaat 9060 
tccccgaaac cctcaccctg aaccttccat 9120 
ccgtgggcgc actgggcccc cgggcgcagc 918 0 
cggcactcct gggaccagaa aggcaagaag 9240 
tgagcggaga gggcaagtgg gggccgggcg 93 00 
acactcctcg cccgcttcca cgaagccgtt 9360 
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cecgcagagc gccggcggca ccgggagtcc 
gccccccttg cggaagaagg cgcgcagctg 
aaacgggtgc ctaccggcac ggggagggca 
cgcgcttctc tgatccccac ccctgggctt 
agaggggaag taacccgcgc aaagtcacac 
aatagtgagc agcccgggac ccaaggtgga 
aggagtaacc tcgccaggtt actcggaaaa 
cctgagcttc ttccctttaa gcctcataaa 
ccccatttta cagaagagga aactgaggga 
caggactagc cgccccgccg cagccccgac 
atgacgcagc ctccggactc ggccgcagcc 
aggccgaggc tgtggccgat ctcatgggcc 
gagtggtcct ggggggccgt gggagggcgg 
cctcgccccc gcccgcgggg ctcaccgtgc 
tgccctcgac gggcgccagg cccactgtgg 
agcgcggcgt gaccaggcgg ggccgggagg 
ggtcagaggc acccacgtga gcagctgcgc 
ccggcgccac tgcaggaagg cccagagcgt 
gtcccgctcg gtccacacct ccaggccggt 
aagctgaggg cgaggcgggg ctgaagccgg 
tcgtggggcg cccttcctct tccccaaacc 
acccccatca ccgctctctc cccgccgccc 
aggagacgct gtttggtgtg gttcaagttt 
aatcccggtg gcttgagggg ctgagctggc 
ctctcctcac cagggtgtgg tctgccacaa 
gcgcttctcg cctgccctgc ggaggtgcaa 
tgagccctga cccccaaccc cagctcccag 
tttcttcaga cttcaataaa aatactggga 
atttggaggg ctatagatac agcattccca 
tctggggacc accaggaagg ctggtcatgc 
aggttccttt ccaggtgagc agctgctcca 
tggagccccg gggtggccag ggacgcagat 
ggccactagg gtgcagaggg gtaggagcgg 
ccagcctcct ctcccagagc tcacctcatc 
tcggggaagc cccttactcg cccttggtag 
ccccaaatct cagccagggc tggagcaaga 
gctccaaccg ccccttagga atgcaaggag 
tctggcggtg catcccagag cccatggaag 
actggctgcc catctgggcc gtagtgggtt 
agagggggtg ttacagggaa cactgaattc 
ccagccttcc tccctaaatg ctaatggagc 
cttagggaag ggacaggagc aatggtgact 
gaaggagcct tggtgatggc ttagttgtgg 
acgccgggga ggagtgggaa taggggaaga 
cagcaggatc cacaggatcg ggaggggagg 
tggagttggg gaaatgcggt gacctccccc 
ggcatctggt cgaccctctt gcccccacct 
cttctagacc ccacatacct cccacctgtc 
cagagaaaac agaatcgcca cctgcccacg 
ccactccggt aatgattcca tcttcaggct 
ggatcaatca taagtcaatc tgtcttcttt 
gcctcattac ctccagacca cccgctaatg 
ccacctgccc cagctctgcc ccctacccct 
aggtgatgtg accaccccat tgaaagggta 
tggaactaga gtttgttccc tgtccatccg 



ggggtccggg gcattggaga ggcaagcgcc 94 2 0 
gcggcggctg caggcgctga acacgcgcgg 94 8 0 
ttgggcatgg agggacagtc ccccaacccc 954 0 
ggctacagcc gccagacgcg cagagcccag 9 60 0 
aacaagcggg acaggggacg atgcggcccc 9 6 60 
atcgcgaccc gacggtgctc ctcccggtgt 9 72 0 
taatcttcat accgttgaga atccactttg 9780 
ccaccctgaa gcggacacta tgatcattat 9 840 
cgaccaaaga aacgcagcgg aggaagtccc 9 90 0 
cccccacccg cgtacccggt ggccgcagcc 9960 
tccacgcagc agccgtcggg gtcgtggctg 10 020 
atggtggctg cggcgccgat ggggagctcc 10 080 
tcactgcggc cgtagagcct cctgtctctc 1014 0 
tcacgcctcc cgagctctcg gcgcggcaca 10200 
cgecctggaa ggcgcggccc ctgggggcgg 10260 
tgaggccgcc ccacccggga cccgcgtccg 1032 0 
ggagtcgtgg ggccgctgcg cccacagccc 103 80 
ggcgttggcg tcctgcgtga cgcggctgcg 10440 
cagcgccacc tgaatgtcca gagtcctgag 10500 
gacagggcgc cccatcgcgc cggtggtcct 10560 
ccaccagcac ctgcctgtcc tgccgccgcc 10 62 0 
ccaacctggt ccacgtagtt ggcgacttcc 10 680 
cggtgccgag tcaagaactg ggaaggcaga 10740 
cccatccctg accccgccaa cccctggggt 10800 
tgtacagttc caggtacttc cgggtcctgc 108 60 
atggggaccc tgagtggaag ctgctgggct 10920 
aaggaagttt. aacatgtttt ctggaacttg 10980 
ctcgaggcct gtgaattcct gtctcttctg 11040 
ctcccatccg atcgatgccc ctgaccctgc 11100 
ccgctttgtt cccaggatcc ctgtggccac 11160 
tccgaaagat ctcgtgggtt gagaagtcct 112 2 0 
aatagctggc attcctgctg agggtgatca 112 8 0 
gtgtgaggga gctctttccc catcccaggc 11340 
ccagagcagg tgcagaggac tacccaggag 114 0 0 
tggcaatgat ccfcagggagg aaggggccag 114 60 
ggggcaagag ggagggtgtg gtaggggctg 1152 0 
gagtaggggt aggaatggtg ggggggtacc 11580 
catctcaccg tgfcggtfeggg ggccagcacc 11640 
tctatgtatc ctggggccag cagcctgctg 11700 
agcttcctcc tgcctcctcc aggatgtctc 11760 
agctttatga gtgagacact cacagtgtgt 1182 0 
tgctcagatc agaaactctt ggggctagag 118 8 0 
gagatgtgaa tatgggaagc accagggagg 1194 0 
gtttgtggtt cccaggggac ctgcagcagg 12 00 0 
agtcaggaga cactgccgaa gaatgggact 12 060 
agttcccctg cctgctgccc tcctttgttg 12120 
gccctagatc cttgaaatat tttcctcaga 12180 
cttcagtgat tgatgctcac cccctgcctc 12 240 
ctgcttccac cctccctgcc ttctccacac 12300 
ccatctcaac aggatctttc ccacacggat 12 3 60 
aaagaaaatc cttaacccaa cctcaccttg 1242 0 
atggctgctt cccccctccc aggcattcca 12480 
gccccacaca cacccgccac cctaggaggt 1254 0 
gggacgtcgg gaaaatatgg ttgggcacag 12 600 
actccacgag ggagaataaa atacgtgtca 12 660 
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agtfgctcaga acagcgcctg cggtcaagca 
tctgggtggt tttaagagcc tgcgctccag 
accttcctga gcacaggtct gcctgtccct 
gtccaggaca ctgagcacca aaatggcctt 
attcattccc ccctgctccc ctccctgtag 
tgctgccccc aagtgtagcc actgcatctt 
tctttttctt ttttttctct ttttttgaga 
ggtgcagtgg tgtgatcttg gctcaccaca 
ccgtctcagc cccctaagta gctgggatta 
ttgtattttt agtagagacg gggtttcacc 
cctcaggtga gccgcccgcc tcggcctccc 
ccttagcctg gggtgccggc tcatcttttc 
ttgcaaatgt ccactaggtg gtgctcactc 
tggggttggg agggcggagg tgccccacag 
gcctccaggg agccaaccac aggcctgcgt 
ggtcctctgc ccatccagcc acccgcacag 
atcccctcct cccagcccta ccccagcctg 
ggagctcctg gccttcagcc tccagggcca 
caagaaagag tgtggagctg agatggtggc 
tggcctgtgg agccggctgg ggaggaagtt 
aggacccccc acatactatg gagggcgtgg 
aactcaggat gtttgcagtc ctgacaacag 
aggcacacca gcatcctccc tagagaccag 
ccgccctctt ctttcatgcc cagaaggggc 
gaggacatta tgggggcatc gggtagccat 
tgaggcttgg ttccaaatga gctgctgctc 
cacttccaat tttggcaaac tgctccctct 
ccgtccctcc ggctccgaca cctgcttcac 
ccagccaaga gagcacactg gctctcagct 
ccctctctcc ttcccccacc ctcttggcgt 
ccfctgtccac tcagcctggc tgggggccca 
aggctgctgc gccttcggaa tgtttaaccc 
ctggggccct gtccaagggt ctcagagtct 
agcagatggt ccttgcctcc ctggccacag 
aggctggaat gccacttgct ttattgggga 
gaggtccctg ttcctctgca gcccctgcta 
ccttgatata tacccctcat cttgaaatgt 
tagcctctac tctttcttct gtaccatctt 
gtgggggaag cagagcccct ttggggctct 
agggaggccc agaggcagag tgaccccagt 
ggagggcaat ctcaaagctc aggccagtgc 
gggcctaggg gatcccagtg agggccctgg 
tcagccttca tggcaatgct cccctgcctt 
aaaattccaa acaaagaggg ctctctagga 
cgttcagacc ttagtgggtc tcaggctccc 
ttccccaagc tctctgggct tccgtttgcc 
gccctcttag tttgcagcac agatattatg 
aagcgtgtga agagcgggat tgggaagagg 
atggggtggt cttacatctg ggggacatca 
aagtggattc cacaggagcc agtgaggctg 
g a 9gg" a cagc agtgtgtcca cagggtcaca 
ttttggtgct tttttttttt tttttttttt 
aaactctgga cccatttctg cctggtgggg 
agggaaagag ggaggtgggg tggagaaagc 
agatccagat gtccaaaccc cagtgccttc 



ctcagtaggt gatatatact gataacataa 1272 0 
cccggacacc caccccaccc agcccaaggc 1278 0 
tccccagctc aaaatctttg gctctggatg 12840 
ctgtgatctg gccctcctgg gactctccaa 129O0 
atggagacct tccaggcagg tcagacaaac 1296 0 
tttctttttt cttttctttc tttttttttt 13020 
cggagtctca ctctttgccc aggccggagt 13 0 80 
acctctacct ccggggttca agcgattctc 1314 0 
caggcgccgc caccacgcct ggctaatttt 13200 
atgttggcaa ggctggtctc aaactcctga 132 60 
aaagccaccg catcttggtc cctgccattc 13320 
cctctaggat ttctttagac tcagcatatc 13380 
atcgccagca gggagctaac aagccgctcc 13440 
cggggctgac agcctcagcg gtcctcttca 13500 
gactctccct gtcatctgca ccctctctgg 13560 
atctgtgtca gtccctgccc cccaacactg 13620 
gcactcactg gttcttctcc agctcaagca 13680 
ccagccccat gtctggcttc gagacctggg 13740 
ctccaggcct cctgcctgcc agggagtagg 13 8 00 
cttggggaga acgtgggctg gggagtcagc 13 860 
aggaggtgag aacatacaaa gatgttccca 13 92 0 
ccacttggaa gggcgttggc acagcctgcc 13 980 
aggtcccaga aaggtgcccc tcccctggcc 14040 
atcaaaagca ggggaagaca gaggggtgct 1410 0 
ggtcagggcc tcctcagagc ctctgctacc 14160 
atttcctata gaattcaaat ttgactcctc 14220 
tccaaagttt tcctgggcct ccagcagccc 14280 
tggacccacg aagtaaacat ggacgccatt 143 40 
aggtgtcagg aggctggctt ggacggccag 1440 0 
ctcccaccct . gtgggaacac cccacttccc 14460 
gagttggagc cggcccagga gcttcctggg 1452 0 
ccgactcctt ttctccaaaa atgcactggc 1458 0 
ttggagggag ttcttccttc gcaagtgggg 1464 0 
gccccacaag gcctccagca tgagctcatg 1470 0 
aaggtctgca ccgggaaaaa ggccatactc 1476 0 
tctttactct tgccctcctg gtaccctgcc 14820 
gagtgtttcc tgccttttgg aggggatacc 148 8 0 
ggcaggcttc ctgggggcag gggcccaccg 14940 
cctcttggtc acagcccagg ccagacagac 15000 
gtgtgtccag ccttcccctc ctggggatgg 15060 
cgtgcttgac cagtggaatg ggggccttat 15120 
gttgggagct gctgggtctc tgggggcctc 1518 0 
ccctcttgct ggatttggac agtagggctg 1524 0 
gggg ca 9ggg tgtagccaat ggtttaaaat 153 00 
agcctaaaga gctgtgtgac catggacaat 153 6 0 
cctctgtaaa atgagcatat caaggctact 15420 
gcacaaacag atggggcatg gttattctgg 15480 
ctggggcaga gcgtcctgca gaagaagcac 1554 0 
ggagagtgac cactgccccc cccataccag 15 60 0 
aaggttcagg ccttcgtggc agggccctga 15 660 
tgttctggtc aactttgcaa aaggttttct 15720 
tagaggctcc tgaaaagctt caggacccac 15780 
gtgggggtgg cccagatcat ccagggaggg 15840 
tgaaatgact tccatgtgtg cgggctcacg 1590 0 
ttctgcccac ttgaggggca ggggaggcag 15960 
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ggg*cctatag gagtagtgac ttggtggttc 
tgtagggaga gacaaaaggg ctgggaggtt 
gcactgaccg gctcctccag gctgacggtg 
9gggtgactg gctgcccagg gatatgtcct 
acctgaggaa cacaggggca tgggacaaag 
ggaggcactg gaggcctttt ggggcagact 
agaaggaggg acgcccgtgg gcatccttgg 
agaaggctgg aaagggacag ctcagctggg 
ggcaggcttg ggctcggcga tcacaaggaa 
gaaagagcgc ggcagcctta gggattttta 
atgtacgaag agaggacact tgtgcccccc 
ttgccttgcg aggtccacct tgcccagagc 
ttttggtacc ctggcccaga aggcccctcc 
gacggagcct tggtgctccc tggccgggtg 
agccaagcat tactcagagg gcgcctggaa 
ctggccccaa ccactctcac aggcccgccc 
ctgcagcagc cctccttccc ccggtaccca 
cctccacgca gccctggcct cagtcctgct 
gctcctgcta ggccacccca actccttttt 
catcaaagac ccttcctcca agaagccctc 
agatgagggc cgagaggagt ctgccacctt 
agggctctga gtggatgtgg gggaggggaa 
ttaactgaga gtagagaacc aagctttgct 
ggtagggtgg gcccgggcac aggtgtggtg 
acatggcctt gcgcgcacgt gctggccacg 
gcagagctgg agctggagct ggagctccgg 
gagccaggct ggtattgggc acatagacgc 
gaccccagga tagggctcag aggggaggga 
gccagaacag accaggcccc tgtacctgtt 
gcccagtctg catagcacag cccagccccg 
cccggccagg cacttgctgt gcaggactgg 
tccagcccca tcttaacatc tttgtttgga 
ttccagagca ctgctaaaca gacaccttct 
acaaggcaac aatataaaca acatacacac 
ggtgagcagg gtggaggcca cagctagttc 
gggactcttg ggcatggggg cttcctagag 
aggagcagga gcgggtcaca cggtggcctg 
ccaggcagag ggggctacgg ctatgggctg 
gagggagcca tgccctctgc tttgccagcg 
gggcccgggg ccagcccaag cacacttgag 
ccaaagagtc cccacgccct gacctactgg 
gcgcgctgac ccgagctctg agcagaaccc 
cccctcaggg cgcaccccgc ccgcgtcctc 
ccagagcagc agcagtagta gcagcagcag 
ccagcccata gctgtgagct cctcggcctc 
caggctggat tgcggccgac ccgtgcccgg 
cccgccccca cctcccaccg cccgccctcg 
cggagcctgg ccaggggccc cgacggcgcg 
gaccgccgcc cctcgagggg gtggagctgg 
atgaccggcc cgagccgtcc ctccccgtcg 
gagagaaccg ctgggaaggt tcttggagga 
aaggccgcct ctcgctgctg aagttggccc 
gccgatctct gtcccctcct tgctacccgt 
agtcgggtca tccacctgca ctgggtgccc 
gccgctgtct tgggggcttc attgctgggc 



tggggacccc agcaaaacta gaagctgtaa 16020 
cagggcccct gtggagggcg gggagacatg 16080 
cgccagggtt gtccatccag gacccagtgc 16140 
ggagtaaaga cagagcacag ggtgaggggg 162 0 0 
cagagggagg ggggtagagg acatccccag 16260 
tcaccttcaa cacgcgtggg ctcagcctgg 1632 0 
atctgaggag ctatcaagga ggaggaaaag 16380 
gacacgggag tcccctgacc tttgtcgggg 16440 
gaggccaagg ccgccagtgc agaggggaag 16500 
gatgggcagc agatgccttt agggtgagag 16 560 
ccatcatctg agaaaaacaa cagccagatg 16620 
tccctcgggg actctgtcct ggtggcaggg 16680 
teat etc ttc aaggggaggg gacgcttccg 16740 
tgcctaaggg ggctctagga ggaatcccag 1680 0 
tgttcccctg gaatgctccc agcccctcca 16860 
tgeaggagee aggccccagg cacccagagc 16920 
gtcccagctc ccagaacaga cagcctcccc 16980 
gggctgatgg ctgcctgtgg aagtgactca 17040 
tctcctccac cttctctccc agactacaaa 17100 
cttgattgga tgagtgaatt gccatcaggc 17160 
ggaaaggagg ctagaggggc cagtgcaggg 17220 
ggaggggagg tctctcagcc cagagagcac 17280 
gctcctaggc ctctaagggt ttggggaaga 17340 
tgggtgcagt gtggtgtgtg ggtgctgtcc 17400 
ggcaccctga ccccaatgag ggagagaggg 17460 
tgaccgggtg aatgggggtg gaacccgagg 17520 
ccctctccca ggggtcccat cacctcccct 1758 0 
gcagtggacc gcctggggcc ctcccctggg 17640 
tggtccccac acagtgctgt ggaagccacc 1770 0 
catgccccct ccctggttgc cctccctgtt 17760 
ctaatcctcc cacccgcttg cagaggttgt 1782 0 
ggggttaccc cgaggagaca getgeagtet 17880 
atctggagag gcccttctct atctcaccaa 17940 
tgccctgctg ccctgggagg agggacgagg 18 00 0 
tgcagcctga gagcaaagca gggactctgg 18 060 
gatggagece cgctgagtcc taaggggtgg 1812 0 
cggatggaag ctggttgtga gagegagaat 18180 
ggggctgggg gctgggctgt ccccgagggg 18240 
gagtggcagc cgggcagtgt gggcaagtcc 18300 
cgtccctggg caggtcccac ggagaccccc 18360 
ccgtatggtg ccggggccgt gagaccctcc 1842 0 
atccccgcca ccaccaccgc gcctagcctg 18480 
accttgaagc accccggcgc ctggcactgg 18540 
caaeggggtc ccccgagctc tccggggcct 18600 
taggcagegg ctcgcaactc cggctccgcc 18660 
tgcagcctca ggccgccgcc tteggacett 1872 0 
ctcccgcctc ccctccccgc caaccccgct 18780 
cgccatgggg gagcegggtc gccactcccg 18 840 
geggaggagg gaatccgtgc ggcccctcgg 18900 
gtctcagagg gcctctactc ctgagaggag 18 960 
ccgcggcgtg gtgggatgag gcggtgggca 19020 
caggagegeg atettcegtg gtctcctggg 19080 
cctgccccga gggtgccctg gcggaggttg 19140 
ccaaggatag gaaggttcag gcaaccggct 19200 
aaaggcgatg cagcagaegg agacaacctt 19260 
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tcttccctgg cggtggccag agggcagaat 
gagacccttt cggcctcagt aacatctgtt 
gtgcaaattt ggatgagatg ttaacttttt 
ctctgtcgcc aggctggagt gcagcggcgc 
tggttcaagc gattctcctg cctcagcctc 
ccacccccag ctaacttttg tatttttagc 
tggtctcaat ctcctgatct cgtgatccac 
caggcatgag ccaccgcgcc cggccggaga 
tttttttttt tgagacagag tttctctctt 
ctgggctcac tgcaacctct gcctcccaga 
agtagctggc attacaggca ttcgccacca 
atggggtttc tccatgttgg tcaggctggt 
gcctcggcct cccaaagcgc tggaattaca 
aagtaaatct atttgttttt gagaatttgg 
gcagtgagag aaacctacat tcacatatct 
aaagtctccg tgctcttagc cctcctcacc 
ataggctttt agattatttt ttcacactca 
taacgttttt ctgcactttg attttctttt 
tcagtgcgtt tagagggtcc ttgttgttct 
tctgtcaccc acgctagagg acagtggcct 
gggctcaagg tatcctccca cctcagcctc 
ccaggcccag ctaaaatttt taatttttct 
gctggtctca aactcctggt cttggccagg 
tttgggaggc cgaggcgggc agatcactgg 
catggtgaaa ccccatctct actaaaaata 
ctgtagttcc agctacttgg gaggctgagg 
ggttgcagcg agccgagatc atgccattgc 
cgtctcaaaa aataaaaata aaaataaaaa 
tcagcttctc aaatcgctgg aattacagga 
ttattactta ttattactac tttggatttt 
cacaataaaa tgcacttatt ttaagtggcc 
ctacataagc atcactataa tgcagacaca 
cttccagcca aacttcccca ctcccaaccc 
tatagataag ttttgcctgt tctagaattt 
ttctgtgtct ggcttccttc cctctttccg 
catatcaata gtttgttcct tcgtattgct 
agtttctcta ctcaccagtg caccataggg 
gaactatatt tgcacagaga gagagagagg 
caattggctc acgtgattat ggaggccaaa 
agaacgagga aagccagtgg tgtgattcag 
accagtatgg aggtggctcg agctcagaac 
ccagagcagc ccctcagcga cacctcttca 
cttcagtgtg gatgtcaggt acagaaggca 
actggtattc ttacctcgaa ctggccaact 
gtcctgtcat tcagaggctg tcccttaccc 
ttctccctgc aacacgtttc agaaggccct 
aagctcctga gctccatgtg gtctgggaat 
gtgctaataa ctgactgtag aggggacaca 
ccttacacag ttgacacctc ccaaacccag 
attttctgcc gaagtctctg agaaagggtt 
atgaggccat caaaccaaag ctgaggaaga 
gggcgcctga cagcacctgg gaaagatggc 
agaggggcat caggaaatgt cctgggaacc 
gtgcttggca tgggtgacac ccaccatctg 
gaacaggatg tctgagggag gcactccgac 



tgcataaaag ctgcagactc ccaggcctgg 193 20 
tcatgtttta aacttttgtt ttcctactcg 193 80 
tttttttttt ttttttgaga tggagtctcc 19440 
gatcttggct cactgcaacc tccgactccc 195 0 0 
ccgagtagct gggactacag gcgcgcgcta 195 60 
agagacgagg tttcaccatt ttggccagga 19 62 0 
ccgcctcggc ctctcaaagc gctgggatta 19680 
tgttaacttt taagcaaatc tttttttttt 19740 
gttacccaga ctggagtgca atggcatgat 198 0 0 
ttcaagtgat tcttctgcct cagcctcccg 19860 
cgcctggcta attttgtatt tttagtagag 1992 0 
ctcgaactcc cgacctcagg tgatctgccc 1998 0 
ggcgtgagac accgcaccca gcctactttt 2 0 040 
aatgtagtaa tttggttagt gaaagttcga 2 010 0 
caaaatcaaa aagtacagaa agcataggga 2 016 0 
aacaggaaac caatatgatt agtttctttc 2 022 0 
agacaataca gacatatttt tttctcttat 20280 
tttttttggt cgcttaatac accttagata 20340 
tatgattatt atttagagac agggtctcac 2 0400 
gatcatgcct cattgcagcc ttgaaatcct 2 0460 
ctgagtagct ggaactacag gcacacggca 2 052 0 
gtagacaggg ggtctcactt tgtttcccag 2 0580 
cgcagtgtct catgcctgta atcccagcac 20640 
aggtcaggag ttcaagacca gtctggccaa 2 0700 
caaaaattag ccgggcatgg tggtgagcgc 2 07 60 
caggaaaatc gcttgaactc agaaggtgga 2 0820 
actccagcct gggcaacaag agcgaaactc 2 08 80 
gaactcctga tcttaagtga tcctcctgcc 20940 
gtgagtcacc acagctgtcc agctacgaga 210 00 
caaatcaact tcattaaggt ataatttaca 210 60 
agtaagatga gtttcgataa gtgtatataa 21120 
ttccctcact cacagaaaga gccctgtgcc 2118 0 
cagacagcca ctgatctgtt gttctctgtc 21240 
catataaatg gaatcatgcg gcatgcactc 213 0 0 
atgtttttga gattcattta cactattttg 21360 
gaatagtgtt cggtggtttg agggaaccac 2142 0 
ttattttcca gttaggggct cttataattg 21480 
aagaaagagg gagagagata tttattatag 21540 
aagttcccga atctgccatc tgcaagctgg 21600 
tttgagttca aaggcctgag aaccaggagc 2166 0 
aagttgggga caggaaagca gagcagcacc 2172 0 
gtaaagcaag gctgaacaca gaggggctgg 2178 0 
gctcgaggag ctactctggc gttcttgctt 2184 0 
cctacttaaa ctgcaggcca tggctttaat 21900 
aaagccaggt tagcatcccc tgactgacac 21960 
gtagtcgtcc acttccctgt ctctctcccc 2202 0 
atgtgtgttg ctcacttcct agcacagtca 2 20 80 
gtcgaaaagc cacatgggga tcagagtcat 2 214 0 
atgagctgtg tccaagtgca ggtcagagga 2 2200 
tatttacatt ttgaggttgc aggggaggag 2 22 60 
gggatcctag gatgcaccga gcagctccgg 22320 
ttctccactg gcttgttggc gtcaccctcc 22380 
aggcaaacca gtgagcatta acccttagaa 22440 
taaacacgac ttctcccaag gagtgacgca 2 250 0 
tccagccttc agagatcgcc agggtggcac 22560 



76/189 



WO 02/083077 



PCT/US02/12063 



ct^gtgacga caggctgatg cttgggtgcc ccagaaaagg tcatgtgtgt gaatgggggc 22620 
cccaaagcca acgcttcatc cctgacagcc tggtgcattt agaggggaac tttttgtccc 22 680 
ttggcaaggt gggtggaatt tcaggttcat agggcaaggg tattttagct ttaatagata 22740 
ttgtcaaaca gttttccaaa gtcattgtac acactctgtg attctactta tgtaaagttt 2 2 800 
aaaaacaggc aaatcaaatc tatggtgttc gaagtcaaga cagtagttac ccttgtgggg 22 860 
gctgcaactg gtacagagtg taagggggga ctgtaggatg gtctatttct tgatctgggt 22 920 
gtgttcgctt ttggaaaagt ccttgagttg catttataat gtgtgaactt ttctgtatgt 22980 
tacactttaa ttgaatgtac aaaaagtctc aggaggcctc agaccactgg aagcggacac 23 04 0 
aactaacccc tctgagagcc tccaatccaa gatggacata tgtccccttg gaagtatgca 2310 0 
gaagcaggtg aagactccta agccggatat tcccaaatcc ccccagtagc cgcagcttca 2316 0 
gcagctgctt atggtcctcc ctacaccctc tcttccccag acagccccca aacatctggc 23 22 0 
tgcatttgac ttgctctctc cctgtcccac ctctggattt agtccatgtt ctccaccctc 23280 
cccactgtca gcaatgtaga caagacaaac gcttagttca cgtgcccacc tactgcgtgc 23340 
catgcacggg gctggtcatt gtgggtggca aatgtgagca acacacgaag cctcaaggag 2 3 400 
cagaaaggga cacaaatcac ttcagcgtaa ggtaatttgt gataaatgtc atgtaacttg 23460 
cagcccctgg ccccctccta cagatggtgt ctaagaataa accccactaa catgtgactc 2352 0 
ctctgttcta gcccagctgt ttgggttgca agaaagagac tcactccagt tgcg 2 3 574 



<210> 7 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 7 

agtctccgtg ctcttagccc tcctcaccaa caggaaacca atatgattag tttctttcat 6 0 
aggct 65 

<210> 8 
<211> 656 
<212> DNA 

<213> Homo sapiens ■ 
<400> 8 

gtgcagcctc aggccgccgc cttcggacct tcccgccccc acctcccacc gcccgccctc 60 
gctcccgcct cccctccccg ccaaccccgc tcggagcctg gccaggggcc ccgacggcgc 120 
gcgccatggg ggagccgggt cgccactccc ggaccgccgc ccctcgaggg ggtggagctg 180 
ggcggaggag ggaatccgtg cggcccctcg gatgaccggc ccgagccgtc cctccccgtc 240 
ggtctcagag ggcctctact cctgagagga ggagagaacc gctgggaagg ttcttggagg 3 00 
accgcggcgt ggtgggatga ggcggtgggc aaaggccgcc tctcgctgct gaagttggcc 3 60 
ccaggagcgc gatcttccgt ggtctcctgg ggccgatctc tgtcccctcc ttgctacccg 42 0 
tcctgccccg agggtgccct ggcggaggtt gagtcgggtc atccacctgc actgggtgcc 480 
cccaaggata ggaaggttca ggcaaccggc tgccgctgtc ttgggggctt cattgctggg 540 
caaaggcgat gcagcagacg gagacaacct ttcttccctg gcggtggcca gagggcagaa 600 
ttgcataaaa gctgcagact cccaggcctg ggagaccctt tcggcctcag taacat 65 6 



<210> 9 

<211> 177 

<212> DNA 

<213> Homo sapiens 

<400> 9 

cgggcacggg tcggccgcaa tccagcctgg gcggagccgg agttgcgagc cgctgcctag 6 0 



77/189 



WO 02/083077 



PCT/US02/12063 



aggccgagga gctcacagct atgggctgga ggccccggag agctcggggg accccgttgc 12 0 
tgctgctgct actactgctg ctgctctggc cagtgccagg cgccggggtg cttcaag 177 

<210> 10 

<211> 80 

<212> DNA 

<213> Homo sapiens 

<400> 10 

gacatatccc tgggcagcca gtcaccccgc actgggtcct ggatggacaa ccctggcgca 60 
ccgtcagcct ggaggagccg 8 0 



<210> 11 

<211> 77 

<212> DNA 

<213> Homo sapiens 

<400> 11 

gtctcgaagc cagacatggg gctggtggcc ctggaggctg aaggccagga gctcctgctt 60 
gagctggaga agaacca 7 7 

<210> 12 

<211> 79 

<212> DMA 

<213> Homo sapiens 

<400> 12 

caggctgctg gccccaggat acatagaaac ccactacggc ccagatgggc agccagtggt 60 
gctggccccc aaccacacg 7 9 



<210> 13 

<211> 119 

<212> DNA 

<213> Homo sapiens 

<400> 13 

caggctgctg gccccaggat acatagaaac ccactacggc ccagatgggc agccagtggt 60 
gctggccccc aaccacacgg tgagatgctt ccatgggctc tgggatgcac cgccagagg 119 

<210> 14 

<211> 77 

<212> DNA 

<213> Homo sapiens 

<400> 14 

gatcattgcc actaccaagg gcgagtaagg ggcttccccg actcctgggt agtcctctgc 60 
acctgctctg ggatgag 77 



<210> 15 
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<21*1> 190 
<212> DNA 
<213> Homo sapiens 



<400> 15 

tggcctgatc accctcagca ggaatgccag 
ctccaaggac ttctcaaccc acgagatctt 
aacctgtggc cacagggatc ctgggaacaa 
ccagagcagg 



ctattatctg cgtccctggc caccccgggg 6 0 
tcggatggag cagctgctca cctggaaagg 12 0 
agcgggcatg accagccttc ctggtggtcc 18 0 

190 



<210> 16 

<211> 66 

<212> DNA 

<213> Homo sapiens 

<400> 16 

ggcaggcgag aagcgcgcag gacccggaag tacctggaac tgtacattgt ggcagaccac 60 
accctg ~ 66 



<210> 17 

<211> 72 

<212> DNA 

<213> Homo sapiens 



<400> 17 

ttcttgactc ggcaccgaaa cttgaaccac accaaacagc gtctcctgga agtcgccaac 60 
tacgtggacc ag 72 



<210> 18 

<211> 167 

<212> DNA 

<213> Homo sapiens 



<400> 18 

cttctcagga ctctggacat tcaggtggcg 
gaccgcagcc gcgtcacgca ggacgccaac 
cgggggctgt gggcgcagcg gccccacgac 



ctgaccggcc tggaggtgtg gaccgagcgg 6 0 
gccacgctct gggccttcct gcagtggcgc 12 0 
tccgcgcagc tgctcac 167 



<210> 19 
<211> 85 
<212> DNA 
<213> Homo 



sapiens 



<400> 19 

gggccgcgcc ttccagggcg ccacagtggg cctggcgccc gtcgagggca tgtgccgcgc 6 0 
cgagagctcg ggaggcgtga gcacg 85 



<210> 20 
<211> 143 
<212> DNA 
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<213> Homo sapiens 
<400> 20 

gaccactcgg agctccccat cggcgccgca gccaccatgg cccatgagat cggccacagc 60 
ctcggcctca gccacgaccc cgacggctgc tgcgtggagg ctgcggccga gtccggaggc 12 0 
tgcgtcatgg ctgcggccac egg 143 



<210> 21 

<211> 178 

<212> DNA 

<213> Homo sapiens 



<400> 21 

gcacccgttt ccgcgcgtgt tcagcgcctg 
caaggggggc ggcgcttgcc tctccaatgc 
getctgeggg aaeggctteg tggaagcggg 



cagccgccgc cagctgcgcg ccttcttccg 60 
cccggacccc ggactcccgg tgccgccggc 120 
cgaggagtgt gactgcggcc ctggccag 17 8 



<210> 22 

<211> 90 

<212> DNA 

<213> Homo sapiens 

<400> 22 

gagtgccgcg acctctgctg ctttgctcac aactgctcgc tgcgcccggg ggcccagtgc 60 
gcccacgggg actgetgegt gcgctgcctg 90 



<210> 23 

<211> 196 

<212> DNA 

<213> Homo sapiens 

<400> 23 

ctgaagcegg ctggagcgct gtgccgccag 
tgeaegggea cctcctccca ctgtccccca 
gecaggggea gtggctactg ctgggatggc 
cagctctggg ggcctg 



gccatgggtg actgtgacct ccctgagttt 60 
gaegtttace tactggaegg ctcaccctgt 12 0 
gcatgtccca cgctggagca gcagtgccag 180 

196 



<210> 24 

<211> 107 

<212> DNA 

<213> Homo sapiens 

<400> 24 

gctcccaccc agctcccgag gcctgtttcc aggtggtgaa etctgeggga gatgetcatg 6 0 
gaaactgegg ccaggacagc gagggecact tcctgccctg tgcaggg 107 



<210> 25 

<211> 199 

<212> DNA 

<213> Homo sapiens 
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<400> 25 

ggatgccctg tgtgggaagc tgcagtgcca 
catggtgcca gtggactcta ccgttcacct 
cttggcactc cccagtgccc agctggacct 
ccagtgtgga cctagaatg 



gggtggaaag cccagcctgc tcgcaccgca 60 
agatggccag gaagtgactt gtcggggagc 12 0 
gcttggcctg ggcctggtag agccaggcac 18 0 

199 



<210> 26 

<211> 109 

<212> DNA 

<213> Homo sapiens 

<400> 26 

gtttgcaata gcaaccataa ctgccactgt gctccaggct gggctccacc cttctgtgac 60 
aagccaggct ttggtggcag catggacagt ggccctgtgc aggctgaaa 109 



<210> 27 

<211> 148 

<212> DNA 

<213> Homo sapiens 



<400> 27 

accatgacac cttcctgctg gccatgctcc 
ccggcctggc ctggtgttgc taccgactcc 
gctgcagaag ggaccctgcg tgcagtgg 



tcagcgtcct gctgcctctg ctcccagggg 60 
caggagccca tctgcagcga tgcagctggg 12 0 

148 



<210> 28 

<211> 92 

<212> DNA 

<:213> Homo sapiens 

<400> 28 

ccccaaagat ggcccacaca gggaccaccc cctgggcggc gttcacccca tggagttggg 60 
ccccacagcc actggacagc cctggcccct gg 92 



<210> 29 
<211> 72 
<212> DNA 
<213> Homo 



sapiens 



<400> 29 

accctgagaa ctctcatgag cccagcagcc accctgagaa gcctctgcca gcagtctcgc 6 0 
ctgaccccca ag 72 



<210> 30 

<211> 1031 

<212> DNA 

<213> Homo sapiens 

<400> 30 
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cag'atcaagt ccagatgcca 
gatttaaaga caggtggcca 
cagtgaatca ccggacctcc 
gcttcgaccc acccactcca 
acactgctgg gcacactctc 
agcccctgac ctccctcacc 
gtcctgggag tctggtgtgt 
catgactgct ggccagaagc 
cccctgcagc ctggctggcc 
tcttcccacc ctgtcttctc 
tacacctgag gttctggagc 
ggggacagag ggaaccattt 
tggctatagg cgtggtggct 
ggatcaccag aggccagcag 
acagaaaaat tttaaaatta 
ggaggctgaa gcaggaggat 
caccactgca ctccagcctg 
aagacataaa a 



agatcctgcc tctggtgaga 
ctgacagcca ctccaggaac 
agcacctgca ggcagcttgg 
ggaacccaga gccacattag 
cagctcaata aaccatcagt 
agtggaggct gggtagtgct 
ctcctacatg caatttccac 
tagtggtcct ggggccctat 
tctgcaaaca aacataattt 
ccctaggtgg ttcctgagcc 
tcagaatctg acagcctctc 
aagaaaagat accaaagtag 
catgcctata atcccagcac 
gtccacacca gcctgggcaa 
gctgggcgtg gtggtgtgta 
cacttgagcc tgagttcaac 
ggtgacagag caagaccctg 



ggtagctcct aaaatgaaca 60 
ttgaactgca ggggcagagc 12 0 
aagtttcttc cccgagtgga 180 
aagttcctga gggctggaga 240 
cccagaagca aaggtcacac 3 00 
ggccatccca aaagggctct 3 60 
ggacccagct ctgtggaggg 42 0 
ggttcgactg agtccacact 480 
tggggacctt ccttcctgtt 54 0 
cccaccccca atcccagtgc 600 
ccccattctg tgtgtgtcgg 660 
aagtcaaaag aaagacatgt 72 0 
tttgggaagc cggggtagga 7 80 
cacagcaaga caccgcatct 84 0 
cctgtaggcc tagctgctca 900 
actgcagtga gctatggtgg 960 
tctctaaaat aaattttaaa 1020 

1031 



<210> 31 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 31 

gtgtgccaga gcaggcgctg caggaagaat gccttccagg agcttcagcg ctgcctgact 6 0 
gcctgccaca gccacggg 78 



<210> 32 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: polyhistidine tag 
<400> 32 

His His His His His His 
1 5 



<210> 33 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG epitope tag 
<400> 33 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 
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<210> 34 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<2 23> Description of Artificial Sequence: Primer 
<400> 34 

aactcttgaa atgagaagcg tg 22 



<210> 35 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 35 

aatatcatgc accatgaccc ac 22 



<210> 36 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 36 

tggagtaagt attgtaaact at 22 



<210> 37 

<211> 22 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 37 

ggagcttatc ctggattatc ta 22 



<210> 38 

<211> 22 

<212> DNA 

<213> Artificial 



Sequence 



<220> 
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<2-23> Description of Artificial Sequence: Primer 



<400> 38 

agagccacac atccatgtcc tg 



22 



<210> 39 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 39 

aagccactct gtgaattgcc at 22 



<210> 40 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Primer 
<400> 41 

gtctggcaat ggagcatgaa aa 2 2 

<210> 42 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 40 

gagtagtcgt agtaccagat gg 



22 



<400> 42 

attagagcac atgaaggaaa gg 



22 



<210> 43 
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<2I1> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 43 

acactgcttt gggggacagg ct 22 



<210> 44 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 45 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 45 

aaccaccacg gattcacgct tc 22 

<210> 46 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 47 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 44 

cacgacgcca cagagccagc tc 



22 



<400> 46 

ataaccagat ggctgtgggt ca 



22 
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<4C0> 47 

atccccgcaa tgaaatagtt ta 



22 



<210> 48 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence': Primer 
<400> 48 

gttgagagcc cacttagata at 22 

<210> 49 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 50 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 50 

gccactagga ggcaatggca at 2 2 



<210> 51 
<211> 22 
<212> D3STA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 49 

gcattggggg aagccaggac at 



22 



<400> 51 

cgacggcatc acggccatct gg 



22 



<210> 52 
<211> 22 
<212> DNA 
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<2l3> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 52 

tccaggctca ttcattttca tg 



22 



<210> 53 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 53 

tgacatcaac ttctcctttc ct 22 



<210> 54 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 55 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 55 

tctgggagag gacggagctg gc 22 

<210> 56 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 54 

agttgcagag acctagcctg tc 



22 



<400> 56 

tgtaggacta tattgctc 



18 
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<210> 57 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 57 

cgacatttag gtgacact 18 



<210> 58 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: BstXI-linker 
adapter 

<400> 58 

gtcttcacca cgggg 15 



<210> 59 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: BstXI-linker 
adapter 

<400> 59 

gtggtgaaga c 11 

<210> 60 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 60 

Asp Pro Gin Ala Asp Gin Val Gin Met 
1 5 



<210> 61 



88/189 



WO 02/083077 



PCT/US02/12063 



<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 61 

Asp Pro Gin Asp Gin Val Gin Met 
1 5 



<210> 62 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> (1) . . (11) 

<223> "Xaa" represents a variable amino acid 
<220> 

<223> Description of Artificial Sequence: Zn-binding 
consensus sequence 

<400> 62 

His Glu Xaa Xaa His Xaa Xaa Gly Xaa Xaa His 
15 10 



<210> 63 
<211> 18 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 63 

ctgcctagag gccgagga 18 



<210> 64 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 64 

caggagacca cggaagatcg 2 0 
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<210> 65 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 65 

ttgcctgaac cttcctatcc 



<210> 66 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 66 

cccctgtgtt cctcaggtc 



<210> 67 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 67 

gctccacact ctttcttgcc 



<210> 68 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 68 

aggcaggagg aagctgaat 



<210> 69 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Primer 



<400> 69 

cctaccacac cctccctctt 



20 



<210> 70 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 70 

cctacccctc tgcacccta 19 



<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 72 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 72 

cacaccctgg tgaggagaga 2 0 

<210> 73 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 71 

aacttccttc tgggagctgg 



20 



<400> 73 



ccacgaagga ccaccg 



16 



<210> 74 
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<2I1> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 74 

ctcacgtggg tgcctctg 

<210> 75 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 75 

ctctacggcc gcagtgac 

<210> 76 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 76 

gtccctccat gcccaatg 



<210> 77 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 77 

caggttaagt cggctcgc 



<210> 78 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



92/189 



WO 02/083077 



PCT/US02/12063 



<400> 78 

ctctctctgc cttccccac 



19 



<210> 79 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 79 

tctactgtgg ggaagatggg 2 0 



<210> 80 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 81 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 81 

gaccttgggg ttcctaatcc 2 0 



<210> 82 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 80 

cccctctact tcctcccca 



19 



<400> 82 

gtgcacctgc tcaggactc 



19 



<210> 83 
<211> 21 
<212> DMA 
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<2I3> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 83 

cctggactct tatcacgttg c 



21 



<210> 84 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 84 

ttaccctcca ccatttctcc 20 



<210> 85 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 86 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Primer 
<400> 86 

ccccatgggt tgaatttaca 2 0 

<210> 87 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 85 

gtggagaggg aagggagaag 



20 



<400> 87 

gcagctaggc ctacaggtac a 



21 
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<210> 88 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 88 

accacgccta tagccaacat 2 0 



<210> 89 

<211> 20 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 89 

aggtgtagca ctgggattgg 20 



<210> 90 

<211> 20 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 90 

ccccaggacc actagcttct 20 



<210> 91 

<211> 20 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 91 

attgagctgg agagtgtgcc 2 0 



<210> 92 

<211> 20 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 



<400> 92 

ttcaagttcc tggagtggct 



20 



<210> 93 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 93 

acaaggaccc tctaaacgca 2 0 

<210> 94 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 95 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Primer 
<400> 95 

gtgttgctac cgactcccag 2 0 

<210> 9S 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 94 

acccttctgt gacaagccag 



20 



<400> 96 

cccaggtgca gagagcag 



18 
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<210> 97 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Primer 
<400> 97 

gctcctcttg tccactctcc t 21 



<210> 98 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 98 

gccacttcct ctgcacaaat 2 0 



<210> 99 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 99 

ttctctgtga cctgggtggt 2 0 



<210> 100 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 100 

atttgggcca gagatggg 18 



<210> 101 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 101 

ggcagaggag caaggtgg 18 

<210> 102 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 102 

atggcttgga atcatcaagg 2 0 



<210> 103 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 103 

tagagagagg aggtgccagc 2 0 



<210> 104 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 104 

aaagatggcc cacacagg 18 

<210> 105 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 105 

agaactctca tgagcccagc 2 0 



<210> 106 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 106 

agctctgagc agaacccatc 2 0 



<210> 107 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 107 

ctcgaggggg tggagctg 18 



<210> 108 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 108 

gagaggagga gagaaccgct 2 0 



<210> 109 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 109 

agtgacttgg tggttctggg 2 0 



<210> 110 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of- Artificial Sequence: Primer 
<400> 110 



99/189 
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tgtcatctgc accctctctg 



<210> 111 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 111 

aagagggagg gtgtggtagg 



<210> 112 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 112 

gtgatcaggc cactagggtg 

<210> 113 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 113 

atacagcatt cccactccca 



<210> 114 
<211> 18 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 114 

gaaggcagaa atcccggt 

<210> 115 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 115 

caccagcacc tgcctgtc 



<210> 116 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 116 

gggtcagagg cacccac 

<210> 117 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 117 

gccgtagagc ctcctgtct 



<210> 118 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 118 

gacgaccaaa gaaacgcag 

<210> 119 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 119 

tgagcggaga gggcaagt 



101/189 
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<210> 120 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 120 

aaaccctcac cctgaacctt 



<210> 121 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 121 

aagggtgctc gtgtcctct 



<210> 122 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 122 

ccactcagct ccactcccta 



<210> 123 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 123 

ggattcaaac ggcaaggag 

<210> 124 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Primer 
<400> 124 

gctgagtcct gagcaggtg 19 



<210> 125 
<211> 19 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 125 

gaaccgcagg agtaggctc 19 



<210> 126 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 126 

atatggtcag caggagaccc 2 0 



<210> 127 
<211> 21 
<212> DMA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 127 

gcatcctggt ctccatgata a 21 



<210> 128 
<211> 20 
<212> DMA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 128 

gaggctttga atccaggtcc 2 0 



<210> 129 
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<2T1> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 129 

cagcaagaca ccgcatctac 2 0 



<210> 130 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 130 

gggacagagg gaaccattta 2 0 



<210> 131 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Primer 
<400> 131 

ttccttcctg tttcttccca 20 



<210> 132 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 132 

gtcctgggag tctggtgtgt 2 0 



<210> 133 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 133 

aggaacccag agccacacta 



<210> 134 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 134 

tgcctctggt gagaggtagc 



<210> 135 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 135 

ttcctggatc actggtcctc 



<210> 136 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 136 

ttcgagcagt gagagaaacc t 



<210> 137 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 137 

ctgggagtcg gtagcaaca 



<210> 138 
<211> 18 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 138 

aggccactgg aacctcct 18 



<210> 139 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 139 

gcagcatggt acagggactg 2 0 



<210> 140 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 140 

cagctgacca gtggtatgga 2 0 



<210> 141 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 141 

tgtcagacat ggccacagag 2 0 



<210> 142 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 142 

agggtcctct tagctgccac 2 0 
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<210> 143 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 143 

aggccttgtc atttcctgtg 2 0 



<210> 144 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 144 

caaagaacct tggatgtccg 2 0 



<210> 145 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 145 

ctcagctccc ttcctgctc 19 



<210> 146 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Primer 
<400> 146 

ctgtgtgggc catctttg 18 



<210> 147 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : Primer 
<400> 147 

ggagaaatgg tggagggtaa 2 0 

<210> 148 
<211> 19 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 148 

aaagccacag cttctccct 19 



<210> 149 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 149 

aggtttctgg gctcaggtta 2 0 



<210> 150 
<211> 19 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 150 

gtaggtgtgc cagagcagg 19 

<210> 151 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 151 

tgtggaccta gaatggtgag c 21 
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<21*0> 152 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<22 0> 1 

<223> Description of Artificial Sequence: Primer 
<400> 152 

caaagtcaca caacaagcgg 



<210> 153 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 153 

caggatcttg gcatctggac 



<210> 154 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 154 

ctggcttgtc acagaagggt 

<210> 155 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 155 

ctggagcaca gtggcagtta 



<210> 156 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 156 

tttggtcgtc cctcagtttc 20 

<210> 157 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 157 

cctctcagga gtagaggccc 2 0 



<210> 158 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 158 

agcggttctc tcctcctctc 20 



<210> 159 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 159 

cctctcagga gtagaggccc 2 0 

<210> 160 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 160 

atgttactga ggccgaaagg 2 0 



<210> 161 
<211> 20 
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<2i2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 161 

ccctttccag ccttctcttt 20 



<210> 162 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 162 

caggactgca aacatcctga 2 0 



<210> 163 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 163 

tccctggtgc ttcccata 18 



<210> 164 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 164 

aggcaggagg aagctgaat 19 



<210> 165 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 165 
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cctcttgccc ctcttgct 18 



<210> 166 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Primer 
<400> 166 

cctgaatgtc cagagtcctg a 21 



<210> 167 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 167 

ggcctcgagt cccagtattt 20 



<210> 168 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 168 

agagcctcct gtctctccct 20 



<210> 169 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 169 

tcgccctcag cttctcag 18 



<210> 170 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 170 

tcacgtgggt gcctctga 18 



<210> 171 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 171 

gggttacttc ccctctctgg 20 



<210> 172 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 172 

ctgggctttc caccctgg 18 



<210> 173 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 173 

ctgggctttc caccctgg 18 



<210> 174 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 174 

tccaggtggt gaactctgc 19 
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<210> 175 

<211> 20 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 175 

tagaatggtg agctctgccc 2 0 



<210> 176 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 176 

gaccttgggg ttcctaatcc 20 



<210> 177 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 177 

ccaagcacac ttgagcgtc 19 



<210> 178 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 178 

agccatgccc tctgcttt 18 



<210> 179 
<211> 18 
<212> DNA 
<213> Artificial 

<220> 



Sequence 
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<400> 179 

cagcccaagc acacttga 



PCT/US02/12063 

Sequence : Primer 

18 



<210> 180 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Primer 
<400> 180 

cccatagctg tgagctcctc 2 0 



<210> 181 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 181 

aaagcttcag gacccacaaa 20 



<210> 182 
<211> 19 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 182 

atcttggtcc ctgccattc 19 



<210> 183 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 183 

gagggagctc tttcccca 18 



<210> 184 
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<2ll> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 184 

ggaccaccag gaaggctg 18 



<210> 185 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 185 

aaccccagct cccagaag 18 



<210> 186 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 186 

ctgctcacct ggaaaggaac 2 0 



<210> 187 
<211> 19 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 187 

actgcaggaa ggcccagag ■ 19 



<210> 188 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



116/189 



WO 02/083077 PCT/US02/12063 
<400> 188 

accgaaactt gaaccacacc 20 



<210> 189 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Primer 
<400> 189 

tgagggacga ccaaagaaac 2 0 



<210> 190 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 190 

caaagtcaca caacaagcgg 2 0 



<210> 191 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence : Primer 
<400> 191 

gaacctgagg gcaccaatta 2 0 



<210> 192 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 192 

ttggccttag ttaattggtg c 21 



<210> 193 
<211> 21 
<212> DNA 
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<2l3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 193 

ttggccttag ttaattggtg c 21 



<210> 194 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 194 

ctggagcaca gtggcagtta 2 0 



<210> 195 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 195 

aggagtaggc tcaggaagca 2 0 



<210> 196 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 196 

tgtactggga ggtagagggc 2 0 



<210> 197 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Primer 
<400> 197 

agagggtgac ttggagcaga 2 0 
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<2I0> 198 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 198 

aggcaataac ccactcagga 2 0 



<210> 199 
<211> 21 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 199 

cccatgggtt gaatttacat a 21 



<210> 200 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 200 

gcctctggtg atcctcctac 2 0 



<210> 201 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 201 

actcagtcga accatagggc 2 0 



<210> 202 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 202 

tgtgtgacct ttgcttctgg 2 0 



<210> 203 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 203 

gcatgaagca atgggagaat 2 0 



<210> 204 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 204 

actcagtcga accatagggc 2 0 



<210> 205 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 205 

gcaggaaggt gtcatggtct 2 0 



<210> 206 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 206 

gcaggaaggt gtcatggtct 2 0 
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<21'0> 207 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 207 

gggcattgga gaggcaag 



<210> 208 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 208 

tctgcctccc agattcaagt 



<210> 209 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 209 

agaatgcctt ccaggagctt 



<210> 210 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 210 

gtgttgctac cgactcccag 

<210> 211 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 211 

ctgcttcctg agcctactcc 20 



<210> 212 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 212 

aacaggaggt tccagtggc 19 



<210> 213 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 213 

agcgagttgt gattgagggt 2 0 



<210> 214 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 214 

tgtgcaggct gaaagtatgc 2 0 

<210> 215 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 215 

gccacttcct ctgcacaaat 2 0 



<210> 216 
<211> 20 
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<2l2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 216 

ctgagcccag aaacctgatt 2 0 



<210> 217 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 217 

gtgagtgagg caccaggg 18 



<210> 218 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 218 

cctagatggc caggaagtga 2 0 



<210> 219 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 219 

ccagaaacct gattaggggg 2 0 



<210> 220 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 220 



123/189 



WO 02/083077 PCTYUS02/12063 

ta'cctctcac cagaggcagg 2 0 

<210> 221 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 221 

gccagaagct agtggtcctg 20 



<210> 222 
<211> 19 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 222 

gcaggcagct tggaagttt 19 



<210> 223 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 223 

ttatcatgga gaccaggatg c 21 

<210> 224 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 224 

gacctggatt caaagcctcc 2 0 



<210> 225 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 225 

atgttggcta taggcgtggt 2 0 

<210> 226 
<211> 21 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 226 

ttatcatgga gaccaggatg c 21 



<210> 227 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 227 

ctgagtggag ggagcagaag 2 0 



<210> 228 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 228 

ctgagtggag ggagcagaag 2 0 

<210> 229 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 229 

ccatgagatc ggccacag 18 
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<210> 230 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 230 

atttcaaggc tgcaatgagg 2 0 



<210> 231 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 231 

acttctttcc atggcctctg 20 



<210> 232 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 232 

accacccagg tcacagagaa 2 0 



<210> 233 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 233 

tcccaagacc aggctatgtc 20 



<210> 234 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<2^3> Description of Artificial Sequence: Primer 
<400> 234 

ctggggatga gaagcagc 



<210> 235 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 235 

cttctccctt ccctctccac 



<210> 236 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 236 

atttgtgcag aggaagtggc 



<210> 237 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 237 

catttcctcc aggctctgac 



<210> 238 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 238 

tcagagcctg gaggaaatgt 
<210> 239 
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<2I1> 19 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 239 

gttcctggag tgggtgggt 19 



<210> 240 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 240 

ctgggagtcg gtagcaaca 19 



<210> 241 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence Sequence 
containing SNP 

<400> 241 

gccctctgag accgacgggg agggacggct cgggccggtc a 41 



<210> 242 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 242 

caagaacctt cccagcggtt ctctcctcct ctcaggagta g 41 



<210> 243 
<211> 41 
<212> DNA 
<213> Artificial 

<220> 



Sequence 
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<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 243 

caccatctca gctccacact ctttcttgcc caggtctcga a 41 



<210> 244 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 244 

ccaccatctc agctccacac tctttcttgc ccaggtctcg a 41 

<210> 245 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 245 

acaactaagc catcaccaag gctccttcct ctagccccaa g 41 

<210> 246 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 246 

tggtgcttcc catattcaca tctcccacaa ctaagccatc a 41 

<210> 247 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 
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<400> 247 

caggatacat agaaacccac tacggcccag atgggcagcc a 

<210> 248 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 248 

ccctccaaat cagaagagac aggaattcac aggcctcgag t 

<210> 249 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 249 

agctgctcac ctggaaagga acctgtggcc acagggatcc t 

<210> 250 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 250 

acttccttct gggagctggg gttgggggtc agggctcaag c 

<210> 251 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 251 

ttcctgcagt ggcgccgggg gctgtgggcg cagcggcccc a 
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<210> 252 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 252 

ggttcagggt gagggtttcg gggagcttgg gagccggcct g 41 

<210> 253 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Sequence 
containing SNP 

<400> 253 

cagagaagcg cgggggttgg gggactgtcc ctccatgccc a 41 

<210> 254 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 254 

cccctctctg ggctctgcgc gtctggcggc tgtagccaag c 41 

<210> 255 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 255 

cagccgccgc cagctgcgcg ccttcttccg caaggggggc g 41 



<210> 256 
<211> 41 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SMP 

<40Q> 256 

agtggcctcc cagtcaagcg agggggtgga tccctgcccc a 41 

<210> 257 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 257 

tgctggccat gctcctcagc gtcctgctgc ctctgctccc a 41 



<210> 258 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 258 

ctgctgcctc tgctcccagg ggccggcctg gcctggtgtt g 41 



<210> 259 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 259 

gaagtagctt tgaacaggag gttccagtgg cctcccagtc a 41 

<210> 260 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 260 

gcctctgtct caccagtttt cggccctttg ccacttcctc t 41 



<210> 261 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 261 

acaaatcacc tctgtcaccc ccttgaagtt cccaaatgct g 41 

<210> 262 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 262 

tccataccac tggtcagctg cggtgctggc tgcccctgtg c 41 



<210> 263 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 263 

ggtgctggct gcccctgtgc cagggccctg ccttaaccca g 41 

<210> 264 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 
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<400> 264 

ggaaatgaca aggccttggg ggatgggatg gggacagtca a 41 

<210> 265 
<211> 41 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 265 

agggctcatg cctcctgcct ccttccagat gggcagcacc c 41 



<210> 266 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 266 

gcccctcccc agccccaggg tctcctgctg accatattca c 41 



<210> 267 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 267 

cctgggcggc gttcacccca tggagttggg ccccacagcc a 41 

<210> 268 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 268 

gccccacagc cactggacag ccctggcccc tgggtgagtg a 41 
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<210> 269 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 269 

gccctggccc ctgggtgagt gaggcaccag ggggaggtgg a 41 

<210> 270 
<211> 41 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 270 

tgcagcctgg ggccccagtc cttaggggac aacatatcct c 41 



<210> 271 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 271 

cactgagtga ggatgggctc tctgccacac agcttgcagc c 41 

<210> 272 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 272 

ctggtcctca ctgagtgagg atgggctctc tgccacacag c 41 
<210> 273 
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<21U> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 273 

atgacctctt ggttatcatg gagaccagga tgctggaagc c 41 



<210> 274 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 274 

agcaagacac cgcatctaca gaaaaatttt aaaattagct g 41 



<210> 275 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 275 

ggaggatcac cagaggccag caggtccaca ccagcctggg c 41 

<210> 276 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 276 

atcccagcac tttgggaagc cggggtagga ggatcaccag a 41 

<210> 277 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 277 

agcctggctg gcctctgcaa acaaacataa ttttggggac c 41 

<210> 278 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 278 

actgagtcca cactcccctg cagcctggct ggcctctgca a 41 

<210> 279 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 279 

tccaggaacc cagagccaca ttagaagttc ctgagggctg g 41 



<210> 280 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 280 

ttcttccccg agtggagctt cgacccaccc actccaggaa c 41 



<210> 281 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
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containing SNP 
<400> 281 

tcctcattct cagcagatca agtccagatg ccaagatcct g 41 



<210> 282 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 282 

ctgaggacca cacggggtgg tggttggcgg ggtggtggtt g 41 



<210> 283 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 283 

ggctggcagg ccgagcctag atggcagcca gagccccagg c 41 



<210> 284 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 284 

ctttgctctg tcactcctgc ctcccttggg cgttcacatt c 41 



<210> 285 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 285 

gtgagctctg cccacccgac ccctccttgc cgtttgaatc c 41 



<210> 286 
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<21*1> 41 

<212> DNA 

<213> Homo sapiens 

<400> 286 

tggcgaggtt actcctacac cgggaggagc accgtcgggt c 41 



<210> 287 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 287 

ggctgctcac tattggggcc gcatcgtccc ctgtcccgct t 41 



<210> 288 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 288 

gccgcatcgt cccctgtccc gcttgttgtg tgactttgcg c 41 



<210> 289 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 289 

gccgtcccac cccgtcg 17 



<210> 290 
<211> 17 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 290 

cctcctctct tggcgac 17 



<210> 291 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 291 

tccacactct ttcttgcc 



<210> 292 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 292 

gctccacact ctttcttgcc 



<210> 293 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 293 

tcaccaaggc tccttcct 



<210> 294 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 294 

cagaagagac aggaattcac a 

<210> 295 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 295 

tggaaaggaa cctgtggcc 
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<210> 296 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 296 

gggtttcggg gagcttg 17 



<210> 297 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 297 

gggttggggg actgtc 16 



<210> 298 
<211> 16 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 298 

ctctgcgcgt ctggcg 16 



<210> 299 
<211> 17 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 299 

gccgtccctc cccgtcg 17 



<210> 300 
<211> 20 
<212> DNA 
<213> Artificial 

<220> 



Sequence 
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<223> Description of Artificial Sequence: Primer 
<400> 300 

tcctcctcta ttggcgaccc 2 0 



<210> 301 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 301 

ctccacactt tttcttgccc a 21 



<210> 302 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 302 

gctccacact ctttcttgc 19 



<210> 303 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 303 

tcaccaagcc tccttcct 18 



<210> 304 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 304 

agaagagacg ggaattcac 19 



<210> 305 
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<2ll> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 305 

tggaaaggag cctgtgg 



<210> 306 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Primer 
<400> 306 

agggtttcgt ggagcttgg 

<210> 307 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 307 

ggggttggag gactgtcc 



<210> 308 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 308 

gctctgcgca tctggcgg 



<210> 309 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 309 

agtcaagcga gggggtgg 



18 



<210> 310 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 310 

cctcagcgtc ctgctg 16 



<210> 311 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<210> 312 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 312 

accagttttc ggcccttt 18 



<210> 313 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> Description of Artificial Sequence: Primer 



<400> 311 

aacaggaggt tccagtgg 



18 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 313 

ctgtcacccc cttgaagt 



18 



<210> 314 
<211> 16 
<212> DNA 
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<2i3> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Primer 
<400> 314 

tcagctgcgg tgctgg 16 



<210> 315 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 315 

gccttggggg atgga 15 



<210> 316 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 316 

tcctgcctcc ttccag 16 



<210> 317 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 317 

actggacagc cctggc 16 



<210> 318 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 318 

ctgtgtggca gagagccca 19 
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<210> 319 
<211> 22 
<212> DMA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 319 

aattatgttt gtttgcagag gc 2 2 



<210> 320 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 320 

gaacttctag tgtggctct 19 



<210> 321 
<211> 17 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 321 

ccaagggagg caggagt 17 



<210> 322 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 322 

agtcaagcgt gggggtgg 18 



<210> 323 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: Primer 
<400> 323 

ctcctcagca tcctgctgc 19 



<210> 324 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 324 

gaacaggagt ttccagtggc 2 0 



<210> 325 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 325 

caccagtttt tggccctttg 20 



<210> 326 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 326 

ctgtcaccca cttgaagttc 2 0 



<210> 327 
<211> 18 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 327 

ggtcagctgt ggtgctgg 18 
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<210> 328 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 328 

aggccttggg agatgggat 19 



<210> 329 
<211> 16 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 329 

tcctgccttc ttccag 16 



<210> 330 
<211> 16 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 330 

actggacagt cctggc 16 



<210> 331 
<211> 16 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 331 

tgtggcaggg agccca 16 



<210> 332 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 332 

attatgtttg cttgcagagg 2 0 



<210> 333 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 333 

ggaacttcta atgtggctct g 21 



<210> 334 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 334 

cccaagggaa gcaggagtga 2 0 



<210> 335 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<400> 335 

Cys Cys Phe Ala His Asn Cys Ser Leu Arg Pro Gly Ala Gin Cys Ala 
15 10 15 

His Gly Asp Cys Cys Val Arg Cys Leu Leu Lys Pro Ala Gly Ala Leu 
20 25 30 

Cys Arg Gin Ala Met Gly Asp Cys Asp Leu Pro Glu Phe Cys Thr Gly 
35 40 45 

Thr Ser Ser His Cys Pro Pro 
50 55 



<210> 336 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 336 

Thr Met Ala His Glu lie Gly His Ser Leu Gly 
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10 



<210> 337 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<400> 337 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg 
85 



<210> 338 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<400> 338 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
1 5 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 



<210> 339 
<211> 178 
<212> PRT 

<213> Homo sapiens 
<400> 339 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 
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Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 6 0 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg Leu Leu Ala Pro Gly Tyr lie Glu Thr His 
85 90 95 

Tyr Gly Pro Asp Gly Gin Pro Val Val Leu Ala Pro Asn His Thr Val 
100 105 110 

Arg Cys Phe His Gly Leu Trp Asp Ala Pro Pro Glu Asp His Cys His 
115 120 125 

Tyr Gin Gly Arg Val Arg Gly Phe Pro Asp Ser Trp Val Val Leu Cys 
130 135 140 

Thr Cys Ser Gly Met Ser Gly Leu He Thr Leu Ser Arg Asn Ala Ser 
145 150 155 160 

Tyr Tyr Leu Arg Pro Trp Pro Pro Arg Gly Ser Lys Asp Phe Ser Thr 
165 170 175 

His Glu 



<210> 340 
<211> 113 
<212> PRT 
<213> Homo sapiens 

<400> 340 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
1 5 * " 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His He Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Gly Leu He Thr Leu Ser Arg Asn Ala Ser Tyr 
85 90 95 
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Tyr Leu Arg Pro Trp Pro Pro Arg Gly Ser Lys Asp Phe 
100 105 



Ser Thr His 
110 



Glu 



<210> 341 
<211> 165 
<212> PRT 

<213> Homo sapiens 
<400> 341 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg Leu Leu Ala Pro Gly Tyr lie Glu Thr His 
85 90 95 

Tyr Gly Pro Asp Gly Gin Pro Val Val Leu Ala Pro Asn His Thr Asp 
100 105 110 

His Cys His Tyr Gin Gly Arg Val Arg Gly Phe Pro Asp Ser Trp Val 
115 120 125 

Val Leu Cys Thr Cys Ser Gly Met Ser Gly Leu lie Thr Leu Ser Arg 
130 135 140 

Asn Ala Ser Tyr Tyr Leu Arg Pro Trp Pro Pro Arg Gly Ser Lys Asp 
145 150 155 160 

Phe Ser Thr His Glu 
165 



<210> 342 
<211> 168 
<212> PRT 

<213> Homo sapiens 
<400> 342 

Leu Ala Pro Gly Tyr lie Glu Thr His Tyr Gly Pro Asp Gly Gin Pro 
1 5 10 15 
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Val Val Leu Ala Pro Asn His Thr Asp His Cys His Tyr Gin Gly Arg 
20 25 30 

Val Arg Gly Phe Pro Asp Ser Trp Val Val Leu Cys Thr Cys Ser Gly 
3 5 40 45 

Met Ser Gly Leu lie Thr Leu Ser Arg Asn Ala Ser Tyr Tyr Leu Arg 
50 55 60 

Pro Trp Pro Pro Arg Gly Ser Lys Asp Phe Ser Thr His Glu lie Phe 
65 70 75 80 

Arg Met Glu Gin Leu Leu Thr Trp Lys Gly Thr Cys Gly His Arg Asp 
85 90 95 

Pro Gly Asn Lys Ala Gly Met Thr Ser Leu Pro Gly Gly Pro Gin Ser 
100 105 110 

Arg Gly Arg Arg Lys Ala Arg Arg Thr Arg Lys Tyr Leu Glu Leu Tyr 
115 120 125 

lie Val Ala Asp His Thr Leu Phe Leu Thr Arg His Arg Asn Leu Asn 
130 135 140 

His Thr Lys Gin Arg Leu Leu Glu Val Ala Asn Tyr Val Asp Gin Leu 
145 150 155 160 

Leu Arg Thr Leu Asp He Gin Val 
165 



<210> 343 
<211> 167 
<212> PRT 

<213> Homo sapiens 
<400> 343 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys 
1 5 10 15 

Gin Gin Leu Trp Gly Pro Gly Ser His Pro Ala Pro Glu Ala Cys Phe 
20 25 30 

Gin Val Val Asn Ser Ala Gly Asp Ala His Gly Asn Cys Gly Gin Asp 
35 40 45 

Ser Glu Gly His Phe Leu Pro Cys Ala Gly Arg Asp Ala Leu Cys Gly 
50 55 60 

Lys Leu Gin Cys Gin Gly Gly Lys Pro Ser Leu Leu Ala. Pro His Met 
65 70 75 80 

Val Pro Val Asp Ser Thr Val His Leu Asp Gly Gin Glu Val Thr Cys 
85 90 95 
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Arg Gly Ala Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu 
100 105 110 

Gly Leu Val Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Asn 
115 120 125 

Ser Asn His Asn Cys His Cys Ala Pro Gly Trp Ala Pro Pro Phe Cys 
13 0 13 5 14 0 

Asp Lys Pro Gly Phe Gly Gly Ser Met Asp Ser Gly Pro Val Gin Ala 
145 150 155 160 

Glu Asn His Asp Thr Phe Leu 
165 



<210> 344 
<211> 193 
<212> PRT 

<213> Homo sapiens 
<400> 344 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys 
15 10 15 

Gin Gin Leu Trp Gly Pro Gly Ser His Pro Ala Pro Glu Ala Cys Phe 
20 25 30 

Gin Val Val Asn Ser Ala Gly Asp Ala His Gly Asn Cys Gly Gin Asp 
35 40 ~ 45 



Ser Glu Gly His Phe Leu Pro Cys 
50 55 

Lys Leu Gin Cys Gin Gly Gly Lys 
65 70 

Val Pro Val Asp Ser Thr Val His 
85 

Arg Gly Ala Leu Ala Leu Pro Ser 
100 

Gly Leu Val Glu Pro Gly Thr Gin 
115 120 



Ala Gly Arg Asp Ala Leu Cys Gly 
60 

Pro Ser Leu Leu Ala Pro His Met 
75 80 

Leu Asp Gly Gin Glu Val Thr Cys 
90 95 

Ala Gin Leu Asp Leu Leu Gly Leu 
105 110 

Cys Gly Pro Arg Met Val Cys Gin 
125 



Ser Arg Arg Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu 
13 0 135 140 

Thr Ala Cys His Ser His Gly Val Cys Asn Ser Asn His Asn Cys His 
145 150 155 160 

Cys Ala Pro Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly 
165 170 175 
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Gly Ser Met Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe 
180 185 190 

Leu 



<210> 345 
<211> 126 
<212> PRT 

<213> Homo sapiens 
<400> 345 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys 
15 10 15 

Gin Gin Leu Trp Gly Pro Asp Gly Gin Glu Val Thr Cys Arg Gly Ala 
2 0 25 30 

Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu Gly Leu Val 
35 40 45 

Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin Ser Arg Arg 
50 55 60 

Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu Thr Ala Cys 
65 70 75 80 

His Ser His Gly Val Cys Asn Ser Asn His Asn Cys His Cys Ala Pro 
85 90 " 95 

Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly Gly Ser Met 
100 105 110 

Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe Leu 
115 120 125 



<210> 346 
<211> 93 
<212> PRT 

<213> Homo sapiens 
<400> 346 

Ala Trp Cys Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg Cys Ser 
15 10 15 

Trp Gly Cys Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp Gly Pro 
20 25 30 

His Arg Asp His Pro Leu Gly Gly Val His Pro Met Glu Leu Gly Pro 
35 40 45 

Thr Ala Thr Gly Gin Pro Trp Pro Leu Asp Pro Glu Asn Ser His Glu 
50 55 ~ 60 
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Pro Ser Ser His Pro Glu Lys Pro Leu Pro Ala Val Ser Pro Asp Pro 
65 70 75 80 

Gin Ala Asp Gin Val Gin Met Pro Arg Ser Cys Leu Trp 
85 90 



<210> 347 

<211> 236 

<212> PRT 

<213> Homo sapiens 

<400> 347 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys 
15 10 15 

Gin Gin Leu Trp Gly Pro Asp Gly Gin Glu Val Thr Cys Arg Gly Ala 
20 25 30 

Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu Gly Leu Val 
35 40 ~ 45 

Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin Ser Arg Arg 
50 55 60 

Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu Thr Ala Cys 
65 70 75 80 

His Ser His Gly Val Cys Asn Ser Asn -His Asn Cys His Cys Ala Pro 
85 90 95 

Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly Gly Ser Met 
100 105 110 

Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe Leu Leu Ala 
115 120 125 

Met Leu Leu Ser Val Leu Leu Pro Leu Leu Pro Gly Ala Gly Leu Ala 
130 135 140 

Trp Cys Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg Cys Ser Trp 
145 150 155 160 

Gly Cys Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp Gly Pro His 
165 170 175 

Arg Asp His Pro Leu Gly Gly Val His Pro Met Glu Leu Gly Pro Thr 
180 185 190 

Ala Thr Gly Gin Pro Trp Pro Leu Asp Pro Glu Asn Ser His Glu Pro 
195 200 205 

Ser Ser His Pro Glu Lys Pro Leu Pro Ala Val Ser Pro Asp Pro Gin 
210 215 220 
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Ala Asp Gin Val Gin Met Pro Arg Ser Cys Leu Trp 
225 230 235 



<210> 348 
<211> 302 
<212> PRT 

<213> Homo sapiens 
<400> 348 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys 
15 10 15 

Gin Gin Leu Trp Gly Pro Gly Ser His Pro Ala Pro Glu Ala Cys Phe 
20 25 30 

Gin Val Val Asn Ser Ala Gly Asp Ala His Gly Asn Cys Gly Gin Asp 
35 40 J 45 

Ser Glu Gly His Phe Leu Pro Cys Ala Gly Arg Asp Ala Leu Cys Gly 
50 55 60 

Lys Leu Gin Cys Gin Gly Gly Lys Pro Ser Leu Leu Ala Pro His Met 
65 70 75 80 

Val Pro Val Asp Ser Thr Val His Leu Asp Gly Gin Glu Val Thr Cys 
85 90 95 

Arg Gly Ala Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu 
100 105 110 

Gly Leu Val Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin 
115 120 125 

Ser Arg Arg Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu 
130 135 140 

Thr Ala Cys His Ser His Gly Val Cys Asn Ser Asn His Asn Cys His 
145 150 155 160 

Cys Ala Pro Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly 
165 170 175 

Gly Ser Met Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe 
180 185 190 

Leu Leu Ala Met Leu Leu Ser Val Leu Leu Pro Leu Leu Pro Gly Ala 
195 200 205 

Gly Leu Ala Trp Cys Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg 
210 215 220 

Cys Ser Trp Gly Cys Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp 
225 230 235 240 
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Gly Pro His Arg Asp His Pro Leu Gly Gly Val His Pro Met Glu Leu 
245 250 255 

Gly Pro Thr Ala Thr Gly Gin Pro Trp Pro Leu Asp Pro Glu Asn Ser 
260 265 270 

His Glu Pro Ser Ser His Pro Glu Lys Pro Leu Pro Ala Val Ser Pro 
275 280 285 

Asp Pro Gin Asp Gin Val Gin Met Pro Arg Ser Cys Leu Trp 
290 295 300 



<210> 349 
<211> 235 
<212> PRT 

<213> Homo sapiens 
<400> 349 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys' 
15 10 15 

Gin Gin Leu Trp Gly Pro Asp Gly Gin Glu Val Thr Cys Arg Gly Ala 
20 25 30 

Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu Gly Leu Val 
35 40 45 

Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin Ser Arg Arg 
50 55 60 

Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu Thr Ala Cys 
65 70 75 ' 80 

His Ser His Gly Val Cys Asn Ser Asn His Asn Cys His Cys Ala Pro 
85 90 95 

Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly Gly Ser Met 
100 105 110 

Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe Leu Leu Ala 
115 120 125 

Met Leu Leu Ser Val Leu Leu Pro Leu Leu Pro Gly Ala Gly Leu Ala 
130 135 140 

Trp Cys Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg Cys Ser Trp 
145 150 155 160 

Gly Cys Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp Gly Pro His 
165 170 " 175 

Arg Asp His Pro Leu Gly Gly Val His Pro Met Glu Leu Gly Pro Thr 
180 185 190 



158/189 



WO 02/083077 



PCT7US02/12063 



Ala Thr Gly Gin Pro Trp Pro Leu Asp Pro Glu Asn Ser His Glu Pro 
195 200 205 

Ser Ser His Pro Glu Lys Pro Leu Pro Ala Val Ser Pro Asp Pro Gin 
210 215 220 

Asp Gin Val Gin Met Pro Arg Ser Cys Leu Trp 
225 230 235 



<210> 350 
<211> 339 
<212> DNA 
<213> Homo sapiens 

<400> 350 

cgggcacggg tcggccgcaa tocagcctgg 
aggccgagga gctcacagct atgggctgga 
tgctgctgct actactgctg ctgctctggc 
atatccctgg gcagccagtc accccgcact 
tcagcctgga ggagccggtc tcgaagccag 
gccaggagct cctgcttgag ctggagaaga 



gcggagccgg agttgcgagc cgctgcctag 60 
ggccccggag agctcggggg accccgttgc 12 0 
cagtgccagg cgccggggtg cttcaaggac 18 0 
gggtcctgga tggacaaccc tggcgcaccg 240 
acatggggct ggtggccctg gaggctgaag 3 00 
accacaggc 33 9 



<210> 351 
<211> 225 
<212> DNA 
<213> Homo sapiens 

<400> 351 

cgggcacggg tcggccgcaa tccagcctgg 
aggccgagga gctcacagct atgggctgga 
tgctgctgct actactgctg ctgctctggc 
atatccctgg gcagccagtc accccgcact 



gcggagccgg agttgcgagc cgctgcctag 60 

ggccccggag agctcggggg accccgttgc 12 0 

cagtgccagg cgccggggtg cttcaaggac 180 

gggtcctgga tggac 225 



<210> 352 
<211> 562 
<212> DNA 

<213> Homo sapiens 
<400> 352 

gcctagaggc cgaggagctc acagctatgg 
cgttgctgct gctgctacta ctgctgctgc 
aaggacatat ccctgggcag ccagtcaccc 
gcaccgtcag cctggaggag ccggtctcga 
ctgaaggcca ggagctcctg cttgagctgg 
acatagaaac ccactacggc ccagatgggc 
tgagatgctt ccatgggctc tgggatgcac 
gagtaagggg cttccccgac tcctgggtag 
tgatcaccct cagcaggaat gccagctatt 
aggacttctc aacccacgag at 



gctggaggcc ccggagagct cgggggaccc 60 
tctggccagt gccaggcgcc ggggtgcttc 120 
cgcactgggt cctggatgga caaccctggc 180 
agccagacat ggggctggtg gccctggagg 240 
agaagaacca caggctgctg gccccaggat 3 00 
agccagtggt gctggccccc aaccacacgg 3 60 
cgccagagga tcattgccac taccaagggc 420 
tcctctgcac ctgctctggg atgagtggcc 480 
atctgcgtcc ctggccaccc cggggctcca 540 

562 
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<210> 353 
<211> 362 
<212> DNA 
<213> Homo sapiens 

<400> 353 

gaggccgagg agctcacagc tatgggctgg 
ctgctgctgc tactactgct gctgctctgg 
catatccctg ggcagccagt caccccgcac 
gtcagcctgg aggagccggt ctcgaagcca 
ggccaggagc tcctgcttga gctggagaag 
gccagctatt atctgcgtcc ctggccaccc 
at 



aggccccgga gagctcgggg gaccccgttg 6 0 
ccagtgccag gcgccggggt gcttcaagga 12 0 
tgggtcctgg atggacaacc ctggcgcacc 18 0 
gacatggggc tggtggccct ggaggctgaa 2 40 
aaccatggcc tgatcaccct cagcaggaat 3 00 
cggggctcca aggacttctc aacccacgag 3 60 

362 



<210> 354 
<211> 518 
<212> DNA 

<213> Homo sapiens 
<400> 354 

gaggccgagg agctcacagc tatgggctgg 
ctgctgctgc tactactgct gctgctctgg 
catatccctg ggcagccagt caccccgcac 
gtcagcctgg aggagccggt ctcgaagcca 
ggccaggagc tcctgcttga gctggagaag 
gaaacccact acggcccaga tgggcagcca 
tgccactacc aagggcgagt aaggggcttc 
tctgggatga gtggcctgat caccctcagc 
ccaccccggg gctccaagga cttctcaacc 



aggccccgga gagctcgggg gaccccgttg 6 0 
ccagtgccag gcgccggggt gcttcaagga 12 0 
tgggtcctgg atggacaacc ctggcgcacc 180 
gacatggggc tggtggccct ggaggctgaa 240 
aaccacaggc tgctggcccc aggatacata 3 00 
gtggtgctgg cccccaacca cacggatcat 3 60 
cccgactcct gggtagtcct ctgcacctgc 420 
aggaatgcca gctattatct gcgtccctgg 48 0 
cacgagat 518 



<210> 355 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<400> 355 

ctggccccag gatacataga aacccactac 
cccaaccaca cggatcattg ccactaccaa 
gtagtcctct gcacctgctc tgggatgagt 
tattatctgc gtccctggcc accccggggc 
cggatggagc agctgctcac ctggaaagga 
gogggcatga ccagccttcc tggtggtccc 
acccggaagt acctggaact gtacattgtg 
cgaaacttga accacaccaa acagcgtctc 
ctcaggactc tggacattca ggtggc 



ggcccagatg ggcagccagt ggtgctggcc 60 
gggcgagtaa ggggcttccc cgactcctgg 120 
ggcctgatca ccctcagcag gaatgccagc 18 0 
tccaaggact tctcaaccca cgagatcttt 240 
acctgtggcc acagggatcc tgggaacaaa 3 00 
cagagcaggg gcaggcgaaa agcgcgcagg 360 
gcagaccaca ccctgttctt gactcggcac 420 
ctggaagtcg ccaactacgt ggaccagctt 4 80 

506 



<210> 356 
<211> 503 
<212> DNA 
<213> Homo sapiens 

<400> 356 
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cagtggctac tgctgggatg gcgcatgtcc 
ggggcctggc tcccacccag ctcccgaggc 
tgctcatgga aactgcggcc aggacagcga 
tgccctgtgt gggaagctgc agtgccaggg 
ggtgccagtg gactctaccg ttcacctaga 
ggcactcccc agtgcccagc tggacctgct 
gtgtggacct agaatggttt gcaatagcaa 
tccacccttc tgtgacaagc caggctttgg 
tgaaaaccat gacaccttcc tgc 



cacgctggag cagcagtgcc agcagctctg 60 
ctgtttccag gtggtgaact ctgcgggaga 12 0 
gggccacttc ctgccctgtg cagggaggga 180 
tggaaagccc agcctgctcg caccgcacat 240 
tggccaggaa gtgacttgtc ggggagcctt 3 00 
tggcctgggc ctggtagagc caggcaccca 3 60 
ccataactgc cactgtgctc caggctgggc 42 0 
tggcagcatg gacagtggcc ctgtgcaggc 4 80 

503 



<210> 357 
<211> 581 
<212> DMA. 
<213> Homo sapiens 

<400> 357 

cagtggctac tgctgggatg gcgcatgtcc 
ggggcctggc tcccacccag ctcccgaggc 
tgctcatgga aactgcggcc aggacagcga 
tgccctgtgt gggaagctgc agtgccaggg 
ggtgccagtg gactctaccg ttcacctaga 
ggcactcccc agtgcccagc tggacctgct 
gtgtggacct agaatggtgt gccagagcag 
tcagcgctgc ctgactgcct gccacagcca 
ctgtgctcca ggctgggctc cacccttctg 
cagtggccct gtgcaggctg aaaaccatga 



cacgctggag cagcagtgcc agcagctctg 6 0 
ctgtttccag gtggtgaact ctgcgggaga 12 0 
gggccacttc ctgccctgtg cagggaggga 180 
tggaaagccc agcctgctcg caccgcacat 240 
tggccaggaa gtgacttgtc ggggagcctt 300 
tggcctgggc ctggtagagc caggcaccca 3 60 
gcgctgcagg aagaatgcct tccaggagct 42 0 
cggggtttgc aatagcaacc ataactgcca 4 80 
tgacaagcca ggctttggtg gcagcatgga 540 
caccttcctg c 581 



<210> 358 
<211> 380 
<212> DNA 
<213> Homo sapiens 

<400> 358 

cagtggctac tgctgggatg gcgcatgtcc cacgctggag cagcagtgcc agcagctctg 60 
ggggcctgat ggccaggaag tgacttgtcg gggagccttg gcactcccca gtgcccagct 12 0 
ggacctgctt ggcctgggcc tggtagagcc aggcacccag tgtggaccta gaatggtgtg 180 
ccagagcagg cgctgcagga agaatgcctt ccaggagctt cagcgctgcc tgactgcctg 2 40 
ccacagccac ggggtttgca atagcaacca taactgccac tgtgctccag gctgggctcc 3 00 
acccttctgt gacaagccag gctttggtgg cagcatggac agtggccctg tgcaggctga 3 60 
aaaccatgac accttcctgc 380 



<210> 359 
<211> 324 
<212> DNA 

<213> Homo sapiens 
<400> 359 

ggcctggtgt tgctaccgac tcccaggagc ccatctgcag cgatgcagct ggggctgcag 60 
aagggaccct gcgtgcagtg gccccaaaga tggcccacac agggaccacc ccctgggcgg 120 
cgttcacccc atggagttgg gccccacagc cactggacag ccctggcccc tggaccctga 180 
gaactctcat gagcccagca gccaccctga gaagcctctg ccagcagtct cgcctgaccc 24 0 
ccaagcagat caagtccaga tgccaagatc ctgcctctgg tgagaggtag ctcctaaaat 3 00 
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gaacagattt aaagacaggt ggcc 324 

<210> 360 
<211> 753 
<212> DMA 
<213> Homo sapiens 

<400> 360 

cagtggctac tgctgggatg gcgcatgtcc cacgctggag cagcagtgcc agcagctctg 60 

ggggcctgat ggccaggaag tgacttgtcg gggagccttg gcactcccca gtgcccagct 120 

ggacctgctt ggcctgggcc tggtagagcc aggcacccag tgtggaccta gaatggtgtg 18 0 

ccagagcagg cgctgcagga agaatgcctt ccaggagctt cagcgctgcc tgactgcctg 24 0 

ccacagccac ggggtttgca atagcaacca taactgccac tgtgctccag gctgggctcc 3 00 

acccttctgt gacaagccag gctttggtgg cagcatggac agtggccctg tgcaggctga 3 60 

aaaccatgac accttcctgc tggccatgct cctcagcgtc ctgctgcctc tgctcccagg 42 0 

ggccggcctg gcctggtgtt gctaccgact cccaggagcc catctgcagc gatgcagctg 4 80 

gggctgcaga agggaccctg cgtgcagtgg ccccaaagat ggcccacaca gggaccaccc 54 0 

cctgggcggc gttcacccca tggagttggg ccccacagcc actggacagc cctggcccct 60 0 

ggaccctgag aactctcatg agcccagcag ccaccctgag aagcctctgc cagcagtctc 660 

gcctgacccc caagcagatc aagtccagat gccaagatcc tgcctctggt gagaggtagc 720 

tcctaaaatg aacagattta aagacaggtg gcc 753 

<210> 361 
<211> 1154 
<212> DNA 

<213> Homo sapiens 
<400> 361 

cagtggctac tgctgggatg gcgcatgtcc cacgctggag cagcagtgcc agcagctctg 60 

ggggcctggc tcccacccag ctcccgaggc ctgtttccag gtggtgaact ctgcgggaga 120 

tgctcatgga aactgcggcc aggacagcga gggccacttc ctgccctgtg cagggaggga 18 0 

tgccctgtgt gggaagctgc agtgccaggg tggaaagccc agcctgctcg caccgcacat 240 

ggtgccagtg gactctaccg ttcacctaga tggccaggaa gtgacttgtc ggggagcctt 300 

ggcactcccc agtgcccagc tggacctgct tggcctgggc ctggtagagc caggcaccca 3 60 

gtgtggacct agaatggtgt gccagagcag gcgctgcagg aagaatgcct tccaggagct 42 0 

tcagcgctgc ctgactgcct gccacagcca cggggtttgc aatagcaacc ataactgcca 48 0 

ctgtgctcca ggctgggctc cacccttctg tgacaagcca ggctttggtg gcagcatgga 54 0 

cagtggccct gtgcaggctg aaaaccatga caccttcctg ctggccatgc tcctcagcgt 60 0 

cctgctgcct ctgctcccag gggccggcct ggcctggtgt tgctaccgac tcccaggagc 660 

ccatctgcag cgatgcagct ggggctgcag aagggaccct gcgtgcagtg gccccaaaga 72 0 

tggcccacac agggaccacc ccctgggcgg cgttcacccc atggagttgg gccccacagc 780 

cactggacag ccctggcccc tggaccctga gaactctcat gagcccagca gccaccctga 840 

gaagcctctg ccagcagtct cgcctgaccc ccaagatcaa gtccagatgc caagatcctg 900 

cctctggtga gaggtagctc ctaaaatgaa cagatttaaa gacaggtggc cactgacagc 960 

cactccagga acttgaactg caggggcaga gccagtgaat caccggacct ccagcacctg 102 0 

caggcagctt ggaagtttct tccccgagtg gagcttcgac ccacccactc caggaaccca 108 0 

gagccacatt agaagttcct gagggctgga ■ gaacactgct gggcacactc tccagctcaa 1140 

taaaccatca gtcc 1154 

<210> 362 
<211> 953 
<212> DNA 
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<2l3> Homo sapiens 
<400> 362 

cagtggctac tgctgggatg gcgcatgtcc cacgctggag cagcagtgcc agcagctctg 60 
ggggcctgat ggccaggaag tgacttgtcg gggagccttg gcactcccca gtgcccagct 12 0 
ggacctgctt ggcctgggcc tggtagagcc aggcacccag tgtggaccta gaatggtgtg 180 
ccagagcagg cgctgcagga agaatgcctt ccaggagctt cagcgctgcc tgactgcctg 240 
ccacagccac ggggtttgca atagcaacca taactgccac tgtgctccag gctgggctcc 3 00 
acccttctgt gacaagccag gctttggtgg cagcatggac agtggccctg tgcaggctga 3 60 
aaaccatgac accttcctgc tggccatgct cctcagcgtc ctgctgcctc tgctcccagg 42 0 
ggccggcctg gcctggtgtt gctaccgact cccaggagcc catctgcagc gatgcagctg 480 
gggctgcaga agggaccctg cgtgcagtgg ccccaaagat ggcccacaca gggaccaccc 54 0 
cctgggcggc gttcacccca tggagttggg ccccacagcc actggacagc cctggcccct 600 
ggaccctgag aactctcatg agcccagcag ccaccctgag aagcctctgc cagcagtctc 660 
gcctgacccc caagatcaag tccagatgcc aagatcctgc ctctggtgag aggtagctcc 72 0 
taaaatgaac agatttaaag acaggtggcc actgacagcc actccaggaa cttgaactgc 78 0 
aggggcagag ccagtgaatc accggacctc cagcacctgc aggcagcttg gaagtttctt 840 
ccccgagtgg agcttcgacc cacccactcc aggaacccag agccacatta gaagttcctg 90 0 
agggctggag aacactgctg ggcacactct ccagctcaat aaaccatcag tec 953 



<210> 363 
<211> 812 
<212> PRT 

<213> Homo sapiens 
<400> 363 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg Leu Leu Ala Pro Gly Tyr lie Glu Thr His 
85 90 95 

Tyr Gly Pro Asp Gly Gin Pro Val Val Leu Ala Pro Asn His Thr Asp 
100 105 110 

His Cys His Tyr Gin Gly Arg Val Arg Gly Phe Pro Asp Ser Trp Val 
115 120 125 

Val Leu Cys Thr Cys Ser Gly Met Ser Gly Leu lie Thr Leu Ser Arg 
130 135 140 

Asn Ala Ser Tyr Tyr Leu Arg Pro Trp Pro Pro Arg Gly Ser Lys Asp 
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145 150 155 160 

Phe Ser Thr His Glu lie Phe Arg Met Glu Gin Leu Leu Thr Trp Lys 
165 170 175 

Gly Thr Cys Gly His Arg Asp Pro Gly Asn Lys Ala Gly Met Thr Ser 
180 185 190 

Leu Pro Gly Gly Pro Gin Ser Arg Gly Arg Arg Glu Ala Arg Arg Thr 
195 200 205 

Arg Lys Tyr Leu Glu Leu Tyr lie Val Ala Asp His Thr Leu Phe Leu 
210 215 220 

Thr Arg His Arg Asn Leu Asn His Thr Lys Gin Arg Leu Leu Glu Val 
225 230 235 240 

Ala Asn Tyr Val Asp Gin Leu Leu Arg Thr Leu Asp lie Gin Val Ala 
245 250 255 

Leu Thr Gly Leu Glu Val Trp Thr Glu Arg Asp Arg Ser Arg Val Thr 
260 265 270 

Gin Asp Ala Asn Ala Thr Leu Trp Ala Phe Leu Gin Trp Arg Arg Gly 
275 280 285 

Leu Trp Ala Gin Arg Pro His Asp Ser Ala Gin Leu Leu Thr Gly Arg 
290 295 300 

Ala Phe Gin Gly Ala Thr Val Gly Leu Ala Pro Val Glu Gly Met Cys 
305 310 315 320 

Arg Ala Glu Ser Ser Gly Gly Val Ser Thr Asp His Ser Glu Leu Pro 
325 330 335 

lie Gly Ala Ala Ala Thr Met Ala His Glu lie Gly His Ser Leu Gly 
340 345 350 

Leu Ser His Asp Pro Asp Gly Cys Cys Val Glu Ala Ala Ala Glu Ser 
355 360 365 

Gly Gly Cys Val Met Ala Ala Ala Thr Gly His Pro Phe Pro Arg Val 
370 375 380 

Phe Ser Ala Cys Ser Arg Arg Gin Leu Arg Ala Phe Phe Arg Lys Gly 
385 390 395 400 

Gly Gly Ala Cys Leu Ser Asn Ala Pro Asp Pro Gly Leu Pro Val Pro 
405 410 415 

Pro Ala Leu Cys Gly Asn Gly Phe Val Glu Ala Gly Glu Glu Cys Asp 
420 425 430 

Cys Gly Pro Gly Gin Glu Cys Arg Asp Leu Cys Cys Phe Ala His Asn 
435 440 445 



164/189 



WO 02/083077 



PCT/US02/12063 



Cys Ser Leu Arg Pro Gly Ala Gin Cys Ala His Gly Asp Cys Cys Val 
450 455 460 

Arg Cys Leu Leu Lys Pro Ala Gly Ala Leu Cys Arg Gin Ala Met Gly 
465 470 475 480 

Asp Cys Asp Leu Pro Glu Phe Cys Thr Gly Thr Ser Ser His Cys Pro 
485 490 495 

Pro Asp Val Tyr Leu Leu Asp Gly Ser Pro Cys Ala Arg Gly Ser Gly 
500 505 510 

Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys Gin Gin 
515 520 525 

Leu Trp Gly Pro Gly Ser His Pro Ala Pro Glu Ala Cys Phe Gin Val 
530 535 540 

Val Asn Ser Ala Gly Asp Ala His Gly Asn Cys Gly Gin Asp Ser Glu 
545 550 555 560 

Gly His Phe Leu Pro Cys Ala Gly Arg Asp Ala Leu Cys Gly Lys Leu 
565 570 575 

Gin Cys Gin Gly Gly Lys Pro Ser Leu Leu Ala Pro His Met Val Pro 
580 585 ' 590 

Val Asp Ser Thr Val His Leu Asp Gly Gin Glu Val Thr Cys Arg Gly 
595 600 605 

Ala Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu Gly Leu 
610 615 620 

Val Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin Ser Arg 
625 630 635 640 

Arg Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu Thr Ala 
645 650 655 

Cys His Ser His Gly Val Cys Asn Ser Asn His Asn Cys His Cys Ala 
660 665 670 

Pro Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly Gly Ser 
675 680 685 

Met Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe Leu Leu 
690 695 700 

Ala Met Leu Leu Ser Val Leu Leu Pro Leu Leu Pro Gly Ala Gly Leu 
705 710 715 720 

Ala Trp Cys Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg Cys Ser 
725 730 735 
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Trp Gly Cys Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp Gly Pro 
740 745 750 

His Arg Asp His Pro Leu Gly Gly Val His Pro Met Glu Leu Gly Pro 
755 760 765 

Thr Ala Thr Gly Gin Pro Trp Pro Leu Asp Pro Glu Asn Ser His Glu 
770 775 780 

Pro Ser Ser His Pro Glu Lys Pro Leu Pro Ala Val Ser Pro Asp Pro 
785 790 795 800 

Gin Asp Gin Val Gin Met Pro Arg Ser Cys Leu Trp 
805 810 



<210> 364 
<211> 21960 
<212> DNA 
<213> Mus sp. 

<400> 364 

ggagaatagt ggggttgcag taggaggtag aaggttggga atgaattgga aaaggagtga 6 0 
ctgtgtagcc aaggctgtgc cttgctcatt ctaggccaat ttcttccttc ttgtctctcc 12 0 
ctccctccct ccctccctcc ctccctccct ccctccctcc ctccctccct ccctacctcc 180 
ctcctagcta tccttccttc cttcccttta cttctatctt ccttctttct ttccttccct 240 
tcccttctct tccccattcc ccctagacca ggtcccatgt agcccaaact gacttcaatc 3 00 
tcattacaga tgaccttgaa cttcagatcc tcagtcccca tttctttttg tgcctgaata 360 
tcgtattttc tctgggagcc tgtccagtat tttcctccaa taaatttcat tccaggatac 420 
aactggagtc aattttcttc ttgcaaccaa aatgtctagg ctagaatgtg gagaattcaa 480 
atattaaacg ctagtgtctc agacaccatc tgtgggcatg tggccaggga gcataaatcg 540 
cacagtgctt gcttcatact gtctgctgaa gtcatccggg ccccttcctg ctcctcatag 600 
ctcctcacat gataactcac ggggtgggct gctcgctcac agggactggc acaggcttgg 6 60 
cacatggcag gtggcacatg gttcttgtac atggttgaat agaagcacag aaacccagct 720 
ctaccagagg caagacaggg agagaacaaa ccaaagatgg caagatgttc caggatgacc 7 80 
gagagcaaaa gccacaccta aagggtggtg gctgctgagg gtttggggac atgtgacttg 84 0 
aatctttgct ctcaagtact tccaagaggg atgttggaga ctacttgggg tggggctcat 90 0 
tgggcccaat cttcagtagt ctaaagcccc tcagaaataa ttacttaget gtaattcaag 960 
cacaatatat acacatggta agtgtattag caaaatggct ttctatgtaa tataaataag 102 0 
ataaaaaaac aaacaaacaa aaaacaaaac cagaaatacc agcacttcag agtaccccac 108 0 
tcccaacccc tgcccagtgt agctgctaac ttcactaatc aagttaattt aaaaatcaaa 1140 
tataatacca cagattttat ttttcctgtg tttctgcttt atacaaatgg tatcatattt 120 0 
taaacatgag aacattgtga gatgactagc cagtgtctgt ctgtctgtct ctctcagata 1260 
tttatttatt tatttcatgt atgtaggtac actgtcactg tcttcagaca caccagaaga 13 2 0 
ggacattgga tcccattaca gatggttgtg agccaccatg tggttgctga gaattgaact 13 8 0 
caggacctct ggaagagtag tcagtgctct taaccactgg gccatctttc cagtcctgac 144 0 
cagtgtctct taatggtgaa atattcttcc cttggcccta gcattttatc cttgggcatt 150 0 
ttctcaggaa aaatgaaggc gcgtgtcacc ctgagatttg ttactaggat gttcataaca 1560 
gtgctaccca tcagccagga actgaaaaca ataaatgctc ggcagcaggg gaatgaccga 162 0 
gtagacagag ctacgcccac acagctatga gagctgtgcc tacagctgag tgtggttctc 168 0 
agacggtcag tgggaggggc tggcagcgat ggctgcctgt ggcagttaca tgtcctaccg 174 0 
ctctattaaa ggagctaaca aaggcaacct taaggaagga aacatttatt ttgggtcacg 18 0 0 
gttagtaggc acagtctcag agtggttgga aagatggcag caagtgtctc tagctgaggg 18 60 
ggcaggagtg ggaggccctg gtcactttgt gtcggcgagg aagcagacag agaggtgggg 192 0 
gtgccgtttg cttcttactt tcctcttcct acccagtcca ggatgacaga acatcaggta 198 0 
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tt^ccaccca catttggagt gggtcttctg 
cactgacctg atgtgcccgg aacaaggtga 
caaggacagg tatagttaag gaatgggata 
agggatgggc gggagcacaa tggagattcg 
tgaaaatctc acatttctga atctatattt 
ccacttttcc tacgccttgt gctgtgtatc 
ctgagatcag cctgagcaac acaataaaac 
tgtggaggtg cacgccttta atcccagcac 
catctcaaat tcagacccag gggctgtcta 
catagtcata caattggaaa ctacttatcg 
ataggttggg taaaactcac aaattgggaa 
tcgcctttaa acatagaggt cctttaaact 
aaatgagcta cacatgaata tgttactata 
cagcagctta tgtaagaaat tggtagtagt 
atggagacta aaggaagact ctcctaagca 
gtagtggtgg tggttgtttt tgattctttt 
cagtgtttct ttgtggagcc ctggttgtcc 
gagattctcc tgcctctggg cttaaaggtg 
tttgagacgg ggtcttgctg tgtagctcca 
tgaactacag atatctccca gtctcctctc 
attcacctgt gtgtgcatgt gtgactcact 
attttttgag atagggtctc ttactgaacc 
agtccaacac ttggcttaca agtttgtgca 
cttgaactca agaaccaacg caggtccttt 
ctcaccccac tgccacacac caggtctcca 
tttgaacttg tcttttgcct ctctgctggg 
tgggatgctg gggatggaac tcagggcccc 
tctcgctctc attctctctc tctctctctc 
ccctccttcc cttccctctc tcttccatga 
tcttgagtgt cctgtctcga tccaattgcc 
ttttggtttt ttcgagactg ggtttctctg 
gtagaccagg ctggcctcga actcagaaat 
ttaaaggcgt gcaccaccac gcctttaatc 
ttttttcttc tgccgttgga acttttacat 
tcttatttca aagttaaatc tcttgctatc 
tttcaacacg aaagcaatgt ccttaaaatt 
gtggtataga acaattctgt cctggccatc 
attgccttta ccaggaacca cagacttcag 
ccagatgcat ctcagctgca gtgggacagc 
gacagatcct gggtagaaag gagaatccag 
tgtgctaggc agacagagca gctgtgggga 
gagaggttct ctaagaacct tcccggactt 
tggataggat tgttgcagat aacgccaggg 
cctgttttgc gggccagggt agggccacac 
cgctgagggc gggaggaggc ggccccctct 
gtgagaggga aaggcactcc cagccatggg 
ggtgctgcta ttgctgccgc tgttgctgcc 
tccaggtgag gatacaggcg gcgggggggg 
tggtggtggt catgggttcc agtgcaacta 
ctggccagga caggatgtgg gaagggtctg 
cttgcccaca cttacttagc aaagcaaata 
agctctgtcc ggaagcttct gcacttccaa 
ggaaggacct ggggtccttg gagtcctggt 
tcgtcctctt ctcatgctgg gtctgggcag 
tggcagttct tcaaacaggg gtgtctgcct 



gtctctgtta aaactctctg gaaacaccct 2040 
ttctcagccc aagcaagtca cagcgaagac 210 0 
gaagtcaata cacagccact ggtgggcaaa 2160 
agagccaggg aagtgagcag gttcactttg 2 220 
tggaaagaaa acttaattta aaaaaatctg 2 28 0 
gacatgggag gataccttga ggccaggaga 23 40 
ctcatttctt acaaaacaga acaaattggg 2 400 
ccagggagca gagccaatca cccttgtaac 2 460 
gcggcagaag agtccatgga acactgtgac 252 0 
acaagaataa atggccatag gtaataggca 2 580 
aatacaaccc atactccaga caagggccaa 2 64 0 
gttaaaagaa ggaccaacca ctcactagaa 2 70 0 
tttaaaaaaa aaaaaaacct gtgtctaaaa 2 7 60 
ggctttctgt ggggaaggag tacccaaagg 2 82 0 
ttttatacct tctcccttgt ttgctttgtt 2880 
ttgtttggtt ggttttggtt tttttcaaga 2 940 
tggaactcac tcaggctgcc cttgaactca 3 00 0 
tacctcacca ctgtcaggct tgggtttgtt 3 06 0 
gctgcctctg aagtgctaga attacagaca 312 0 
attctccctc cctgccccct ttgtatacac 318 0 
ctgagttctt ccagtgttac tctccatcag 324 0 
tggacctcaa acatttgcct gtctctgtcc 33 00 
atggttctgg ctttctctga ttgctggaga 33 60 
acttactaag ccttctcctt gatgacaccc 3420 
ggctgatctc acactttttc catataggac 3480 
atacaggtgt gtgctgctac acccaattta 3 540 
aatcttgtat caactgaggt gcgctcgctc 3 600 
tctctctctc tctctctctc tctctctccc 3660 
gacttctttc tatattgcac aggctagcta 3720 
tgacctatca tactggtttt ctgtttttgg 3 780 
tgtagccctg gctgtcctgg aactcacttt 3 840 
ccgcctgcct ctgcctcccg agtgctggga 3 900 
ccagcatttg atgttcgaat cgtataaatg 3 960 
tccatattct cagaacgaaa ttattaatca 4 020 
cgaaatcata ccaagtgtac gaagtgatct 4 080 
tgggaaaggg tccctcatgc ctgggggtct 4140 
gctgttcctc tgccctgtgc ccaccactgc 42 0 0 
caacgcggcc accacaagac ccctaggtac 4260 
ttcactgctg tgggacacta ggtgcaggta 43 2 0 
atgtcactag ggagactgcg accagagatg 43 8 0 
tcacttagct agtgcctcta ttcagcacca 4440 
tccagcccat cagccatccc ttggccagcg 4500 
agggtcatcc tggctcccgc agcgcgacat 4560 
tcctgcgctc ctgtgaggtg gcgcgaggcc 4 62 0 
agccggggtg gagccaacag cacagcaaga 4 680 
ctcgaggtgc gggagacccg gggggtctcc 4 74 0 
ctcgtgtccg ctgcggagcg ctcggatgtt 4 8 00 
999999999° ggctctaggg cagggtgaag 4 860 
cagaacatag agttcccgac cctcggctac 4 920 
tgctcctggt aagggtggcc ctggagcaga 498 0 
ggcaacacta ccctttcctc agtgctgccc 5040 
actgccaggc cagagtgtct ccctgctcct 5100 
ctgcccacag gcctagctgt ctcattcggc 5160 
tgtgtatggt gtttatacag ttgccttatg 5220 
agccgtgcct ggggctacca ctttctcttc 52 80 
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ctactgtaac ctctccttac tccgggacga 
gtgaatctcc aaactccaca tagcaagcac 
gggaggaggc agtaagggct gagggctgga 
tgttgactgc tgcttccgta aggcttctga 
gcctgcggat agttaatggt gaggagagag 
tcctcagttt ccactttagt ccaccggact 
gctgcacaga gaacactcac acctgaatcc 
gcaagacttg gttctccata ctcagttaag 
tcaggaaggc ttctctgtcc ttccttctcc 
tccaaagcca cacttgcaca tccccacttg 
gactttaagg acgaggttcc tggagctgct 
cgttttgcct gcacatgcat ctgtgtaccg 
agagagaggc ggatcccctg gagctggggt 
ctgagaactg aacccggtcc tcaggaagaa 
ctccagctcc cattccagaa tcttgatgtc 
ctgggatgac ccaggaaggg ctgagccaac 
ggagtggctg gggtttctat gtttttttcc 
agatgcagcg gcagagggat aagtgggctt 
caggataggt cctgtgagtg tttctgggac 
ctgaagaaca ttccaacacc gtctgagtaa 
cttggaaacg cctggcccag gatcactgag 
gccaggtccg tcagagtccc aacaccagca 
acagacattt ggtggctgtc ttactcagat 
ccataaaggc atctactacc cagtctaaaa 
cctctgcgct ggggtctggg cttcttcccg 
ctctggccgc ttcaaattcc ccaatgggga 
cacctccctg gttgcttctc catttagggg 
ctacacatgg agtccagctt tcaggacccc 
gcaattttaa cactattctc tgctttcacc 
ccccactgga tcctggaggg cagactctgg 
atccgttatc ctgtccttat taggggccct 
atgcagcgtg acagtttttc tggggttctc 
ccctgttggg tgggtgggac aaagagacat 
tcattttggg cttcctccct cccaccccct 
gggggaacat tcaggccatg cccactcaac 
taaagccttt caggagagct ctgtgtgtgt 
gtgtgtgtgt attctgttct gttttgtttt 
ggggggcttc aagtcatgag ccatcatgcc 
caaaatgtcc tctttggccg tgtttatggg 
cctgtaacct cagcttccat ggctggaagc 
tcctcctgtc cccagatgta ttctcactct 
cctcctaact tctgctcatt gtgttccccc 
ctgcccatgg ggaggccggc actgatccgc 
agagcttgta aagttgctca tgcttgaata 
tgggacctaa gggcttcaga tgattggctg 
tggagagagt tcaattctgt acctcctgac 
tgttgcaaca gtgaggatgc attcccagaa 
ggggtccccc agacttctag gacttctctc 
cgggttcccc tgatggtggc ctctctcctt 
gggactctgc tctgcgttag gaagcctgtg 
gtgtgtattt cacaaacagg gttgtactgc 
agaccctgga gcagaagtaa aggcagcaag 
gctactgcct ttctcccaga gcagacattt 
taagctcatg tttgggccct tgtggagcct 
acacctgctg cagaagggcc ctgaaagatt 



cagggctctt agcatctcca agcctgtagg 534 0 
taggctggcc agaaggcaga aggaccaccg 54 0 0 
tattgggcag tctggaaacc accccatatc 546 0 
gggggctgac tgacagctgt attgccttta 5520 
agcctctgtg tttcctaatg cagactggct 5580 
cctagcttgc ctccttcctt gtcttgcagg 5640 
ttatcctgca cccttggtaa tccgaggcca 5700 
ccttctcttg gcggagagat ctttaacctc 5760 
tcctctccct tctctctctt cagcctccat 5820 
cacatggctc actaattacg aagcacatgt 5880 
gctatttcag attttgaaaa tgtatatgag 5940 
cggaggtacc aggaaaacct gggggccgga 6 0 00 
tacagatagt tatgaggcac catgtgggtg 6 0 60 
cagcgagtgc tctcaatccc cgatccacct 6120 
tgggatctag ggaggcggtg gaacaaggag 618 0 
agagctctcc actcaggcaa ggccagagct 624 0 
tttttttttc tgtaacctgt gccaactgga 6300 
gctgctatct ctgcaggagc ttggctcctg 63 6 0 
cagataggag atggggaggt aggaacattc 6420 
tgcttggctc cgattctctt cctggagccc 6480 
cctctgaacc ctagctcctt caggttctgg 6540 
tcaaccctaa tgggagtcag gagctgtggg 66 00 
ggcaaaggag gcctccaaac acctccctca 6660 
atacccaagg ctgcctcctt cttcccagcc 6720 
ggacctctga cctcaaacca gctctctaag 6780 
fcggttcctgg tgcctcggtt tctcttttgc 6840 
atgccccatg- aataccaaat cctggaagtg 6 900 
tccttccccg atggcttcct gctcctaact 6960 
ccaggaaatg cccatggaga gctagtcact 7 02 0 
ctcaaggtca ccctggagga gccggtaagt 7 080 
aggtctcccg ctctctgtcc tttcccccaa 7140 
agaaccaaat cactgtaccg ttaagccact 7200 
ctagatggcg tgaggctgca cacatggaag 7260 
tctattcctg gccctaaccc tcttctccat 7320 
taagcaaaga tgagttcata gtttggggcc 73 8 0 
gtgtgtgtgt gtgtgtctgt gtgtgtgtgt 7440 
gattgagaca gggtcttgct atgtagctct 7500 
agccacagga gatttttaaa agaaggacac 7560 
gaacatgtga ctccgtggac atgctgctgt 7 620 
cacctgggac aaggaagctg tgggccattc 7 68 0 
ggtttgcttc tccagagccc tgctcagcct 7740 
agaagagcca tgtgtgcctt cactgcttcg 78 0 0 
ctggcttaca gagaggcaag cgagccccac 7 8 60 
gttcattggg cctagaaacc tgaggctccg 7920 
cctccccttc tttcctataa ctactcaggc 7980 
agagggcagc cccactgtcc aaacctagag 8 040 
acctaaacct tccaagtcag ggccttcact 8100 
aaccaagcaa agatggcttg tggacatgca 8160 
gcctggctcg gtctgtgaga gaatcccagg 822 0 
aggccaggcc aggaagagct ttggcagggt 828 0 
aggatgggga tggtgcacaa agggggaggg 8340 
ggccgcagag ggaagggacc ttccactggg 840 0 
tcccataaag caagaggcac tccaaccata 8460 
gtggctgggg aagtgagggc catcttctct 852 0 
cttggccagg gtcccagccc agtgcatttt 858 0 
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gggataaaaa gggaaagcca tcgtgggtgg 
ctcccctgga atctcttggg ctagttccag 
gcctgagaga gaccaagtca gtgagagagg 
tctgacatct agctaagaga gtcaacactt 
acttcatgag ttcagttatg aggccagctc 
cagaactgaa ggtggaggaa aaagtttggg 
atggagagat gtgcaaccag gagctgtgga 
atgtgcaagg ccctgggttt gatctatagc 
agcatacaaa caaaacagaa gcaaggttgc 
cccacccaat gacccacatc accctcagtg 
atggggctcc tgactatggc catagcatat 
gcaacgtagc aactgctatg tccgtatatg 
acaaacacaa gcactgagaa catctctaag 
aatgcgagag ccagctgttc ctatgagctt 
tcctgggcag ccaggtggcc ttggggccta 
cttgaagcct gactcggtgc tggtggcttt 
actggagaag aagcagtgag taccagcggg 
tctggagagg atcagtgctt tc.tgggaggg 
gagtgacagg gagagtggcg gatgtacttg 
aagtacatac ctggtcatct cataccctgg 
aagagtgagc accagaacag acattgactg 
aagggaactt ccacagatgg cagaagaaaa 
ttcatctggt ctacctggaa tgcctccatc 
acctggaatg cctccatctg cgtggaaggg 
ttcccagtgg ggccgatcga agagggcatc 
gagtcaaaga acgggaggtg gtaaaggaga 
gcaagatgga ggcgtggaac tgcagactcc 
ctttcttacg gacattgcag aaggcactca 
gactgggaac aaactgcaaa caccacgcca 
tcctactaaa cggtcttgta catgtgtgca 
ccagcaaagc agcagagttg taatgtgtgt 
ttttgatgga ggacaatgtg aatttggagt 
gtcaggcatg gaggtagagg cctgtaagtc 
aagtcaggga ctgcctgagt tataaagaat 
aaatccagtc tcaaaatata aaggagtgag 
agaaggagga aaagaaaaga aagaaggaaa 
gaaggaagat aaagaaagag agagagacag 
gaaagacaga aagaaagaaa gaaaggcaga 
tcctggtggg gacagggaag aagcctgaaa 
aggtaaaggg acccttcccc gggtggctga 
gaagtacgca tcactgaggg acagatggag 
ga-gggcctgg ggtccgctca gcaaaggcag 
ggtagtatct cctggcacac acacacacac 
acctgcctgc agcaggctcc acagcaccaa 
ccacacttcc catccccttg actaacccac 
ttttgcctga ggttttctct ttgaggcaca 
cctccgctag cactcaggga ggagagctgg 
agtgactcct aaccctcccc ctttccagca 
actacaggcc agatgggcat ccggtagtgc 
atgggatctg agaacatggg gagggtgcag 
tcctaaggag gttgtagtct gccttcaggc 
ctcctactca gatgtgggta tgagcctctt 
cgtgtgaggg gcttccggga atcctgggtg 
ggcgttgggg gagagggcct ggggctgcgg 
ttgctatcca ctgcactcta gtggccttat 



ggaaaacatt taaaaaaaaa atacagcagc 8 64 0 
ttctggcttc tagccaggct aagtggacta 870 0 
agaggtggct agagggccaa ggccagccct 8760 
tttagggagc caaagttggg tttcatgttt 8 82 0 
agaggaattt cagaggatcg ggcagtttgt 8880 
ttctctcagg aatgagggaa aggcagcaag 894 0 
tgcagctggg tgatttggtg cttgcttggc 90 0 0 
acccacagca aactagcaag caagcaaaca 9 060 
aggtagtaca agagatggcc aaagctgtag 912 0 
cccactgcct cctgcttttc ctgccttttc 9180 
gtccatagca gacataacaa catatgccag 924 0 
gtggttatgc atctttaagg gtccgttgtc 93 0 0 
ttacaactcc cacacatatc cactccctgc 9360 
tctcactagg cagctctaca tcctaccggt 942 0 
gtgtgtcact gcgttccttc tcggtcagat 948 0 
agaggctgaa ggccaggatc tcctgcttga 954 0 
ggggttgctg aagtccagac aaagaccctc 960 0 
ggttgggggc tgggtgggaa gcagcagtgg 966 0 
gggctacagt ggactgaacc caactgtagg 972 0 
agcagctgga tgggcggctc cccgctggag 978 0 
ataccttaat tcaaagggaa ttcttaggca 9 84 0 
agctgcgaag gtcaaatcca atggtgacag 9900 
tgcgtggaag agaacagaga catctggtct 9960 
aacagagagg agagggctgg ggtggggggc 10 02 0 
ttagtgctaa agatcttgga tggacctaaa 10 08 0 
aacacagtgt cgaaggagtg cagagggtaa 1014 0 
accaccaacc acaccactcg ctcatgtctg 10200 
ttcattttat tctctccatg gaatcagaca 10260 
aaccacagtt cctgacagtg ccacatccct 103 2 0 
cacattgaac gttagcatgt atgagtgcag 103 8 0 
aaggatggct ggccgtacaa cagcctgggt 104 40 
caggattttc tgtaaggaag agcaattgaa 10500 
cggctcctct gggcaggaga acacagagtc 10 5 60 
gagttcaaga ccagcctagg caacttagca 10 62 0 
gaagagggaa agagaaaaag agggagaagg 10 6 80 
gaaagaaaga gaaggaagga agaaaggaag 10 740 
acagacagaa agaaagacag aagacagaaa 10 8 00 
aaagcagcta tttatttatg gttgcaatgt 108 60 
atggaaacat tactgataga gtctagctcc 10 920 
acagaagcag ctacaggcag gtggaagtcg 10980 
acatgagatc tagcagtctg aggagaactg 11040 
gagtaccctg gccctacttg gcctctctga 11100 
acacacacac acacacacac acacacacac 11160 
aggtccccag cctgtccttc tgggtgcttc 11220 
ctcaggactc gctccagacc tttgtcttct 1128 0 
gtgtctcctt tctctctgac acactgaatg 11340 
aagcaagcgg caggatgcag gtctaaatcc 1140 0 
agcttctggc cccaggatac acagaaaccc 11460 
tgtcccccaa ccacacggtg agacactttc 1152 0 
cctcaatgcc taccctgtcc tgttttgcat 1158 0 
cccacttcct ccttgtccca ccaacttgag 1164 0 
cctctctagg atcattgcca atatcacggg 1170 0 
gttctcagca cctgctctgg gatgaggtaa 117 60 
cagaggtgag cgcttcctgt tcacacatcc 1182 0 
tgtgctcagc agcaaagtca gctattatct 11880 



169/189 



WO 02/083077 



PCT/US02/12063 



gcaacctcgg actcctgggg ataccaaaga 
gcagttgttc acctggagag gggtccagag 
ggccagtctt cctcatgtcc cccagagccg 
tccttcaaag cagaggacac aggaatgcgg 
agaggctgaa gaatcaagtt tgagcagaag 
atcacggtgt gtggccaagg attcggtcaa 
aggtgaggcg agaggcgcgc aggagtccca 
acaccctggt gaggagagac ctaagaggtc 
atgaccaggg acctctctgc cttctcagtt 
gagacagcgc ctcctggagg ttgccaattg 
aaggttggga gtgaggcagt tgaaaagctc 
gtctctccct cagattctca ggactctgga 
gtggaccgag caggatctca gtcgcatcac 
cctacagtgg cgccgcgggg tgtgggccag 
gtgggtgcca tttacacaca cacacacaca 
caccccggtt gcgggatgtc cttatttcca 
cctfcaggggc cgcaccttcc agggtaccac 
ccgcgcggag agctccggag gtgtgagcac 
gaggtttggg tagggtttgt ggtcctcagt 
tccccatcgg cacagcagcc accatggccc 
atgatcccga gggctgctgc gtgcaggccg 
cagccacagg gtacgcaaag aaggtcgttg 
ctttggttac ccctcagttc cctctttagt 
tgggaaagat gctgataagg tggagtctca 
tgatccttgg tcctccttgg ggcccggact 
ccagttccta ggctcctctc tctctcgatt 
cttgagggct gtgtggctaa cctcagggcc 
gtggctccag cacccgagat ggtccccaca 
gcaccctttc ccgcgcgtct tcagcgcctg 
caaagggggc ggtccttgcc tctccaacac 
ccgctgcgga aacggcttct tggaagcagg 
cagatcgtca tcttcgtccc tggggttcaa 
tctagcttgc tccaccatca cgtgttctgt 
acccctgcfcg ctttgcccac aattgctccc 
attgctgtgc gaggtgcctg gtgaggatac 
tgggacttct agtccctttt ctgagtgtga 
ctccctcggg ctttcagcaa agatctctct 
aaagtccgcg ggcacgcctt gtcgtcctgc 
caccggcacc tccccgtatt gccccgcaga 
tgagggtcgc ggctattgcc tagacggctg 
gctatggggg cctggtgaga ccgcacgctg 
gcggttttct actgagggca agctccaccc 
tactccccct ccccccaggg tccaagccgg 
ccatggggaa ttcgcaaggg aactgtggcc 
ctcagaggtg aggtgtgatg ctgagggtct 
gggaacactc caagttgttg accaccttcc 
aactgctgtg ccagggaggg gagccgaacc 
ccacaattct cctagagggc cgcgaagtgg 
gtcaccfcgga ccagcttgac ttgggtctgg 
tggtgagccc tgcccaccca acccctcctg 
cctcactgcc cagtgggcac aatgcccata 
tacctcccag gagctggaac gttgcttgac 
aaggggtcag ggtgaccttg gaggtccttg 
ggtttgcaat agcaatcgta actgtcactg 
caagcctggc ttgggtggta gcgtggatag 



cttcccaacc cacgagatct tccggatgga 11940 
agacaagaac tcccaataca aagcaggaat 12 00 0 
ggtaaggggt acagactaga gtgggactgg 12 0 60 
gtcctggacg gggtctccct gttttaattc 1212 0 
tttaactatc ccctccccta gcccgggggg 12180 
gcagcctgcc cctcaccatt tgctcctccc 12240 
ggtacctgga actgtacata gtggctgacc 12300 
tgctgggtgg ggggactgct cagcaccgtt 123 60 
cttgcttcag catcagaact tgaaccacac 1242 0 
cgtggaccag gttggatgat tagggcgctg 124 80 
tggtgagtgc ctggttcctg gcttcagcag 12540 
tatacagttg gtgttgaccg ggctggaagt 12 6 00 
tcaggacgca aacgaaacgc tctgggcttt 12 66 0 
gagaccacac gactccacac aachgctcac 12 72 0 
cacacacaca cacacacaca cacacacaca 1278 0 
actccaccca gtcacgccct gctccatcafc 12840 
ggtgggcctg gcacctgtgg agggcatatg 12 90 0 
agtaagctct gctggggacc taggaggaga 12960 
gaaacccggc tcctcaggac cactcggaac 13 02 0 
acgagatagg ccacagcctg ggcctccacc 13080 
atgcagagca aggaggctgc gtcatggagg 1314 0 
ttaccggacc tagacaaagc gactgcgtat 13 2 00 
aaagataaca gcctgctaat agggcctcgg 132 60 
gaatgactgc gggactaggg tactccactc 13 32 0 
gcccgatttt agcgtcacag cctcctgtct 133 80 
gctcgcttag cttagaaaga aacagcacat 1344 0 
cagagtgcag aagtagcctg agcttaggag 13 500 
ccaccaatga catccatctt ccatctgcag 13560 
cagccgccgc cagctgcgca ccttcttccg 13 62 0 
ctcggcgccg gggctcctgg tgctgcccag 13 68 0 
agaagagtgc gactgcggtt ctggccaggt 13 740 
ggctaaccca tgctcccagc ctttcccagg 13 8 00 
tctctccttt ctgacttcag aagtgcccag 13860 
tgcgtgcggg ggctcaatgt gcccacggtg 13 92 0 
tggggagcgc cctgtacatc ttagctgggc 13980 
gtttgaccca gtgtatgggt tcacgtagca 14040 
gctttccctt aatgggtctc tggtgtagtt 14100 
tgcgactgac tgcgatctcc ccgagttctg 14160 
tgtttaccta ctggatggct caccctgcgc 1422 0 
gtgtcccacg ctggagcagc agtgccagca 142 8 0 
gtcctggtgc cctgaccaat actaaaacct 14340 
gtggaactga ggcccgagct gcccgcttct 14400 
ccccagagcc atgtttccag cagatgaact 1446 0 
aggaccacaa gggtagcttc ctgccttgtg 14520 
gcagctgtaa agtagggcgg agcatgcgga 1458 0 
acttcctccc cagggacgct ctgtgtggga 14640 
cactagtgcc gcacatagtg actatggact 1470 0 
tttgccgagg ggcctttgtg ctcccagata 14760 
tagagccagg caccggctgt ggacctagaa 1482 0 
gttattgagt cctccatgcc aaagtgttct 1488 0 
ggtgtgccag gacaggcact gtcagaatgc 1494 0 
tgcctgccat aacggtgggg tgagtagqct 15000 
ctacctggtg acttttctat cctcatctta 15060 
tgctgctggc tgggctccac ccttctgtga 1512 0 
tggccctgca cagtctgcaa gtatcccaat 1518 0 
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ggggtggggg caggcaggaa ccacctgggc 
cctccacaga ccgagatgcc ttccccttgg 
tcccttgggc ttgcctagcc tggtgctact 
gactgtgctg caggagggac cccctatgga 
cctgagcctg cccccatggc tcctgcttct 
ggctacaggg agctggggct ttaggaagct 
agagctccca ggctgtggga agaagtccgt 
gtggtgtggt cctttccatg ggcttccctt 
ttggtttttt tctgctttaa agaaattcaa 
cgactctagc ttccttgtct caatatcagc 
ccttcaggtc cctcggtgct agccaaggac 
atagtgttgc tctttctgtg ccaggactct 
gtaaacccaa ttttctggca ggcagaaatg 
gcagcctcct cactaggccc ctcgggcaag 
gaagtgttct agaccaattg agggaagggc 
acctcaggcc agaaacagca ccagcaggcc 
ccaggtagtg cctggggggt gggggcggtg 
gttgggaggg ggcacttcag aggggtggca 
aggacagtgg gaatatctgg atgaaccatc 
cccggcagtt cagagtctag actggcagag 
ttcccaccaa gggagacatc tctgcacttc 
atctagactg agtgggggct ggagtggagt 
ggaggccctg ggacataagt gaggtctgga 
tcgaagcttg agtggaatgg agaagcaatc 
accttctgaa ctcttatctg gtctgtcact 
gtgcatccgg tggagtttgg ctccatcatc 
gccccttggg acatggagag ggaagcaagg 
atgctctcct taacacacct tcaatcctct 
acagcgttcg caccctccat ctcttgactt 
ctaagaacta ccctctgaag cagcctggtc 
ctatggtatg gaagcaccct gaggacctcc 
ttgttgtccc ctcctcagat ttacaggctt 
agagaagctg ttgtcataga gaccatgatg 
actgtggttc ttcttgggtc cttatagagg 
cagtcctcag tttctcagac gcactcttat 
ctcagcatag gaaccccaga gcccgatcat 
tattctgaaa tgttagaatg tatctaataa 
tggctgtgac ctgttaaaag ggtctaagtt 
tgagaaagaa attagaaaca acatctggac 
gcagacagat ctctgagttc aaggccagcc 
gggccacaca gaggaaaccc tgttttgaaa 
agtagcacct tgtacagaca gagagaaagg 
aggtggcaag gcagagtctc aaagcccagg 
caaagggcgc gtgggacagg cagggagtac 
ccctcacccc tggatgtctg ttagcaaggg 
tgtgagaggg gaaaggctgc tgcagacatg 
cactttaggg cctgctctct cccctgggtg 
gctgtgagag agaggttgct gcaagtgtgt 
atcatgattt ccctcgacat ctagtcatta 
actaccaata ttaactaata atgtgtataa 
catagatgtt agtacatata aatattatat 
aattttattt tatcatttga tattatttta 
gtgtaatata aaatataatt ttattattta 
ttatacctat atatttagta catatacata 
gccatcaagg agaagctaaa agccagcaag 



agtagcctgc ttgagactca gcacccctgc 1524 0 
ccatgctcct cagcttcctg ctgcctctgc 15300 
accagctccc aacattcttg tcatcgaagg 15360 
ataggtaggt tcggtgctca ggtctctctt 1542 0 
cagaactctt cagggctttg tagagtgaga 1548 0 
agatgggatc cttattctct agatgtagtg 1554 0 
ggtgtgtatc actgccctga ccagcacttg 15600 
gatttttttg ttttatgttt tttgttgttg 15660 
cacagcctgt gctccgcttt gtcctgaaga 1572 0 
catctcccat ggcctttgcc ctaattactc 15780 
ccctttgtgc tccctataaa ctggccagct 15840 
gtgcctctgt cccctgtaga tcacagtgct 15900 
ccacctgata gcacacatta tctaccgagg 15960 
tcaggcacat tatgtccctg tctgaacttg 16020 
agggttgggt tggagatgtg actagagggc 16 080 
ccaggagcca gtgagacggt ctggggaaag 16140 
tctgcagaca gggaaacagg tggagtgaca 162 0 0 
gctgcacacc gttatcggga tagggtgtca 16260 
caaagaaatg gcaaggtctg tgagaaaggt 16320 
caeggctaaa cggagcatgg gaaggcacag 16380 
agetctaggt gggccctcgg tgacgcctac 16440 
cacctcggga gaaaggatgt ccacagccct 16500 
catcttaagg acagaacagg aatgtggaat 16560 
cccctttgtc tcatacacgg tcactttcca 1662 0 
ggcecctgca gagacatacc cctgggcagt 16680 
actggagagc cctcgccccc tcgtaagtgt 16740 
Ogtgggtgct tgccatctgc ctccttctaa 16800 
gccctgctca- gccccatgga cctcttgcca 16860 
gctctcagac cctgcgaact ctgagcttac 16920 
tacagattga gttccagacc tgccctatcc 1698 0 
tgttgcccag tcacctacct ctgtctcagt 17040 
gcatcaataa agaaatgaga catgggcctc 1710 0 
ctggaagccc taggggcagg gaagggagac 17160 
gaggacaaat gtgccctgcc atgtgacttg 1722 0 
aattcctatg ggctgtatgc tgagctctta 17280 
gttgtatccc gcctgccctg agagctgtgc 17340 
caataaatcc acaagttata tcagtgttgt 1740 0 
gtttattaaa aagatatgga gatggattac 17460 
agtggaggag ccagcactgg ggaggaaagg 17520 
tggtctacag attgagttcc aggatagcta 17580 
accaaacagt caaataataa aaacaaactt 17640 
ttcccaggag agctcagacc ccagcaccag 17700 
tggcaaaagt gggatctgtt agcataaacc 17760 
cctttgatgc aagctcactc tggtgagggc 1782 0 
aatcagtcag tgtctcagtc tgttaacatc 1788 0 
gcctgagcag catctggatt cgaacatttg 1794 0 
gggcactcgc cattcattgc ctttacgaca 18000 
atggctgggt tagctcgagc ccaccaggca 18 06 0 
acaaacacag gttcttttac tacaatttta 1812 0 
aatatataac acagtacaca tatataagta 1818 0 
atatatatat aattgtaatt aatattactt 1824 0 
ctgttagtta taacaatgtg cataatatat 18300 
atattattat ataaatttaa ttaatattaa 18360 
ggttacagaa tggctacaaa agtgccagga 1842 0 
tgatcttcct gagacggttc tgccatggac 184 8 0 



171/189 



WO 02/083077 



PCT/US02/12063 



tgtacaatta gtgatggatt tgcttctgta ggcaaggacg aggagatttc attttaggaa 1854 0 
agattcctgc tattaatatg cttttcctgg tattattaaa tatatataac aatcactagg 18600 
tattagccca ccgttttgaa cagaatgttc tgcagaacaa tgaagatgta ctctctfcgta 18660 
atgatgctat atagacaaat agattatttc ttttttaaaa aagaaaaaag agccgggcga 1872 0 
tggtggcaca tgcctttaat cccagcactt gggaggcaga ggcaggcaga tttctgagtt 18 78 0 
caaggccagc ctggactaca gagtgagttc caggacagcc agggctactc agagaaactc 1884 0 
tgtcttggaa aaaaaaaaag aggaagaaag aaaaaagatt tatttattta ttttatacat 1890 0 
atgagtacac catcagacac acaagaagag ggcaccagac cccattacag atggttgtga 18 96 0 
gccaccatgt ggttgctggg aattgaactc aggacctctg gaagaacagt tggtgctctt 1902 0 
aacccctgag ccatctctcc agcccaaata gatgatttct taattcttaa ggatgatcct 1908 0 
ataagaattc ctaaacttac attagtaatt attaagctct tttacaatag gacttctatt 1914 0 
aagtcttctc taatatgaaa acttcaataa gaactctgcc agtcttcaag tgtcatgagt 192 0 0 
tagttgcttc tgagatagca agtaggcatc aacaacttag agcacattct aggaggttgt 1926 0 
aaaaccatta accagtggtc ttaaaaaggg aactaacaat aggctatagg tgcaaggaca 1932 0 
gaagataaaa tattgactag gtttatcaat acaaaattta cccacaaaag ttatgttttt' 193 8 0 
gacttttcat aaaaactctt tatgaacctg tagaactggt gaaagatgac gaatgcttag 1944 0 
ccagataatt actcctaata gatatgcatg tgaatattct gtgctgtaaa cttatttatg 1950 0 
tttgaacttc cagtgaactt ttgtttaaaa aagggggggg gttgaaaaag ccatgtgatc 1956 0 
tattctccta gaaagggtac agaagactaa gaaagattac attggagatg taaccttgga 1962 0 
gagaaagctt tgggagcaag agcatagaga gcaaggccat tgtggcatca gagcaggagg 1968 0 
agagagcaag attagaagga gatgcagagt ggaataactt agaaactata aggcaacata 1974 0 
aaaaattaag agagccatat gcagaatgca gagggaaaga gaaaaaaaaa aaaaaaaaga 198 00 
agctgcaggg agagcagaag gagcaggcag gcttctcctg accatggggt agaacagggc 198 6 0 
ttttcttaat accaaggcag gcttagtctt aaggataata aagcttttct ttcttacaga 1992 0 
cttggtttta attcatttag caataaaagt gtaaaagtgt tttctttccc tatgcaataa 1998 0 
agattggagc ttatttttca gccagaatga gtgagttctc tctgcaacgg tgcttggtct 2 0040 
tttgcttcat atacacacat aagtgtgtgt gtgcgcgcat gcgtgtgtgt gtgtgtgtgt 2 010 0 
gtgtgtgtgt gtgtaagtgt gcaattatca gatggcatgg aagctgggct caattggttc 20160 
aaatggggac ttgtgagggt atatgcatga atctgtatat gaattcatgt gagcttatat 2 022 0 
atatttgctt gtgtaaaagt ttttccttct gtgagtgtga ctctcttctc ctggttcaat 2 0280 
agaggtttat tgcttcaaac ttccccctag cctgacagtc gagaggcatc tggacaagag 2 0340 
agaaaaggct ctagccatta atccttttct tagatccatt ttcttagaga actttcttag 2 0400 
gaaactgttt agagagaaca tagaaacagg ctgaaatcac ttgtcaaact gtcccctttt 2 04 60 
cttcctaagg acttctacta gcagactggg agttagagct gcacagtccc tgaggagata 2 052 0 
gaacaaaggc tgctttactg aatcccctgc tgttttaaga tgaggttcta aaggagattg 2 058 0 
cagtttctga cccccaaaag gaactcaggc aggtcagcta cagtatcaaa gtgacttaaa 2 064 0 
cttaagatag ggatatgttt tattattaaa cagctaccct aaatatctca taagatcaag 2 0700 
cttaccccgg tgacacttcc ccctctgttg cctcaagagg aaccaagcag aaagaaccgc 2 076 0 
cagggctggc tcctggcaca aatgggttaa agatgttgta gcatggggaa atgaagagat 2 082 0 
ggctcagcta ttaagagaat atcttactct tccagaggac cagtgttcaa ttcccagcaa 2 0880 
acatatcagg tgccacacca tcacttgtag ctccagctgc agatctgcta catctggcct 2 0940 
ccataggcac ccacacacag gtggcaccca caattaaaaa aataagataa atctaaaaca 210 0 0 
gcaaagttaa agcatgagct gaaactagta aagtgcttgt gtggcataga ccaagacctg 21060 
ggtttggtcc ctacctgtta gaaatagtct cagtatcaca caaaggaaca cccaagcgaa 21120 
gcaaaagctc ccagcaagac aaaactacag tcttcattga gagtgtgcac gctgaagacc 2118 0 
gagcacactg ggtgcaaaat gtacttggat tctgtttgct tgttttgttc cagacagggt 2124 0 
ttctctgcat aacagccttg gctgtccctg aaattcactg tgtagaccag gctggcctca 213 0 0 
aaccctcaga gatccgttca cccacgctta tctaggcttc agtctcaccc tgtgagatgg 21360 
cctgaaagtt gttagaaccg cgcgggatct atttctgaca gactggctgg catcttttcc 2142 0 
ttctctcagc atgagattcc tggggcgttc ccatttcagc atcaagcatg gtagcagagt 2148 0 
tggaacctga gggctgaggg ctcagactca gaccataaac tggaagcaga gagaactgga 2154 0 
gattgtggga ggctttgaaa cctcagctcc tgccccagca aataccttcc agcaaggcca 21600 
cacctcttaa acctccccaa acagggtcac caactgggga cctaatattc aaatgcccac 21660 
aaatatggga gacatgacat ccaaaccgcc aggacaggtg tatacctcca tgcttggttt 2172 0 
ccgtagtaag aaacactaaa cattagcctt tcctaataaa cactgatata aagccctgct 21780 
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attctcgatg tttttcctct gttctgctcc tccttctcca cctgcttctc tgttctctga 21840 
cctcttctgt gtcacagata gccctgccat gtccatctgc cagccatgtt ctgtctactt 21900 
gcctctctct ctgctctgga ctcttctaga tgcctctggc tgttctttct catatctaca 21960 



<210> 365 
<211> 2874 
<212> DNA 
<213> Mus sp. 



<400> 365 

aaaggcactc 

attgctgccg 

tgcccatgga 

caccctggag 

ccaggatctc 

aacccactac 

ccaatatcac 

tgggatgagt 

tcctggggat 

ctggagaggg 

tcatgtcccc 

gtacatagtg 

tgccaattgc 

ggaagtgtgg 

ggctttccta 

gctcacgggc 

ccgcgcggag 

agccaccatg 

ctgcgtgcag 

tttcccgcgc 

gggcggtcct 

cggaaacggc 

ggacccctgc 

tgattgctgt 

tgactgcgat 

cctactggat 

cacgctggag 

atgtttccag 

gggtagcttc 

aggggagccg 

gggccgcgaa 

tgacttgggt 

gcactgtcag 

tggggtttgc 

tgacaagcct 

tgccttcccc 

agcctggtgc 

ggacccccta 

catcatcact 

ccctccatct 

ctctgaagca 

agcaccctga 

cctcagattt 

gtcatagaga 



ccagccatgg 
ctgttgctgc 
gagctagtca 
gagccgatct 
ctgcttgaac 
aggccagatg 
gggcgtgtga 
ggccttattg 
accaaagact 
gtccagagag 
cagagccggg 
gctgaccaca 
gtggaccaga 
accgagcagg 
cagtggcgcc 
cgcaccttcc 
agctccggag 
gcccacgaga 
gccgatgcag 
gtcttcagcg 
tgcctctcca 
ttcttggaag 
tgctttgccc 
gcgaagtgcc 
ctccccgagt 
ggctcaccct 
cagcagtgcc 
cagatgaact 
ctgccttgtg 
aacccactag 
gtggtttgcc 
ctggtagagc 
aatgctacct 
aatagcaatc 
ggcttgggtg 
ttggccatgc 
tactaccagc 
tggaatagag 
ggagagccct 
cttgacttgc 
gcctggtcta 
ggacctcctg 
acaggcttgc 
ccatgatgct 



gctcgaggtg 
cctcgtgtcc 
ctccccactg 
tgaagcctga 
tggagaagaa 
ggcatccggt 
ggggcttccg 
tgctcagcag 
tcccaaccca 
acaagaactc 
tgaggcgaga 
ccctgaactt 
ttctcaggac 
atctcagtcg 
gcggggtgtg 
agggtaccac 
gtgtgagcac 
taggccacag 
agcaaggagg 
cctgcagccg 
acacctcggc 
caggagaaga 
acaattgctc 
tgttaaagtc 
tctgcaccgg 
gcgctgaggg 
agcagctatg 
ccatggggaa 
ctcagaggga 
tgccgcacat 
gaggggcctt 
caggcaccgg 
cccaggagct 
gtaactgtca 
gtagcgtgga 
tcctcagctt 
tcccaacatt 
acatacccct 
cgccccctcc 
tctcagaccc 
cagattgagt 
ttgcccagtc 
atcaataaag 
ggaaccccta 



cgggagaccc 
gctgcggagc 
gatcctggag 
ctcggtgctg 
gcacaagctt 
agtgctgtcc 
ggaatcctgg 
caaagtcagc 
cgagatcttc 
ccaatacaaa 
ggcgcgcagg 
gaaccacacg 
tctggatata 
catcactcag 
ggccaggaga 

ggtgggcctg 

agaccactcg 
cctgggcctc 
ctgcgtcatg 
ccgccagctg 
gccggggctc 
gtgcgactgc 
cctgcgtgcg 
cgcgggcacg 
cacctccccg 
tcgcggctat 
ggggcctggg 
ttcgcaaggg 
cgctctgtgt 
agtgactatg 
tgtgctccca 
ctgtggacct 
ggaacgttgc 
ctgtgctgct 
tagtggccct 
cctgctgcct 
ctgtcatcga 
gggcagtgtg 
cccatggacc 
tgcgaactct 
tccagacctg 
acctacctct 
aaatgagaca 
ggggcaggga 



ggggggtctc 
gctcggatgt 
ggcagactct 
gtggctttag 
ctggccccag 
cccaaccaca 
gtggttctca 
tattatctgc 
cggatggagc 
gcaggaatgg 
agtcccaggt 
agacagcgcc 
cagttggtgt 
gacgcaaacg 
ccacacgact 
gcacctgtgg 
gaactcccca 
caccatgatc 
gaggcagcca 
cgcaccttct 
ctggtgctgc 
ggttctggcc 

ggggctcaat 

ccttgtcgtc 
tattgccccg 
tgcctagacg 
tccaagccgg 
aactgtggcc 
gggaaactgc 
gactccacaa 
gatagtcacc 
agaatggtgt 
ttgactgcct 
ggctgggctc 
gcacagtctg 
ctgctccctg 
aggggactgt 
catccggtgg 
tcttgccaac 
gagcttacct 
ccctatccct 
gtctcagttt 

tgggcctcag 

agggagacac 



cggtgctgct 
ttccaggaaa 
ggctcaaggt 
aggctgaagg 
gatacacaga 
cggatcattg 
gcacctgctc 
aacctcggac 
agttgttcac 
ccagtcttcc 
acctggaact 
tcctggaggt 
tgaccgggct 
aaacgctctg 
ccacacaact 
agggcatatg 
tcggcacagc 
ccgagggctg 
cagggcaccc 
tccgcaaagg 
ccagccgctg 
agaagtgccc 
gtgcccacgg 
ctgctgcgac 
cagatgttta 
gctggtgtcc 
ccccagagcc 
aggaccacaa 
tgtgccaggg 
ttctcctaga 
tggaccagct 
gccaggacag 
gccataacgg 
cacccttctg 
caaaccgaga 
gggctggcct 
gctgcaggag 
agtttggctc 
agcgttcgca 
aagaactacc 
atggtatgga 
gttgtcccct 
agaasctgtt 
tgtggttctt 



60° 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
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cttgggtcct tatagaggga ggacaaatgt gccctgccat gtgacttgca gtcctcagtt 27 0 0 
tctcagacgc actcttataa ttcctatggg ctgtatgctg agctcttact cagcatagga 2760 
accccagagc ccgatcatgt tgtatcccsc ctgccctgag agctgtgcta ttctgaaatg 2 82 0 
ttagaatgta tctaataaca ataaatccac aagttatatc aghaaaaaaa aaaa 2 8 74 



<210> 366 
<211> 791 
<212> PRT 
<213> Mus sp. 

<400> 366 

Met Gly Ser Arg Cys Gly Arg Pro Gly Gly Ser Pro Val Leu Leu Leu 
15 10 15 

Leu Pro Leu Leu Leu Pro Ser Cys Pro Leu Arg Ser Ala Arg Met Phe 
20 25 30 

Pro Gly Asn Ala His Gly Glu Leu Val Thr Pro His Trp lie Leu Glu 
35 40 45 

Gly Arg Leu Trp Leu Lys Val Tiir Leu Glu Glu Pro lie Leu Lys Pro 
50 55 60 

Asp Ser Val Leu Val Ala Leu Glu Ala Glu Gly Gin Asp Leu Leu Leu 
65 70 75 80 

Glu Leu Glu Lys Lys His Lys Leu Leu Ala Pro Gly Tyr Thr Glu Thr 
85 90 • 95 

His Tyr Arg Pro Asp Gly His Pro Val Val Leu Ser Pro Asn His Thr 
100 105 110 

Asp His Cys Gin Tyr His Gly Arg Val Arg Gly Phe Arg Glu Ser Trp 
115 120 " ^ 125 

Val Val Leu Ser Thr Cys Ser Gly Met Ser Gly Leu lie Val Leu Ser 
130 135 140 

Ser Lys Val Ser Tyr Tyr Leu Gin Pro Arg Thr Pro Gly Asp Thr Lys 
145 150 155 160 

Asp Phe Pro Thr His Glu lie Phe Arg Met Glu Gin Leu Phe Thr Trp 
165 170 175 

Arg Gly Val Gin Arg Asp Lys Asn Ser Gin Tyr Lys Ala Gly Met Ala 
180 185 190 

Ser Leu Pro His Val Pro Gin Ser Arg Val Arg Arg Glu Ala Arg Arg 
195 200 205 

Ser Pro Arg Tyr Leu Glu Leu Tyr lie Val Ala Asp His Thr Leu Asn 
210 215 220 

Leu Asn His Thr Arg Gin Arg Leu Leu Glu Val Ala Asn Cys Val Asp 
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225" 230 235 240 

Gin lie Leu Arg Thr Leu Asp lie Gin Leu Val Leu Thr Gly Leu Glu 
245 250 255 

Val Trp Thr Glu Gin Asp Leu Ser Arg lie Thr Gin Asp Ala Asn Glu 
260 265 270 

Thr Leu Trp Ala Phe Leu Gin Trp Arg Arg Gly Val Trp Ala Arg Arg 
275 280 285 

Pro His Asp Ser Thr Gin Leu Leu Thr Gly Arg Thr Phe Gin Gly Thr 
290 295 300 

Thr Val Gly Leu Ala Pro Val Glu Gly He Cys Arg Ala Glu Ser Ser 
305 310 315 320 

Gly Gly Val Ser Thr Asp His Ser Glu Leu Pro He Gly Thr Ala Ala 
325 330 " 335 

Thr Met Ala His Glu He Gly His Ser Leu Gly Leu His His Asp Pro 
340 345 350 

Glu Gly Cys Cys Val Gin Ala Asp Ala Glu Gin Gly Gly Cys Val Met 
355 360 365 

Glu Ala Ala Thr Gly His Pro Phe Pro Arg Val Phe Ser Ala Cys Ser 
370 375 380 

Arg Arg Gin Leu Arg Thr Phe Phe Arg Lys Gly Gly Gly Pro Cys Leu 
385 390 395 400 

Ser Asn Thr Ser Ala Pro Gly Leu Leu Val Leu Pro Ser Arg Cys Gly 
405 410 415 

Asn Gly Phe Leu Glu Ala Gly Glu Glu Cys Asp Cys Gly Ser Gly Gin 
420 425 430 

Lys Cys Pro Asp Pro Cys Cys Phe Ala His Asn Cys Ser Leu Arg Ala 
435 440 445 

Gly Ala Gin Cys Ala His Gly Asp Cys Cys Ala Lys Cys Leu Leu Lys 
450 455 460 

Ser Ala Gly Thr Pro Cys Arg Pro Ala Ala Thr Asp Cys Asp Leu Pro 
465 470 475 " 480 

Glu Phe Cys Thr Gly Thr Ser Pro Tyr Cys Pro Ala Asp Val Tyr Leu 
485 490 495 

Leu Asp Gly Ser Pro Cys Ala Glu Gly Arg Gly Tyr Cys Leu Asp Gly 
500 505 510 

Trp Cys Pro Thr Leu Glu Gin Gin Cys Gin Gin Leu Trp Gly Pro Gly 
515 52 0 525 
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Ser Lys Pro Ala Pro 
53 0 

Asn Ser Gin Gly Asn 
545 

Cys Ala Gin Arg Asp 
565 

Glu Pro Asn Pro Leu 
580 

Leu Leu Glu Gly Arg 
595 

Asp Ser His Leu Asp 
610 

Gly Cys Gly Pro Arg 
625 

Thr Ser Gin Glu Leu 
645 

Val Cys Asn Ser Asn 
660 

Pro Phe Cys Asp Lys 
675 

Ala Gin Ser Ala Asn 
690 

Phe Leu Leu Pro Leu 
705 

Gin Leu Pro Thr Phe 
725 

Pro Leu Trp Asn Arg 
740 

Phe Gly Ser lie He 
755 

Ser Cys Gin Gin Arg 
770 

Pro Ala Asn Ser Glu 
785 



Glu Pro Cys Phe Gin Gin 
535 

Cys Gly Gin Asp His Lys 

550 555 

Ala Leu Cys Gly Lys Leu 
570 

Val Pro His He Val Thr 
585 

Glu Val Val Cys Arg Gly 
600 

Gin Leu Asp Leu Gly Leu 
615 

Met Val Cys Gin Asp Arg 
630 635 

Glu Arg Cys Leu Thr Ala 
650 

Arg Asn Cys His Cys Ala 
665 

Pro Gly Leu Gly Gly Ser 
680 

Arg Asp Ala' Phe Pro Leu 
695 

Leu Pro Gly Ala Gly Leu 
710 715 

Cys His Arg Arg Gly Leu 
730 

Asp He Pro Leu Gly Ser 
745 

Thr Gly Glu Pro Ser Pro 
760 

Ser His Pro Pro Ser Leu 
775 

Leu Thr 
790 



Met Asn Ser Met Gly 
540 

Gly Ser Phe Leu Pro 
560 

Leu Cys Gin Gly Gly 
575 

Met Asp Ser Thr He 
590 

Ala Phe Val Leu Pro 
605 

Val Glu Pro Gly Thr 
62 0 

His Cys Gin Asn Ala 
640 

Cys His Asn Gly Gly 
655 

Ala Gly Trp Ala Pro 
670 

Val Asp Ser Gly Pro 
685 

Ala Met Leu Leu Ser 
700 

Ala Trp Cys Tyr Tyr 
720 

Cys Cys Arg Arg Asp 
735 

Val His Pro Val Glu 
750 

Pro Pro Pro Trp Thr 
765 

Asp Leu Leu Ser Asp 
780 



<210> 367 
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<2ll> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 367 

agctggggtt gggggt 16 



<210> 368 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 368 

gccgggggct gtggg 15 



<210> 369 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 369 

tggtgcctca ctcaccc 17 



<210> 370 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 370 

ggagctggga tt gggggt 18 



<210> 371 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 371 

cgccggggac tgtgggc 17 



<210> 372 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 372 

cctggtgcct aactcaccca 2 0 



<210> 373 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 373 

caagaacctt cccagcggtt atctcctcct ctcaggagta g 41 



<210> 374 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 374 

gccctctgag accgacgggg tgggacggct cgggccggtc a 41 



<210> 375 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 375 

ccaccatctc agctccacac actttcttgc ccaggtctcg a 41 
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<2l0> 376 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 376 

caccatctca gctccacact ttttcttgcc caggtctcga a 41 



<210> 377 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 377 

tggtgcttcc catattcaca cctcccacaa ctaagccatc a 41 



<210> 378 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 378 

acaactaagc catcaccaag cctccttcct ctagccccaa g 41 



<210> 379 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 379 

caggatacat agaaacccac cacggcccag atgggcagcc a 41 



<210> 380 
<211> 41 
<212> DNA 
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<2l3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 380 

agctgctcac ctggaaagga gcctgtggcc acagggatcc t 41 



<210> 381 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Sequence 
containing SNP 

<400> 381 

ccctccaaat cagaagagac gggaattcac aggcctcgag t 41 

<210> 382 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 382 

acttccttct gggagctggg attgggggtc agggctcaag c 41 



<210> 383 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 

containing SNP 
<400> 383 

ttcctgcagt ggcgccgggg actgtgggcg cagcggcccc a 41 

<210> 384 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 384 

tggcgaggtt actcctacac tgggaggagc accgtcgggt c 



<210> 385 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Sequence 
containing SNP 

<400> 385 

ggctgctcac tattggggcc tcatcgtccc ctgtcccgct t 

<210> 386 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 386 

gccgcatcgt cccctgtccc acttgttgtg tgactttgcg c 



<210> 387 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 387 

cccctctctg ggctctgcgc atctggcggc tgtagccaag c 

<210> 388 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Sequence 
containing SNP 
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<400> 388 

cagagaagcg cgggggttgg aggactgtcc ctccatgccc a 41 

<210> 389 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence; Sequence 
containing SNP 

<400> 389 

cagccgccgc cagctgcgcg tcttcttccg caaggggggc g 41 

<210> 390 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 390 

ggttcagggt gagggtttcg tggagcttgg gagccggcct g 41 



<210> 391 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 391 

gtgagctctg cccacccgac tcctccttgc cgtttgaatc c 41 



<210> 392- 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 392 

tgctggccat gctcctcagc atcctgctgc ctctgctccc a 41 



182/189 



WO 02/083077 



<210> 393 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 393 

ctgctgcctc tgctcccagg cgccggcctg gcctggtgtt g 



<210> 394 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 394 

gaagtagctt tgaacaggag tttccagtgg cctcccagtc a 

<210> 395 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 395 

agtggcctcc cagtcaagcg tgggggtgga tccctgcccc a 

<210> 396 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 396 

gcctctgtct caccagtttt tggccctttg ccacttcctc t 



<210> 397 
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<2ll> 41 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 397 

acaaatcacc tctgtcaccc acttgaagtTT~cceaaatgct g 41 



<210> 398 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 398 

tccataccac tggtcagctg tggtgctggc tgcccctgtg c 41 



<210> 399 
<211> 41 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 399 

ggtgctggct gcccctgtgc tagggccctg ccttaaccca g 41 



<210> 400 

<211> 41 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 400 

ggaaatgaca aggccttggg agatgggatg gggacagtca a 41 



<210> 401 
<211> 41 
<212> DNA 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 401 

cctgggcggc gttcacccca cggagttggg ccccacagcc a 41 

<210> 402 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 402 

gccccacagc cactggacag tcctggcccc tgggtgagtg a 41 

<210> 403 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 403 

gccctggccc ctgggtgagt taggcaccag ggggaggtgg a 41 

<210> 404 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 404 

agggctcatg cctcctgcct tcttccagat gggcagcacc c 41 

<210> 405 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
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containing SNP 
<400> 405 

gcccctcccc agccccaggg gctcctgctg accatattca c 41 

<210> 406 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 406 

atgacctctt ggttatcatg cagaccagga tgctggaagc c 41 

<210> 407 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 407 

ctggtcctca ctgagtgagg gtgggctctc tgccacacag c 41 

<210> 408 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 408 

cactgagtga ggatgggctc cctgccacac agcttgcagc c 41 

<210> 409 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 409 
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tgcagcctgg ggccccagtc attaggggac aacatatcct c 



<210> 410 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 410 

tcctcattct cagcagatca tgtccagatg ccaagatcct g 41 



PCT7US02/12063 

41 



<210> 411 
<211> 41 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 411 

ttcttccccg agtggagctt tgacccaccc actccaggaa c 41 



<210> 412 
<211> 41 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 412 

tccaggaacc cagagccaca ctagaagttc ctgagggctg g 41 



<210> 413 
<211> 41 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 413 

actgagtcca cactcccctg gagcctggct ggcctctgca a 41 
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<210> 414 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 414 

agcctggctg gcctctgcaa gcaaacataa ttttggggac c 41 

<210> 415 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 415 

atcccagcac tttgggaagc tggggtagga ggatcaccag a 41 

<210> 416 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 416 

ggaggatcac cagaggccag gaggtccaca ccagcctggg c 41 

<210> 417 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 417 

agcaagacac cgcatctaca aaaaaatttt aaaattagct g 41 

<210> 418 
<211> 41 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 418 

ctgaggacca cacggggtgg cggttggcgg ggtggtggtt g 41 

<210> 419 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 419 

ggctggcagg ccgagcctag gtggcagcca gagccccagg c 41 

<210> 420 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence 
containing SNP 

<400> 420 

ctttgctctg tcactcctgc ttcccttggg cgttcacatt c 41 
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